Honor Debuts a New AI Agent That Can Read and Understand Your Screen

Date:

Share:


It chose a restaurant, but then couldn’t complete the process as the spot it chose required a credit card to confirm a reservation, at which point the user had to take over. You can be flexible in your query—in another example, asking it to book a “highly rated” restaurant meant it would look at reviews with high scores, though the agent doesn’t do any more research than that. It’s not cross-referencing OpenTable reviews with data from other parts of the web, especially since all of this data is processed on device and isn’t sent to the cloud.

This kind of agentic artificial intelligence is the current buzzword in the tech sphere. My colleague Will Knight recently tested an AI assistant that could browse the web and perform tasks online. Google late last year unveiled its Gemini 2 AI model trained to take actions on your behalf. It also renews the idea of a generative user interface for smartphones—at MWC 2024, we saw a few companies working on ways to interact with apps without using apps at all, instead leaning on AI assistants to generate a user interface as you issued a command.

Honor’s approach feels somewhat like what Rabbit—of the infamous Rabbit R1—is doing with Teach Mode, where you train its assistant manually to complete a task. There’s no need to access an app’s Application Programming Interface (API), which is the traditional way apps or services communicate with each other. The agent memorizes the process, allowing you to then issue the command and have it execute the task.

But Honor says its self-reliant AI execution model isn’t trained to follow strict steps—it’s capable of multimodal screen context recognition to perform tasks autonomously. Instead of having to train the assistant to learn every single part of the OpenTable app, it is capable of understanding the semantic elements of the user interface and will follow-through with a multi-step process to execute your request. Honor highlighted that this process was more cost effective: “Unlike competitors such as Apple, Samsung, and Google, which rely on external APIs—resulting in higher operational costs—Honor’s AI Agent independently manages a wide range of tasks.”

Photograph: Julian Chokkattu



Source link

━ more like this

Visionary to Watch 2025’s Game Changer – Insights Success

Visionary to Watch 2025’s Game Changer The post Visionary to Watch 2025’s Game Changer appeared first on Insights Success. Source link

You can watch Pokémon the Movie 2000 for free on YouTube right now

The official channel is continuing its with another classic: Pokémon the Movie 2000. The entire movie is available to watch now...

Leak claims the PS6 could have triple the performance as the PS5 for the same price

We're nearly five years out from the release of the original PlayStation 5 and rumors of Sony's next-gen console are starting to bubble...

BioShock 4 hits a major development snag, and a remake of the original gets put on ice

BioShock fans will have to wait even longer to find out if we're going to Rapture, Columbia or a brand new city since...

Apple reportedly has a ‘stripped-down’ AI chatbot to compete with ChatGPT in the works

Apple has fallen far behind in the competitive market of AI-powered chatbots, but it may have a plan for an in-house option that...
spot_img