Honor Debuts a New AI Agent That Can Read and Understand Your Screen

Date:

Share:


It chose a restaurant, but then couldn’t complete the process as the spot it chose required a credit card to confirm a reservation, at which point the user had to take over. You can be flexible in your query—in another example, asking it to book a “highly rated” restaurant meant it would look at reviews with high scores, though the agent doesn’t do any more research than that. It’s not cross-referencing OpenTable reviews with data from other parts of the web, especially since all of this data is processed on device and isn’t sent to the cloud.

This kind of agentic artificial intelligence is the current buzzword in the tech sphere. My colleague Will Knight recently tested an AI assistant that could browse the web and perform tasks online. Google late last year unveiled its Gemini 2 AI model trained to take actions on your behalf. It also renews the idea of a generative user interface for smartphones—at MWC 2024, we saw a few companies working on ways to interact with apps without using apps at all, instead leaning on AI assistants to generate a user interface as you issued a command.

Honor’s approach feels somewhat like what Rabbit—of the infamous Rabbit R1—is doing with Teach Mode, where you train its assistant manually to complete a task. There’s no need to access an app’s Application Programming Interface (API), which is the traditional way apps or services communicate with each other. The agent memorizes the process, allowing you to then issue the command and have it execute the task.

But Honor says its self-reliant AI execution model isn’t trained to follow strict steps—it’s capable of multimodal screen context recognition to perform tasks autonomously. Instead of having to train the assistant to learn every single part of the OpenTable app, it is capable of understanding the semantic elements of the user interface and will follow-through with a multi-step process to execute your request. Honor highlighted that this process was more cost effective: “Unlike competitors such as Apple, Samsung, and Google, which rely on external APIs—resulting in higher operational costs—Honor’s AI Agent independently manages a wide range of tasks.”

Photograph: Julian Chokkattu



Source link

━ more like this

You Asked: OLED vs QLED at distance and fixing Dolby Atmos issues

On today’s episode of You Asked: How long should your OLED TV last? Will you actually notice a difference between different TV types?...

Devils on the Moon brings the score-chasing of pinball to the Playdate

Pinball video games have been around for years — I cut my teeth on Space Cadet 3D Pinball, which was pre-loaded on Windows...

Trump tells Iran to ‘open the f***ing strait, you crazy b**tards, or you’ll be living in Hell’ – London Business News | Londonlovesbusiness.com

President Donald Trump increased tensions with Iran on Sunday by warning of imminent strikes against the country’s key infrastructure. This warning aligns with Israel...

5 dead games I still can’t stop thinking about

‘Dead game’ is a term thrown around loosely now. You’ll often hear players say it whenever a game drops a few spots in...

NASA astronauts capture first human view of Moon’s ‘dark side’ – London Business News | Londonlovesbusiness.com

NASA revealed a stunning image of the Moon taken by the Artemis II crew on Sunday, marking the first time humans have directly...
spot_img