Android’s screen reader can now answer questions about images


Today is Global Accessibility Awareness Day (GAAD), and, as in years past, many tech companies are marking the occasion with the announcement of new assistive features for their ecosystems. Apple got things rolling on Tuesday, and now Google is joining the parade. To start, the company has made TalkBack, Android’s built-in screen reader, more useful. With the help of one of Google’s Gemini models, TalkBack can now answer questions about images displayed on your phone, even if they don’t have any alt text describing them.

“That means the next time a friend texts you a photo of their new guitar, you can get a description and ask follow-up questions about the make and color, or even what else is in the image,” explains Google. Gemini can see and understand the image thanks to the multimodal capabilities Google built into the model. Additionally, the Q&A functionality works across the entire screen. So, for example, if you’re doing some online shopping, you can first ask your phone to describe the color of the piece of clothing you’re interested in and then ask if it’s on sale.

Separately, Google is rolling out a new version of its Expressive Captions. First announced at the end of last year, the feature generates subtitles that attempt to capture the emotion of what’s being said. For instance, if you’re video chatting with some friends and one of them groans after you make a lame joke, your phone will not only subtitle what they said but also include “[groaning]” in the transcription. With the new version of Expressive Captions, the resulting subtitles will reflect when someone drags out the sound of their words. That means the next time you’re watching a live soccer match and the announcer yells “goallllllll,” their excitement will be properly transcribed. Plus, there will now be more labels for sounds, such as someone clearing their throat.

The new version of Expressive Captions is rolling out to English-speaking users in the US, UK, Canada and Australia running Android 15 and above on their phones.


