Google’s new text-to-speech can switch languages on the fly

Date:

Share:


Google is enhancing Gemini’s text-to-speech (TTS). On Tuesday at Google I/O 2025, the company previewed a new TTS feature, built on native audio output, that can “converse in more expressive ways.”

Google’s Tulsee Doshi showed a quick demo of the Gemini 2.5 TTS models onstage in Mountain View. It showed an AI-powered voice that sounds more natural and less robotic, with subtler nuances.

The TTS can converse in over 24 languages, switching between them seamlessly. The demo showed the model beginning in English, shifting to Hindi, and then circling back to English. They all used the same voice, helping create the illusion that it’s the same “person.” The TTS can also switch to a whisper (which sounded a little creepy) on the fly.

The new text-to-speech is available starting today in the Gemini API. Also on Tuesday, the Gemini Live API will have a 2.5 Flash preview of native audio dialog.

Google’s annual I/O developer conference kicked off on Tuesday, May 20. Google is highlighting new and notable improvements to its Gemini AI services, including real-time translation, AI enhancements to Project Astra computer vision and more. Follow Tech Reader’s Google I/O liveblog for real-time coverage of the event. Google previewed some key pre-I/O Android 16 news during its Android Show video stream last week.



Source link

━ more like this

Your future BMW electric M3 will still sound like a real M car

The transition to electric vehicles has always had one major stumbling block for car enthusiasts: the sound. Or rather, the lack of it....

New study shows AI isn’t ready for office work

It has been nearly two years since Microsoft CEO Satya Nadella predicted that generative AI would take over knowledge work, but if you...

This is the tech that makes Volvo’s latest EV a major step forward

When it comes to EVs, Volvo hasn’t been afraid to experiment. It’s tried repurposing a platform from its internal-combustion cars (for the EX40...

Report reveals that OpenAI’s GPT-5.2 model cites Grokipedia

OpenAI may have called GPT-5.2 its "most advanced frontier model for professional work," but tests conducted by the Guardian cast doubt on its...
spot_img