Meta’s AI translator can interpret unwritten languages | Tech Reader

Nearly half of the world’s roughly 7,000 known languages, about four in ten, exist without an accompanying written component. These unwritten languages pose a unique problem for modern machine learning translation systems, which typically need to convert speech to written words before translating to the new language and converting the text back to speech. It is a problem that Meta has reportedly addressed with its latest open-source language AI advancement.

The work is part of Meta’s Universal Speech Translator (UST) program, which aims to develop real-time speech-to-speech translation so that people can more easily interact. As part of this project, Meta researchers looked at Hokkien, an unwritten language spoken throughout Asia’s diaspora and one of Taiwan’s official languages.

Machine learning translation systems typically require extensive labeled examples of the language, both written and spoken, to train on, which is precisely what unwritten languages like Hokkien don’t have. To get around that, “we used speech-to-unit translation (S2UT) to convert input speech to a sequence of acoustic units directly in the path previously pioneered by Meta,” CEO Mark Zuckerberg explained in a Wednesday blog post. “Then, we generated waveforms from the units. In addition, UnitY was adopted for a two-pass decoding mechanism where the first-pass decoder generates text in a related language (Mandarin), and the second-pass decoder creates units.”
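To make the data flow in that description concrete, here is a minimal Python sketch of a two-pass pipeline: a first pass that produces Mandarin text, a second pass that produces acoustic units, and a vocoder that turns units into a waveform. Every name in it (TwoPassTranslator, the toy_* functions, the list-based Waveform and Units types) is a hypothetical stand-in rather than Meta's released code, and passing the first-pass text straight into the second pass is a simplifying assumption about how the two decoders are connected.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical type aliases; real systems use tensors, not Python lists.
Waveform = List[float]
Units = List[int]


@dataclass
class TwoPassTranslator:
    """Toy wiring of a two-pass speech-to-unit pipeline (illustrative only)."""
    first_pass: Callable[[Waveform], str]           # speech -> Mandarin text
    second_pass: Callable[[Waveform, str], Units]   # speech + text -> acoustic units
    vocoder: Callable[[Units], Waveform]            # units -> output waveform

    def translate(self, speech: Waveform) -> Waveform:
        pivot_text = self.first_pass(speech)          # pass 1: related-language text
        units = self.second_pass(speech, pivot_text)  # pass 2: discrete units
        return self.vocoder(units)                    # units -> audio samples


# Stand-in components so the sketch runs end to end; they do no real modeling.
def toy_first_pass(speech: Waveform) -> str:
    return "ni hao" if speech else ""


def toy_second_pass(speech: Waveform, text: str) -> Units:
    # One fake unit per character of the pivot text.
    return [ord(ch) % 50 for ch in text]


def toy_vocoder(units: Units) -> Waveform:
    # Map each fake unit to a fake sample in [0, 1).
    return [u / 50.0 for u in units]


translator = TwoPassTranslator(toy_first_pass, toy_second_pass, toy_vocoder)
output = translator.translate([0.1, -0.2, 0.05])
```

The point of the sketch is only the ordering of stages: the target language never has to be written down, because the second pass emits units that go directly to the vocoder.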

“We leveraged Mandarin as an intermediate language to build pseudo-labels, where we first translated English (or Hokkien) speech to Mandarin text, and we then translated to Hokkien (or English) and added it to training data,” he continued. Currently, the system allows someone who speaks Hokkien to converse with someone who speaks English, albeit stiltedly. The model can only translate one full sentence at a time, but Zuckerberg is confident that the technique can eventually be applied to more languages and will improve to the point of offering real-time translation.
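The pseudo-labeling step Zuckerberg describes can be sketched in a few lines of Python. This is a rough illustration under stated assumptions, not Meta's pipeline: build_pseudo_labels, the two translation callables, the clip filename and the placeholder strings are all hypothetical, and the toy lookups stand in for real speech-to-text and text-to-text models.

```python
from typing import Callable, List, Tuple


def build_pseudo_labels(
    source_clips: List[str],
    speech_to_mandarin: Callable[[str], str],
    mandarin_to_target: Callable[[str], str],
) -> List[Tuple[str, str]]:
    """Pair each source clip with a machine-generated target via a Mandarin pivot."""
    pairs = []
    for clip in source_clips:
        mandarin_text = speech_to_mandarin(clip)        # step 1: speech -> Mandarin text
        target = mandarin_to_target(mandarin_text)      # step 2: Mandarin -> target language
        pairs.append((clip, target))                    # step 3: add to training data
    return pairs


# Toy lookups standing in for real models (hypothetical data, not real output).
TOY_SPEECH_TO_MANDARIN = {"en_clip_001.wav": "ni hao"}
TOY_MANDARIN_TO_TARGET = {"ni hao": "<Hokkien for 'ni hao'>"}


def toy_speech_to_mandarin(clip: str) -> str:
    return TOY_SPEECH_TO_MANDARIN[clip]


def toy_mandarin_to_target(text: str) -> str:
    return TOY_MANDARIN_TO_TARGET[text]


pairs = build_pseudo_labels(
    ["en_clip_001.wav"], toy_speech_to_mandarin, toy_mandarin_to_target
)
```

Swapping the two callables gives the reverse direction (Hokkien speech in, English out), which matches the "English (or Hokkien)" phrasing in the quote.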

In addition to the models and training data that Meta is already open-sourcing from this project, the company is also releasing a first-of-its-kind speech-to-speech translation benchmarking system based on a Hokkien speech corpus called Taiwanese Across Taiwan, as well as “the speech matrix, a large corpus of speech-to-speech translations mined with Meta’s innovative data mining technique called LASER,” Zuckerberg announced. This system will empower researchers to create speech-to-speech translation (S2ST) systems of their own.
