This robot learned to lip sync like humans by watching YouTube

Date:

Share:


Researchers at Columbia Engineering have trained a human-like robot named Emo to lip-sync speech and songs by studying online videos, showing how machines can now learn complex human behaviour simply by observing it.

Emo is not a full humanoid body but a highly realistic robotic face built to explore how humans communicate. The face is covered with silicone skin and driven by 26 independently controlled facial motors that move the lips, jaw, and cheeks.

These motors allow Emo to form detailed mouth shapes that cover 24 consonants and 16 vowels, which is critical for natural speech and singing. The goal was to reduce the uncanny valley effect, where robots look almost human but still feel unsettling because their facial movements do not match their voice.

How Emo learned to lip sync like a human

The learning process happened in stages. First, Emo explored its own face by moving its motors while watching itself in a mirror. This helped the system understand how motor commands change facial shapes.

Researchers then introduced a learning pipeline that connects sound to movement. Emo watched hours of YouTube videos of people speaking and singing, while an AI model analysed the relationship between audio and visible lip motion.

Instead of focusing on language or meaning, the system studied the raw sounds of speech. A facial action transformer converted those learned patterns into real-time motor commands.

This approach allowed Emo to lip sync not only in English but also in languages it was never trained on, including French, Arabic, and Chinese. The same method worked for singing, which is harder because of stretched vowels and rhythm changes.

Researchers say this matters because future robots will need to communicate naturally if they are going to work alongside people. This advancement has arrived when interest in robots for homes and workplaces is climbing fast.

At CES 2026, that momentum was on full display, with demos ranging from Boston Dynamics’ Atlas humanoid which is ready to enter workplace to SwitchBot’s household-focused robot that can cook meals and do your laundry, and LG’s upcoming home assistant robot designed to make everyday life easier.

Add advances like artificial skin that gives robots human-like sensitivity, and paired with realistic lip syncing, it is easy to see how robots are starting to feel less like machines and more like social companions. Emo is still a research project, but it shows how robots may one day learn human skills the same way we do by watching and listening.



Source link

━ more like this

Capcom’s long-delayed Pragmata is now arriving a week earlier

Capcom during its March 5 Spotlight showcase that Pragmata, its dystopian sci-fi adventure game, will release on April 17 for PlayStation...

Motorola’s upcoming Razr 70 foldable could get a camera and memory boost

Motorola hasn’t said a word officially, but China’s TENAA certification database (via Gadgets360) has done the talking anyway. The Motorola Razr 70 has...

T-Mobile 5G Home Internet’s latest deal gives you up to $300 back 

If you’ve been considering a switch from traditional cable, T-Mobile 5G Home Internet’s newest promotion may be the most compelling reason yet to make the move. The...

Rad Power Bikes gets a new owner, pledge to build bikes in the US

Life EV has completed a court-approved acquisition of Rad Power Bikes, granting a second life to the troubled e-bike brand.The Florida-based Life EV...

Microsoft Copilot just made browser switching a thing of the past

If you have ever been mid-conversation with Copilot, clicked a link, and then spent the next few minutes trying to find your way...
spot_img