Training AI to read your lips — in multiple languages

Episode 277 November 30, 2022 00:04:09
Training AI to read your lips — in multiple languages
Localization Today
Training AI to read your lips — in multiple languages

Nov 30 2022 | 00:04:09

/

Hosted By

Eddie Arrieta

Show Notes

While widely used speech recognition tools like Siri or Otter generally analyze audio alone, researchers have also made progress in developing visual speech recognition (VSR) models, which rely on visual input to identify what a speaker is saying.

Other Episodes

Episode 38

February 24, 2022 00:03:51
Episode Cover

On peut se tutoyer? The impossible task of conveying complex concepts in limited space

With more foreign language content coming to popular streaming platforms like Netflix, the art of subtitle translation continues to be front and center. Have...

Listen

Episode 201

September 06, 2022 00:03:45
Episode Cover

Developing machine translation to help Indigenous refugees navigate immigration courts

To help ease the linguistic challenges for refugees from Central America and Mexico seeking asylum in the United States, a team of researchers at...

Listen

Episode 321

August 15, 2025 00:29:58
Episode Cover

Global Ambitions: Revolution in Motion — The Preview

Gabriel Karandyšovský previews “Global Ambitions: Revolution in Motion,” a five-article sampler. He digs into making AI actually ship through better infrastructure, why incremental wins...

Listen