Training AI to read your lips — in multiple languages

Episode 277 November 30, 2022 00:04:09
Localization Today

Hosted By

Eddie Arrieta

Show Notes

While widely used speech recognition tools like Siri or Otter generally analyze audio alone, researchers have also made progress in developing visual speech recognition (VSR) models, which rely on visual input to identify what a speaker is saying.

Other Episodes

Episode 10

February 01, 2023 00:12:34

Gridly vs. Google Sheets, or Changing outdated practices in a localization pipeline | January 2023

For as long as Denis Ivanov could remember, his localization team worked with Google Sheets. Everyone was comfortable and familiar with it. But at...

Episode 218

October 02, 2024 00:09:26

Abduweli Ayup: Winner of the First Language Rights Defenders Award

Interview by Gerald Roche Scholar and activist Abduweli Ayup was imprisoned in China for his work promoting the language rights of Uyghur people and...

Episode 291

June 09, 2025 00:06:49

Stick-Joy and Joysticks: A Gen-Xer’s take on gaming, language, and the alchemy of translation

By Ewandro Magalhães The author shares his impressions on how games travel across language and culture, drawing comparisons among make-believe games, board games, and...