Training AI to read your lips — in multiple languages

Episode 277 November 30, 2022 00:04:09
Training AI to read your lips — in multiple languages
Localization Today
Training AI to read your lips — in multiple languages

Nov 30 2022 | 00:04:09

/

Hosted By

Eddie Arrieta

Show Notes

While widely used speech recognition tools like Siri or Otter generally analyze audio alone, researchers have also made progress in developing visual speech recognition (VSR) models, which rely on visual input to identify what a speaker is saying.

Other Episodes

Episode 281

May 08, 2025 00:34:39
Episode Cover

Turning Data Into Direction

In this episode, we speak with Veronique Ozkaya, Co-CEO of Datamundi AI, about the company’s transformation from Summa Linguae Technologies into a data-focused, AI-driven...

Listen

Episode 7

January 14, 2022 00:02:42
Episode Cover

University of Macau researchers win several prizes at MT conference.mp3

The government of Macau recently commended the University of Macau’s Natural Language Processing and Portuguese-Chinese Machine Translation Lab, for landing in first place in...

Listen

Episode 170

July 21, 2022 00:02:53
Episode Cover

Song title translation on Spotify complicates search for users

Spotify recently addressed user concerns regarding the automatic translation of song names into English, making titles in languages such as Chinese particularly difficult to...

Listen