While widely used speech recognition tools like Siri or Otter generally analyze audio alone, researchers have also made progress in developing visual speech recognition (VSR) models, which rely on visual input to identify what a speaker is saying.
In this episode, we speak with Veronique Ozkaya, Co-CEO of Datamundi AI, about the company’s transformation from Summa Linguae Technologies into a data-focused, AI-driven...
The government of Macau recently commended the University of Macau’s Natural Language Processing and Portuguese-Chinese Machine Translation Lab for landing in first place in...
Spotify recently addressed user concerns regarding the automatic translation of song names into English, which makes titles in languages such as Chinese particularly difficult to...