A man hard to read. Photo of Alan Moore by Mirka. CC-BY-SA 3.0 |
Most significant problem is however ambiguity between mouth shapes and sounds. During speech, the mouth forms between 10 and 14 different shapes, known as visemes. By contrast, speech contains around 50 individual sounds known as phonemes. So a single viseme can represent several different phonemes.
Source: The Challenges and Threats of Automated Lip Reading
No comments:
Post a Comment