In this work we accomplish the task of lipreading 100 words selected from the 1000 most common words in the Italian language through deep learning. We create our own dataset by recording a person pronouncing words facing a camera. The chosen set of words consists of nouns, adjectives and verbs and can possibly be extended to the whole dictionary. We use an adapted version of GCNet for our purpose. The model we present should be taken as a follow up, simplified version of LipNet, Deep Speech 2 and Lip Reading Sentences in the Wild, where more complex networks are capable of recognizing entire sentences coming from different speakers, possibly with different orientations and in various settings.
-
Notifications
You must be signed in to change notification settings - Fork 0
jacopopiogargano/Ita-Lip
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published