I believe we can, since all we need to do is make custom versions of the ASRDataset, the speech featurizer, and the text featurizer, and then feed the dataset into the models with phoneme classes as targets.
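As a rough illustration of the timestamp part, here is a minimal sketch of how per-frame phoneme predictions could be turned into segments. It assumes a CTC-style model with a blank class and one predicted class id per output frame (for example, the argmax over the logits). The function name, the `id_to_phoneme` mapping, and the `frame_seconds` value are hypothetical and not part of this repository; the real frame duration depends on the speech featurizer's hop length and the model's time reduction. CTC outputs tend to be spiky, so the boundaries should be treated as approximate.

```python
from typing import Dict, List, Sequence, Tuple


def frames_to_phoneme_segments(
    frame_ids: Sequence[int],
    id_to_phoneme: Dict[int, str],
    blank_id: int = 0,
    frame_seconds: float = 0.04,  # assumed duration of one output frame
) -> List[Tuple[str, float, float]]:
    """Collapse per-frame class ids into (phoneme, start_s, end_s) segments."""
    segments: List[Tuple[str, float, float]] = []
    current_id = blank_id
    start_frame = 0
    # A trailing blank acts as a sentinel so the last run is always flushed.
    for i, pid in enumerate(list(frame_ids) + [blank_id]):
        if pid == current_id:
            continue  # still inside the same run of frames
        if current_id != blank_id:
            # The run that just ended covers frames [start_frame, i).
            segments.append(
                (id_to_phoneme[current_id], start_frame * frame_seconds, i * frame_seconds)
            )
        current_id = pid
        start_frame = i
    return segments


if __name__ == "__main__":
    # Hypothetical frame-level predictions: blank, HH, HH, blank, AH, AH, AH, blank
    ids = [0, 1, 1, 0, 2, 2, 2, 0]
    for phoneme, start, end in frames_to_phoneme_segments(ids, {1: "HH", 2: "AH"}):
        print(f"{phoneme}: {start:.2f}s to {end:.2f}s")
```

Runs of blank frames are skipped, and a blank between two identical ids keeps them as separate phonemes, which matches how CTC labels repeated symbols.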
Can this be modified to extract phonemes with their start and end timestamps in the audio file?