Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

README: Speaker ID process clarification #18

Open
lkraav opened this issue Sep 28, 2017 · 2 comments
Open

README: Speaker ID process clarification #18

lkraav opened this issue Sep 28, 2017 · 2 comments
Labels

Comments

@lkraav
Copy link

lkraav commented Sep 28, 2017

Perhaps the README could clarify what the expected process output is when speaker ID feature is enabled? What is supposed to look different in the text output compared to disabling speaker ID. Is it possible to give speakers names via some transcription configuration file, or is that post-text-editing work?

@alumae
Copy link
Owner

alumae commented Sep 28, 2017

Yes, this needs to be clarified in the README.

Just to let you know, it only changes the names of the speakers in the output trs files, and the recognized speakers is a closed set of Estonian public figures who occur often enough in Estonian broadcast news (see for example the names in http://bark.phon.ioc.ee/tsab/p/play?trans=8840). It's not possible to change it or retrain it any way (currently). So it probably only interest you if you process Estonian broadcast news.

@lkraav
Copy link
Author

lkraav commented Sep 28, 2017

Hehe, yeah this was useful information. Well, I'm running it on my custom audio, let's see what it comes up with. A retraining process would definitely be useful.

@alumae alumae added the bug label Sep 28, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants