Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

generating alignments #10

Open
yasheshgaur opened this issue Dec 22, 2015 · 2 comments
Open

generating alignments #10

yasheshgaur opened this issue Dec 22, 2015 · 2 comments

Comments

@yasheshgaur
Copy link

Hi,

Kaldi scripts usually also generate alignments with lattices. You have both lat..gz and ali..gz files.

While in the offline transcriber, we only have the lattices as outputs. Is there any way to also generate alignments?

Thanks!

@alumae
Copy link
Owner

alumae commented Dec 23, 2015

Alignments in the form of CTM files can already be generated (see https://github.com/alumae/kaldi-offline-transcriber/blob/master/Makefile#L249). I.e., you may invoke

make build/output/foo.ctm

which generates a CTM file for src-audio/foo.mp3.

If you need alignments in other format (e.g. phone alignments), you may look inside the steps/get_ctm.sh file an modify it according to you needs).

@vince62s
Copy link

By the way, is there any existing script to convert ctm files into the "kaldi training data files" text, segments, utt2spk, spk2utt ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants