From 983debea268205e897f38ac0205f9db510d00e64 Mon Sep 17 00:00:00 2001 From: Ananth Shyam S Date: Thu, 12 Sep 2024 16:00:42 +0530 Subject: [PATCH] Update training.md --- training.md | 45 +++++++++++++++++++++++++-------------------- 1 file changed, 25 insertions(+), 20 deletions(-) diff --git a/training.md b/training.md index 2c95c31..e2f4b90 100644 --- a/training.md +++ b/training.md @@ -11,16 +11,19 @@ ### Speech Corpus Directory Structure ``` -+-- speech_corpus_directory -| +-- speaker1 -| --- recording1.wav -| --- recording1.lab -| --- recording2.wav -| --- recording2.lab -| +-- speaker2 -| --- recording3.wav -| --- recording3.lab -| --- ... +speech_corpus_directory/ +│ +├── speaker1/ +│ ├── recording1.wav +│ ├── recording1.lab +│ ├── recording2.wav +│ └── recording2.lab +│ +├── speaker2/ +│ ├── recording3.wav +│ └── recording3.lab +│ ... + ``` ### Audio Files @@ -78,14 +81,16 @@ mfa train --clean --phone_set UNKNOWN --use_mp -j 16 --single_speaker ~/path/to/ ### Acoustic Model Directory Structure ``` -+-- acoustic_model_directory -| --- final.alimdl -| --- final.mdl -| --- lda.mat -| --- meta.json -| --- phone_lm.fst -| --- phone_pdf.counts -| --- phones.txt -| --- rules.yaml -| --- Tree +acoustic_model_directory/ +│ +├── final.alimdl # Alignment model file +├── final.mdl # Final model file +├── lda.mat # Linear Discriminant Analysis matrix +├── meta.json # Metadata file containing model information +├── phone_lm.fst # Phone-level language model in FST format +├── phone_pdf.counts # Phone PDF counts file +├── phones.txt # Phone list +├── rules.yaml # Rules file +└── Tree # Decision tree file + ```