[Question] CTC-ASSISTED LLM-BASED CONTEXTUAL ASR #178

hjjjackie · 2024-11-27T05:25:22Z

Hi, Thank you for sharing the codes. I am wondering how to prepare the following files in contextual_asr_config.py?

infer_file: str = "/nfs/yangguanrou.ygr/data/fbai-speech/is21_deep_bias/my_ref/test-clean.biasing_100.tsv"
ctc_file: str = "/nfs/yangguanrou.ygr/data/librispeech_my_infer/wavlm_large_libri_test_other_char.txt"
filter_type: str = "char"
phn_to_name_dict: str = "/nfs/yangguanrou.ygr/data/librispeech_my_infer/wavlm_ft_libri960_${ref_split}_phn.json"

The text was updated successfully, but these errors were encountered:

yanghaha0908 · 2024-11-29T03:50:29Z

For infer_file, get it directly from https://github.com/facebookresearch/fbai-speech/blob/main/is21_deep_bias/ref/test-clean.biasing_100.tsv, I just changed the order to match the audio order.
For ctc_file, it's the viterbi decode results of our CTC Fine-tuned WavLM-Large checkpoint. For your convenience, I've uploaded them, the link is in README.
Just ignore phn_to_name_dict, it's irrelevant, I'll delete it and update contextual_asr_config.py.

Refer to #181

hjjjackie changed the title ~~CTC-ASSISTED LLM-BASED CONTEXTUAL ASR~~ [Question] CTC-ASSISTED LLM-BASED CONTEXTUAL ASR Nov 27, 2024

ddlBoJack assigned yanghaha0908 Nov 27, 2024

yanghaha0908 closed this as completed Dec 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question] CTC-ASSISTED LLM-BASED CONTEXTUAL ASR #178

[Question] CTC-ASSISTED LLM-BASED CONTEXTUAL ASR #178

hjjjackie commented Nov 27, 2024 •

edited

Loading

yanghaha0908 commented Nov 29, 2024 •

edited

Loading

[Question] CTC-ASSISTED LLM-BASED CONTEXTUAL ASR #178

[Question] CTC-ASSISTED LLM-BASED CONTEXTUAL ASR #178

Comments

hjjjackie commented Nov 27, 2024 • edited Loading

yanghaha0908 commented Nov 29, 2024 • edited Loading

hjjjackie commented Nov 27, 2024 •

edited

Loading

yanghaha0908 commented Nov 29, 2024 •

edited

Loading