[Question] CTC-ASSISTED LLM-BASED CONTEXTUAL ASR #178

Closed
hjjjackie opened this issue Nov 27, 2024 · 1 comment

hjjjackie commented Nov 27, 2024

Hi, thank you for sharing the code. How should I prepare the following files referenced in contextual_asr_config.py?

infer_file: str = "/nfs/yangguanrou.ygr/data/fbai-speech/is21_deep_bias/my_ref/test-clean.biasing_100.tsv"
ctc_file: str = "/nfs/yangguanrou.ygr/data/librispeech_my_infer/wavlm_large_libri_test_other_char.txt"
filter_type: str = "char"
phn_to_name_dict: str = "/nfs/yangguanrou.ygr/data/librispeech_my_infer/wavlm_ft_libri960_${ref_split}_phn.json"
@hjjjackie hjjjackie changed the title CTC-ASSISTED LLM-BASED CONTEXTUAL ASR [Question] CTC-ASSISTED LLM-BASED CONTEXTUAL ASR Nov 27, 2024
Collaborator

yanghaha0908 commented Nov 29, 2024

  1. For infer_file, download it directly from https://github.com/facebookresearch/fbai-speech/blob/main/is21_deep_bias/ref/test-clean.biasing_100.tsv. I only reordered the rows to match the order of the audio files.
  2. ctc_file contains the Viterbi decoding results of our CTC fine-tuned WavLM-Large checkpoint. For your convenience, I've uploaded them; the link is in the README.
  3. You can ignore phn_to_name_dict; it is irrelevant. I'll delete it and update contextual_asr_config.py.
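
The reordering mentioned in item 1 could be sketched roughly as follows. This is not the script used in the repository; it is a minimal illustration that assumes the first tab-separated field of each TSV row is the utterance ID and that the desired audio order is available as a list of those IDs. The function name `reorder_tsv_by_ids` is hypothetical.

```python
def reorder_tsv_by_ids(tsv_lines, ordered_ids):
    """Return TSV rows rearranged so their first column follows ordered_ids.

    Assumption: the first tab-separated field of each row is the utterance ID.
    """
    by_id = {}
    for line in tsv_lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        utt_id = line.split("\t", 1)[0]
        by_id[utt_id] = line

    # Fail loudly if the audio list mentions an utterance the TSV lacks.
    missing = [u for u in ordered_ids if u not in by_id]
    if missing:
        raise KeyError(f"{len(missing)} IDs missing from TSV, e.g. {missing[0]}")

    return [by_id[u] for u in ordered_ids]


# Toy rows standing in for the real biasing TSV (contents are made up).
rows = ["utt-b\tSECOND TEXT\t[]\n", "utt-a\tFIRST TEXT\t[]\n"]
reordered = reorder_tsv_by_ids(rows, ["utt-a", "utt-b"])
```

In practice, `tsv_lines` would be read from the downloaded test-clean.biasing_100.tsv and `ordered_ids` from whatever manifest defines the audio order; both are left to the reader because their exact format is not stated in this thread.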

Refer to #181
