diff --git a/_posts/2024-08-09-Generative-Speech.md b/_posts/2024-08-09-Generative-Speech.md index f8f6f56d..1b49a43a 100644 --- a/_posts/2024-08-09-Generative-Speech.md +++ b/_posts/2024-08-09-Generative-Speech.md @@ -207,6 +207,11 @@ faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2 **Github:** [https://github.com/QwenLM/Qwen-Audio](https://github.com/QwenLM/Qwen-Audio)
![](https://github.com/QwenLM/Qwen-Audio/raw/main/assets/framework.png) +--- +### Canary +**model:** [nvidia/canary-1b](https://huggingface.co/nvidia/canary-1b)
+**Paper:** [Less is More: Accurate Speech Recognition & Translation without Web-Scale Data](https://arxiv.org/abs/2406.19674)
+ --- ### [Whisper Large-v3](https://huggingface.co/openai/whisper-large-v3) **model:** openai/whisper-large-v3