Commit

Update 2024-08-09-Generative-Speech.md
rkuo2000 authored Oct 13, 2024
1 parent a60ff99 commit fcf6492
Showing 1 changed file with 5 additions and 0 deletions.
5 changes: 5 additions & 0 deletions _posts/2024-08-09-Generative-Speech.md
@@ -207,6 +207,11 @@ faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2
**Github:** [https://github.com/QwenLM/Qwen-Audio](https://github.com/QwenLM/Qwen-Audio)<br>
![](https://github.com/QwenLM/Qwen-Audio/raw/main/assets/framework.png)

---
### Canary
**model:** [nvidia/canary-1b](https://huggingface.co/nvidia/canary-1b)<br>
**Paper:** [Less is More: Accurate Speech Recognition & Translation without Web-Scale Data](https://arxiv.org/abs/2406.19674)<br>

---
### [Whisper Large-v3](https://huggingface.co/openai/whisper-large-v3)
**model:** openai/whisper-large-v3<br>
