Skip to content

Commit

Permalink
Update 2024-08-09-Generative-Speech.md
Browse files Browse the repository at this point in the history
  • Loading branch information
rkuo2000 authored Oct 13, 2024
1 parent 5c1a0cb commit a60ff99
Showing 1 changed file with 5 additions and 0 deletions.
5 changes: 5 additions & 0 deletions _posts/2024-08-09-Generative-Speech.md
Original file line number Diff line number Diff line change
Expand Up @@ -202,6 +202,11 @@ faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2
### [Open Whisper-style Speech Models (OWSM)](https://www.wavlab.org/activities/2024/owsm/)
**Paper:** [OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer](https://arxiv.org/abs/2401.16658)<br>

---
### Qwen-Audio
**Github:** [https://github.com/QwenLM/Qwen-Audio](https://github.com/QwenLM/Qwen-Audio)<br>
![](https://github.com/QwenLM/Qwen-Audio/raw/main/assets/framework.png)

---
### [Whisper Large-v3](https://huggingface.co/openai/whisper-large-v3)
**model:** openai/whisper-large-v3<br>
Expand Down

0 comments on commit a60ff99

Please sign in to comment.