diff --git a/_posts/2024-01-01-owsm.md b/_posts/2024-01-01-owsm.md index 4d66d145..3f811646 100644 --- a/_posts/2024-01-01-owsm.md +++ b/_posts/2024-01-01-owsm.md @@ -8,7 +8,7 @@ comments: false ## Overview -The **O**pen **W**hisper-style **S**peech **M**odels (OWSM, pronounced as "awesome") are a series of speech foundation models developed by [WAVLab](https://www.wavlab.org/) at Carnegie Mellon University. We reproduce Whisper-style training using publicly available data and our open-source toolkit [ESPnet](https://github.com/espnet/espnet). By publicly releasing data preparation scripts, training and inference code, pre-trained model weights and training logs, we aim to promote transparency and open science in large-scale speech pre-training. +**O**pen **W**hisper-style **S**peech **M**odels (OWSM, pronounced as "awesome") are a series of speech foundation models developed by [WAVLab](https://www.wavlab.org/) at Carnegie Mellon University. We reproduce Whisper-style training using publicly available data and our open-source toolkit [ESPnet](https://github.com/espnet/espnet). By publicly releasing data preparation scripts, training and inference code, pre-trained model weights and training logs, we aim to promote transparency and open science in large-scale speech pre-training. ## News @@ -19,7 +19,6 @@ The **O**pen **W**hisper-style **S**peech **M**odels (OWSM, pronounced as "aweso - Gradio demo: [![Static Badge](https://img.shields.io/badge/OWSM-Demo-orange)](https://pyf98-owsm-v3-demo.hf.space) - Colab notebook: [![Open All Collab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1zKI3ZY_OtZd6YmVeED6Cxy1QwT1mqv9O?usp=sharing) - ## Papers Please cite our papers if you use OWSM. @@ -80,7 +79,7 @@ We have released various pre-trained models. The training logs are also availabl E-Branchformer 1.02B espnet/owsm_v3.1_ebf - TBD + egs2/owsm_v3.1/s2t1