Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Multilingual Spoken Words Corpus to list #24

Open
Ekhao opened this issue Aug 16, 2024 · 0 comments
Open

Add Multilingual Spoken Words Corpus to list #24

Ekhao opened this issue Aug 16, 2024 · 0 comments

Comments

@Ekhao
Copy link

Ekhao commented Aug 16, 2024

I stumbled over this great list of voice datasets but noticed that the "Multilingual Spoken Words Corpus" dataset is missing.

Url: https://mlcommons.org/datasets/multilingual-spoken-words/
Short Description: MLCommons Multilingual Spoken Words Corpus is a large and growing audio dataset of spoken words in 50 languages for academic research and commercial applications in keyword spotting and spoken term search, licensed under CC-BY 4.0. The dataset contains more than 340,000 keywords, totalling 23.4 million 1-second spoken examples (over 6,000 hours).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant