Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question #6

Open
lvaleriu opened this issue Oct 11, 2018 · 4 comments
Open

Question #6

lvaleriu opened this issue Oct 11, 2018 · 4 comments

Comments

@lvaleriu
Copy link
Contributor

Why do we need to do this in fact: "Use ffmpeg to convert and split WAV files into 10 second parts"?

After downloading we have big wav files. We can then directly convert them to spectogram image files.
This will slice anyway the image into 10 seconds spectograms.

@Bartzi
Copy link
Member

Bartzi commented Oct 12, 2018

Of course you can also do it in this way... if you think that this works better for you, then go ahead...

@lvaleriu
Copy link
Contributor Author

It is mainly because i dont need to store segment wav files too (which is 88 gb on my disk).
I already store the youtube downloaded files directly to mp3 now for the same reason.
And i've managed to extract 10 seconds spectograms from the mp3s quite fast actually.

@omfuke
Copy link

omfuke commented Nov 12, 2020

how much amount of data I should use for classifying between Hindi and English? is 20000 spectrogram per language is sufficient ?

@Bartzi
Copy link
Member

Bartzi commented Nov 12, 2020

Sounds like a good amount of data. I think it could work!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants