Requesting a format in audb.load() might lead to duplicated index entries #322
I'm wondering whether it should be allowed at all to store two files that differ only in their file extension.
Yes, I guess this would be the easiest solution to this problem: extending lines 19 to 36 in a359988.
Maybe we could change

    def job(table_id):
        audformat.assert_no_duplicates(db[table_id]._df)

to

    def job(table_id):
        index = audformat.utils.replace_file_extension(db[table_id].index, '')
        audformat.assert_no_duplicates(index)
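The idea behind this check can be sketched with the standard library alone. This is only an illustration, not the audb or audformat implementation: the helper names `find_extension_collisions` and `assert_no_extension_collisions` are hypothetical, and a plain list of file names stands in for a table index.

```python
import os


def find_extension_collisions(files):
    """Group files by their path without extension.

    Returns only the groups with more than one member,
    i.e. files that differ only in their file extension.
    """
    groups = {}
    for f in files:
        stem, _ = os.path.splitext(f)
        groups.setdefault(stem, []).append(f)
    return {stem: names for stem, names in groups.items() if len(names) > 1}


def assert_no_extension_collisions(files):
    """Raise if two files would collide once their extension is ignored."""
    collisions = find_extension_collisions(files)
    if collisions:
        raise ValueError(f"Files differ only in extension: {collisions}")


# Hypothetical index: 'a.wav' and 'a.flac' collide, 'b.wav' does not
print(find_extension_collisions(["a.wav", "a.flac", "b.wav"]))
```

Running such a check at publish time would reject the database before a later format request could produce duplicates.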
Usually, we expect that a table conforming to audformat does not have any duplicated index entries. But if you have a database with the following table:
and you request the corresponding database with audb.load(..., format='wav'), you will end up with a table that has duplicated index entries.
Minimal example
Create a database with corresponding a.wav and a.flac files and publish it.
When loading without a requested format, everything is fine:
But when requesting format='wav', we get duplicated index entries:
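As a rough sketch of the mechanism, assuming the format request simply swaps every file extension in the index for the requested one: the snippet below uses only the standard library and does not call audb. The helpers `request_format` and `duplicated` are hypothetical names, and the list `index` stands in for a table index.

```python
import os
from collections import Counter


def request_format(files, fmt):
    """Mimic a format request by swapping every file extension for `fmt`."""
    return [f"{os.path.splitext(f)[0]}.{fmt}" for f in files]


def duplicated(entries):
    """Return the entries that occur more than once, in first-seen order."""
    counts = Counter(entries)
    return [entry for entry in counts if counts[entry] > 1]


index = ["a.wav", "a.flac"]  # hypothetical table index, unique as stored
converted = request_format(index, "wav")

print(duplicated(index))      # []
print(converted)              # ['a.wav', 'a.wav']
print(duplicated(converted))  # ['a.wav']
```

The index is unique before the extension swap and duplicated afterwards, which matches the behaviour described above.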