Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions about custom data usage for my context #46

Closed
feranzie opened this issue Jul 3, 2024 · 1 comment
Closed

Questions about custom data usage for my context #46

feranzie opened this issue Jul 3, 2024 · 1 comment
Assignees

Comments

@feranzie
Copy link

feranzie commented Jul 3, 2024

Great work with the repo. I have a couple of questions regarding using this for my context.

I'm trying to query songs on radio broadcast recordings.
I initially trained the model using your data on Kaggle but I got poor results (practically 0.00) during evaluation with my custom data.
so I followed your advice in #41 to train the model on my own dataset but since my data is not much ( I have just 3, 1hr 30 mins of audios being split and categorized by music, speech and noise) I just added them to respective folders within your dataset and trained on 20 epochs.

for more context I'm generating fingerprints using the code in #38 and evaluation seems to require dummy_db.mm and dummy_db_shape.npy so I copied both into the logs folder for my checkpoint from the folder where fingerprints were generated while using your test data and it works fine.
With the model trained on my data combined with your Kaggle data, evaluation metrics increased insignificantly I was able to get some top 1 exact matches of about (0.17-0,19)
I assumed the problem could be from the dummy data used to train the index so I tried using L2 index type and even setting fake_recon_index = db as you mentioned in #38 but all evaluation results are still in the range 0.00 to 0.25.

Is there anything I am doing wrong or missing out?

Or this repo cannot work for my data type?

@mimbres mimbres self-assigned this Jul 5, 2024
@mimbres
Copy link
Owner

mimbres commented Jul 6, 2024

Hi @feranzie,
Interesting. When you say 0.17-0.19, do you mean 17% or 0.17%?

About training:

  1. Are you sure you put your custom music data into test-query-db-500-30s and kept the training set (10k songs) unchanged?
  2. Did you set TR_SPEECH_AUG = True in the config? This setting is necessary for the speech to be treated as noise.
  3. Have you checked if the performance of your newly trained model is good with the official test queries?

Sanity check:
4. Make db.mm and query.mm the same (copy them), and also make the shape .npy files the same. You must get 100% in the test.

@mimbres mimbres closed this as completed Sep 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants