Were both training and test splits from original datasets (e.g., AudioSet, VggSound) used to train CLAP? #97

ttgeng233 · 2023-08-10T20:29:03Z

I would like to know exactly which splits of AudioSet and VggSound were used to train CLAP. Many audio datasets for downstream tasks were collected from these two large-scale datasets, so if all of their test data were seen during the pre-training stage, the evaluation results would be unconvincing.

YuchenHui22314 · 2024-09-12T02:15:50Z

During evaluation, we manually remove examples that were already seen in the pretraining stage. For example, when testing on ESC-50, we removed all clips that overlap with Freesound and AudioSet.
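
For readers who want to apply the same idea, here is a minimal sketch of ID-based overlap filtering. It is not the authors' actual code: the function name `drop_seen_examples`, the `source_id` field, and the sample IDs are all hypothetical, and it assumes each evaluation clip carries an identifier that can be matched against the pretraining data.

```python
from typing import Iterable


def drop_seen_examples(
    eval_items: list[dict],
    pretrain_ids: Iterable[str],
    id_key: str = "source_id",  # hypothetical field name, adjust to your metadata
) -> list[dict]:
    """Return only the evaluation items whose source ID is absent from pretraining."""
    # Normalise to strings so that 1002 and "1002" compare equal.
    seen = {str(i) for i in pretrain_ids}
    return [item for item in eval_items if str(item[id_key]) not in seen]


# Toy data: the IDs below are made up for illustration.
eval_items = [
    {"filename": "a.wav", "source_id": "1001"},
    {"filename": "b.wav", "source_id": "1002"},
    {"filename": "c.wav", "source_id": "1003"},
]
pretrain_ids = ["1002", "9999"]

kept = drop_seen_examples(eval_items, pretrain_ids)
print([item["filename"] for item in kept])  # ['a.wav', 'c.wav']
```

Matching on IDs only catches clips that share an identifier with the pretraining data; re-encoded or trimmed copies of the same audio would need a different check.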
