[WARNING] False Positive Issues #105

ziczhu · 2023-06-29T04:02:13Z

Currently, we are experiencing a high number of false positives when utilizing this library. In our scenario, approximately 70% of the results are false positives, which significantly impacts the accuracy of our application.

To address this issue, I suggest applying the following prechecks before using the library:

Preprocessing based on video length: Consider incorporating a preprocessing step that filters out videos with durations of less than 1 minute. This criterion can help eliminate irrelevant, short-duration videos, which often contribute to false positive matches.
Similarity threshold adjustment: Modify the similarity threshold used by the library to make it more stringent. By increasing the threshold, the library will only consider videos with a higher degree of similarity, reducing the occurrence of false positives. This adjustment can significantly improve the precision of the matching process.
Comparison of video durations: Introduce a comparison mechanism that checks the proximity of video durations when assessing similarity. This step would ensure that two videos are not considered similar if their durations differ significantly. By including this additional criterion, we can reduce the occurrence of false positives caused by videos with vastly different lengths.
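
The three prechecks above could be sketched roughly as follows. This is only an illustration: the `Video` class, the function names, and all three constants are hypothetical, and the similarity score is assumed to be a float in [0, 1] computed elsewhere (the library's own API is not used here). The numeric values would need tuning on your own data.

```python
from dataclasses import dataclass

# All three values are assumptions for illustration, not library defaults.
MIN_DURATION_S = 60.0      # skip videos shorter than 1 minute
MAX_DURATION_DIFF = 0.10   # durations must be within 10% of the longer one
MIN_SIMILARITY = 0.95      # stricter similarity threshold


@dataclass
class Video:
    """Hypothetical record; duration_s is obtained elsewhere (e.g. from metadata)."""
    path: str
    duration_s: float


def passes_prechecks(a: Video, b: Video) -> bool:
    """Return True only if the pair is worth comparing at all."""
    # Precheck 1: drop short videos, which tend to produce false positives.
    if a.duration_s < MIN_DURATION_S or b.duration_s < MIN_DURATION_S:
        return False
    # Precheck 3: reject pairs whose durations differ too much.
    longer = max(a.duration_s, b.duration_s)
    return abs(a.duration_s - b.duration_s) / longer <= MAX_DURATION_DIFF


def is_candidate_match(a: Video, b: Video, similarity: float) -> bool:
    """Precheck 2: apply the stricter threshold on top of the duration checks."""
    return passes_prechecks(a, b) and similarity >= MIN_SIMILARITY
```

With these values, a 120 s and a 125 s video scoring 0.96 would remain a candidate, while any pair involving a 30 s clip would be dropped before comparison.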

Still, thanks to the author for providing this library for low-cost comparison. If you're using it in a serious scenario, though, I would suggest treating it like a Bloom filter: use it as a cheap first pass, then run a more intensive algorithm on any positive result.
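
A minimal sketch of that two-stage idea, assuming you supply both checks yourself: `cheap_check` stands in for the library's comparison and `expensive_check` for whatever intensive algorithm you trust, and both names are hypothetical. Note that the analogy is loose: a real Bloom filter never produces false negatives, whereas the cheap check here may, so this only helps if the first pass rarely misses true matches.

```python
from typing import Callable, Iterable, List, Tuple

Pair = Tuple[str, str]


def two_stage_match(
    pairs: Iterable[Pair],
    cheap_check: Callable[[str, str], bool],
    expensive_check: Callable[[str, str], bool],
) -> List[Pair]:
    """Confirm matches with the expensive check only after a cheap positive."""
    confirmed: List[Pair] = []
    for a, b in pairs:
        # Stage 1: cheap filter; a negative result ends the work for this pair.
        if not cheap_check(a, b):
            continue
        # Stage 2: intensive verification, run only on cheap positives.
        if expensive_check(a, b):
            confirmed.append((a, b))
    return confirmed
```

The expensive check runs only on the pairs the cheap check flags, so the cheap pass's false positives cost extra compute but do not reach the final result.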

Qinmayyear · 2024-10-25T07:23:26Z

Wish I had seen this earlier. This library cannot be used to detect videos shorter than 1 minute; there were many false positive cases :(
