-
Notifications
You must be signed in to change notification settings - Fork 0
Image retrieval practices
Media manager must support storing media files, as well as searching for matching stored files when a similar or duplicate media file (the query file) is provided.
In this article, we focus on the techniques to search for images efficiently, involving how to convert the image into appropriate format to make the easily searchable (usually as vectors), how to perform search using the format, how to derive simialrity between the query image and the matching images, etc. In computer science, this is called image retrieval (IR).
Matsui et al, 2020 has provided a clear overview on modern image retireval preatices: Source: https://www.youtube.com/watch?v=SKrHs03i08Q&list=PLKQB14e0EJUWaTnwgQogJ3nSLzEFNn9d8&t=13m00s
Given a dataset (in this case, it is a set of images to search from), finding the most similar images
First of all, the scale of the dataset (number of images in the database to search for) is discussed. In a billion-scale database,