In this project, we develop a model that retrieves a specific photo given a short description. We first use a pretrained Faster R-CNN object detection model to detect the objects in each image, then use a pretrained ResNet-34 model to generate an embedding for each detected object. Next, for each image, we pass the object embeddings through an image encoder, and we pass each word of the caption through a text encoder. Finally, we compute a similarity score for each image-caption pair. Our full model architecture is shown below.
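The matching step above can be sketched in a few lines. This is a minimal stand-in, not our actual model: the real pipeline uses learned image and text encoders over Faster R-CNN / ResNet-34 features, whereas here mean pooling plays the role of both encoders and the embeddings are random toy data.

```python
import math
import random

def cosine_similarity(a, b):
    # similarity score for one image-caption pair
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def mean_pool(vectors):
    # stand-in encoder: average a set of embeddings into one vector
    # (the real model learns the image and text encoders instead)
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]

# toy example: 5 detected-object embeddings and 7 word embeddings, 16-d each
rng = random.Random(0)
object_embeddings = [[rng.gauss(0, 1) for _ in range(16)] for _ in range(5)]
word_embeddings = [[rng.gauss(0, 1) for _ in range(16)] for _ in range(7)]
score = cosine_similarity(mean_pool(object_embeddings), mean_pool(word_embeddings))
```

At retrieval time, the caption is scored against every image this way, and the highest-scoring image is returned.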
In this project, we use the Flickr30k dataset, a large corpus of roughly 30K images and 150K descriptive captions. We split the dataset into a train set, a validation set, and a test set: the validation and test sets each contain 1,000 images, and the train set contains the remaining 29,783 images. You can download the dataset and find the preprocessing file here.
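The split above can be reproduced with a simple shuffle. This is an illustrative sketch with hypothetical image IDs; the actual assignment of images to splits is defined in our preprocessing file.

```python
import random

def split_flickr30k(image_ids, n_val=1000, n_test=1000, seed=0):
    # shuffle deterministically, then carve off the test and validation sets;
    # everything else becomes the train set
    ids = list(image_ids)
    random.Random(seed).shuffle(ids)
    test = ids[:n_test]
    val = ids[n_test:n_test + n_val]
    train = ids[n_test + n_val:]
    return train, val, test

# 31,783 total images -> 29,783 train / 1,000 val / 1,000 test
train, val, test = split_flickr30k(range(31783))
```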
You can find a pretrained Faster R-CNN object detection model here.
You can find our writeup here.
You can find our project presentation here.