Image clustering and semantic annotation via contrastive learning

This repo is based and modified by the paper:

SCAN: Learning to Classify Images without Labels

Installation

The code runs with Pytorch on Linux system. Assuming Anaconda, the most important packages can be installed as:

conda install pytorch=1.4.0 torchvision=0.5.0 cudatoolkit=10.0 -c pytorch
conda install matplotlib scipy scikit-learn   # For evaluation and confusion matrix visualization
conda install faiss-gpu                       # For efficient nearest neighbors search 
conda install pyyaml easydict                 # For using config files
conda install termcolor                       # For colored print statements

We refer to the requirements.txt file for an overview of the packages in the environment we used to produce our results.

Training

Setup

The following files need to be adapted in order to run the code on your own machine:

Change the file paths to the datasets in utils/mypath.py, e.g. /path/to/cifar10.
Specify the output directory in configs/env.yml. All results will be stored under this directory.

Our experimental evaluation includes the following datasets: CIFAR10, CIFAR100-20, STL10 and ImageNet. The ImageNet dataset should be downloaded separately and saved to the path described in utils/mypath.py. Other datasets will be downloaded automatically and saved to the correct path when missing.

Train model

The configuration files can be found in the configs/ directory. The training procedure consists of the following steps:

STEP 1: Solve the pretext task i.e. simclr.py
STEP 2: Perform the clustering step i.e. scan.py
STEP 3: Perform the self-labeling step i.e. selflabel.py

For example, run the following commands sequentially to perform our method on CIFAR10:

python simclr.py --config_env configs/env.yml --config_exp configs/pretext/simclr_cifar10.yml
python scan.py --config_env configs/env.yml --config_exp configs/scan/scan_cifar10.yml
python selflabel.py --config_env configs/env.yml --config_exp configs/selflabel/selflabel_cifar10.yml

The whole training and evaluation process costs around 12 hours on a single GTX1080 Ti GPU.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
configs		configs
data		data
dataset		dataset
images		images
logs		logs
losses		losses
models		models
utils		utils
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
cifar10-perspective.out		cifar10-perspective.out
cifar10-pretext.out		cifar10-pretext.out
cifar10-scan.out		cifar10-scan.out
cifar10-selflabel.out		cifar10-selflabel.out
eval.py		eval.py
fashion-mnist-perspective.out		fashion-mnist-perspective.out
fashion-mnist-pretext.out		fashion-mnist-pretext.out
fashion-mnist-scan.out		fashion-mnist-scan.out
fashion-mnist-selflabel.out		fashion-mnist-selflabel.out
mnist-noflip.out		mnist-noflip.out
mnist-pretext.out		mnist-pretext.out
mnist-scan.out		mnist-scan.out
mnist-selflabel.out		mnist-selflabel.out
moco.py		moco.py
requirements.txt		requirements.txt
run.slurm		run.slurm
scan.py		scan.py
selflabel.py		selflabel.py
simclr.py		simclr.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image clustering and semantic annotation via contrastive learning

Installation

Training

Setup

Train model

About

Releases

Packages

Languages

License

LiAlH4/Image-Clustering-SLproject

Folders and files

Latest commit

History

Repository files navigation

Image clustering and semantic annotation via contrastive learning

Installation

Training

Setup

Train model

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages