Skip to content

Latest commit

 

History

History
179 lines (122 loc) · 9.86 KB

dataset.md

File metadata and controls

179 lines (122 loc) · 9.86 KB

Instructions for downloading the datasets

The datasets used in this work can be broadly categorized into data required for blending and data necessary for evaluation.

All the datasets should be downloaded and placed in the data directory.

The directory structure of data should be as follows:

DermSynth3D_private/
┣ ...						   		# other source code
┣ data/                        		# directory to store the data
┃  ┣ 3dbodytex-1.1-highres   		# data for 3DBodyTex.v1 3d models and texture maps
┃  ┣ fitzpatrick17k/
┃  ┃  ┣ data/               		# Fitzpatrick17k images
┃  ┃  ┗ annotations/        		# annotations for Fitzpatrick17k lesions
┃  ┣ ph2/                   
┃  ┃  ┣ images/               		# PH2 images
┃  ┃  ┗ labels/               		# PH2 annotations
┃  ┣ dermofit/ 						# Dermofit dataset
┃  ┃  ┣ images/               		# Dermofit images	
┃  ┃  ┗ targets/               		# Dermofit annotations 
┃  ┣ FUSeg/
┃  ┃  ┣ train/               		# training set with images/labels for FUSeg
┃  ┃  ┣ validation/               	# val set with images/labels for FUSeg
┃  ┃  ┗ test/               		# test set with images/labels for FUSeg
┃  ┣ Pratheepan_Dataset/
┃  ┃  ┣ FacePhoto/               	# images from Pratheepan dataset
┃  ┃  ┗ GroundT_FacePhoto/  	 	# annotations
┃  ┣ lesions/                   	# keep the non-skin masks for 3DBodyTex.v1 meshes here
┃  ┣ annotations/                   # segmentation masks for Annotated Fitzpatrick17k lesions
┃  ┣ bodytex_anatomy_labels/ 		# per-vertex labels for anatomy of 3DBodyTex.v1 meshes
┃  ┣ background/               		# keep the background scenes for rendering here
┃  ┗ synth_data/			   		# the generated synthetic data will be stored here
    	┣ train/               		# training set with images/labels for training on synthetic data<val/test>/ 			 	# val and test set with images/labels for training on synthetic data
Data For Blending

Download 3DBodyTex.v1 meshes

The 3DBodyTex.v1 dataset can be downloaded from here.

3DBodyTex.v1 contains the meshes and texture images used in this work and can be downloaded from the external site linked above (after accepting a license agreement).

NOTE: These textured meshes are needed to run the code to generate the data.

We provide the non-skin texture maps annotations for 2 meshes: 006-f-run and 221-m-u. Hence, to generate the data, make sure to get the .obj files for these two meshes and place them in data/3dbodytex-1.1-highres before excecuting scripts/gen_data.py.

After accepting the licence, download and unzip the data in ./data/.


Download the 3DBodyTex.v1 annotations

Non-skin texture maps Anatomy labels

We provide the non-skin texture map ($T_{nonskin}$) annotations for 215 meshes from the 3DBodyTex.v1 dataset here.

We provide the per-vertex labels for anatomical parts of the 3DBodyTex.v1 meshes obtained by fitting SCAPE template body model here.

The folders are organised with the same IDs as the meshes in 3DBodyTex.v1 dataset.


Download the Fitzpatrick17k dataset

Fitzpatrick17k Images Fitzpatrick17k annotations
image mask.

We used the skin conditions from Fitzpatrick17k. See their instructions to get access to the Fitzpatrick17k images.
We provide the raw images for the Fitzpatrick17k dataset here.
After downloading the dataset, unzip the dataset:
unzip fitzpatrick17k.zip -d data/fitzpatrick17k/

We provide the densely annotated lesion masks from the Fitzpatrick17k dataset are given within this repository under the data directory. More of such annotations can be downloaded from here.


Download the Background Scenes

scene1 scene2

Although you can use any scenes as background for generating the random views of the lesioned-meshes, we used SceneNet RGB-D for the background IndoorScenes. Specifically, we used this split, and sampled 3000 images from it.

For convenience, the background scenes we used to generate the ssynthetic dataset can be downloaded from here.


Data For Training

Download the FUSeg dataset

scene1 scene2

The Foot Ulcer Segmentation Challenge (FUSeg) dataset is available to download from their official repository. Download and unpack the dataset at data/FUSeg/, maintaining the Folder Structure shown above.

For simplicity, we mirror the FUSeg dataset here.


Download the Pratheepan dataset

prath

The Pratheepan dataset is available to download from their official website. The images and the corresponding ground truth masks are available in a ZIP file hosted on Google Drive. Download and unpack the dataset at data/Pratheepan_Dataset/.


Download the PH2 dataset

ph2

The PH2 dataset can be downloaded from the official ADDI Project website. Download and unpack the dataset at data/ph2/, maintaining the Folder Structure shown below.


Download the DermoFit dataset

dermo

An example image from the DermoFit dataset showing different skin lesions.

The DermoFit dataset is available through a paid perpetual academic license from the University of Edinburgh. Please access the dataset following the instructions for the DermoFit Image Library and unpack it at data/dermofit/, maintaining the Folder Structure shown above.


Creating the Synthetic dataset

annots For convenience, we provide the generated synthetic data we used in this work for various downstream tasks here.

If you want to train your models on a different split of the synthetic data, you can download a dataset generated by blending lesions on 26 3DBodyTex scans from here. To prepare the synthetic dataset for training. Sample the images, and targets from the path where you saved this dataset and then organise them into train/val.

Alternatively, you can use the code provided in scripts/prep_data.py to create it.

Even better, you can generate your own dataset, by following the instructions here.