This network builds upon Cylinder3D by Zhu et al. It uses the self-attention blocks from CodedVTR and implements a softmax temperature annealing schedule that decays over the training epochs. The multi-head attention is adapted to the convolution channel sizes of the underlying 3D U-Net-like backbone of the network.
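To illustrate the idea of temperature annealing, below is a minimal sketch of a softmax temperature that decays with the epoch. The schedule shape, start/end values, and function names are illustrative assumptions, not the exact implementation used in this repository:

```python
import torch

def annealed_temperature(epoch, max_epochs, t_start=1.0, t_end=0.1):
    # Hypothetical linear decay of the softmax temperature over training.
    frac = min(epoch / max(max_epochs - 1, 1), 1.0)
    return t_start + frac * (t_end - t_start)

def attention_weights(scores, temperature):
    # Lower temperature -> sharper (more confident) attention distribution.
    return torch.softmax(scores / temperature, dim=-1)
```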
git clone https://github.com/nerovalerius/AttentiveCylinder3D.git
conda install python=3.9.2 numpy tqdm pyyaml numba strictyaml -c conda-forge
wget https://developer.download.nvidia.com/compute/cuda/11.7.0/local_installers/cuda_11.7.0_515.43.04_linux.run
sudo sh cuda_11.7.0_515.43.04_linux.run
conda install pytorch torchvision torchaudio cudatoolkit=11.6 -c pytorch -c conda-forge
Version "cu116" was not available at the time, however cu114 also works.
pip install spconv-cu114
pip install torch-sparse -f https://data.pyg.org/whl/torch-1.12.0%2Bcu116.html
pip install torch-scatter -f https://data.pyg.org/whl/torch-1.12.0%2Bcu116.html
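After installing, a quick sanity check can confirm that PyTorch sees the GPU and that the sparse packages import correctly. This is just a convenience snippet, not part of the repository:

```python
import torch
import spconv.pytorch as spconv  # spconv 2.x import path
import torch_scatter
import torch_sparse

print("torch:", torch.__version__, "| CUDA available:", torch.cuda.is_available())
print("torch_scatter:", torch_scatter.__version__, "| torch_sparse:", torch_sparse.__version__)
```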
- nuScenes-devkit (optional for nuScenes)
I strongly recommend using Docker. The container mounts a workspace into which this git repository should be cloned.
Compared to the package versions listed in this README, the Docker image uses somewhat newer versions.
Adapt the workspace path inside build_docker.sh and then run sh build_docker.sh.
Simply run ./start_docker.sh to get the Docker container up and running. Afterwards, inside the container, install the Minkowski Engine with ./install_minkowsk.sh.
Then you can either start Jupyter notebooks with ./start_jupyter.sh, or convert the notebook to a Python file for training without Jupyter:
`jupyter nbconvert train_attentivecylinder3d.ipynb --to python`
./
├── ...
└── path_to_data_shown_in_config/
    └── sequences/
        ├── 00/
        │   ├── velodyne/
        │   │   ├── 000000.bin
        │   │   ├── 000001.bin
        │   │   └── ...
        │   └── labels/
        │       ├── 000000.label
        │       ├── 000001.label
        │       └── ...
        ├── 08/   # for validation
        ├── 11/   # 11-21 for testing
        └── 21/
            └── ...
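For reference, each velodyne .bin file stores the scan as a flat float32 array of (x, y, z, remission) values, and each .label file stores one uint32 per point whose lower 16 bits hold the semantic label. A minimal reader sketch (file paths are placeholders):

```python
import numpy as np

def load_semantickitti_scan(bin_path, label_path=None):
    # Points are stored as a flat float32 array: x, y, z, remission.
    points = np.fromfile(bin_path, dtype=np.float32).reshape(-1, 4)
    labels = None
    if label_path is not None:
        raw = np.fromfile(label_path, dtype=np.uint32)
        labels = raw & 0xFFFF  # lower 16 bits: semantic label id
    return points, labels

points, labels = load_semantickitti_scan(
    "dataset/sequences/00/velodyne/000000.bin",
    "dataset/sequences/00/labels/000000.label",
)
```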
./
├── ...
└── path_to_data_shown_in_config/
    ├── v1.0-trainval
    ├── v1.0-test
    ├── samples
    ├── sweeps
    └── maps
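If you work with nuScenes, the optional nuScenes-devkit mentioned above can be used to verify that the data root is laid out correctly; instantiating the devkit fails early otherwise. The dataroot path below is a placeholder:

```python
from nuscenes.nuscenes import NuScenes

# Loads the metadata tables and checks the folder layout shown above.
nusc = NuScenes(version="v1.0-trainval", dataroot="path_to_data_shown_in_config", verbose=True)
print("Number of scenes:", len(nusc.scene))
```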
Use sh run_docker to start an interactive Docker container.
There is also a script that starts a JupyterLab instance on port 12212: run sh start_jupyter.sh inside the container from workspace/attentivecylinder3d/ and open the URL in your (remote) browser.
The main file to work with is train_cylinder_asym_jupyter.ipynb.
python train_attentivecylinder3d.py
- Modify config/semantickitti.yaml with your custom settings; a sample YAML for SemanticKITTI is provided (a sketch for inspecting it follows below).
- Train the network by running python train_attentivecylinder3d.py
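Before editing the config, it can help to inspect which top-level sections the sample file exposes; a minimal sketch that makes no assumptions about the actual key names:

```python
import yaml

# Print the top-level sections of the sample config before editing paths.
with open("config/semantickitti.yaml") as f:
    cfg = yaml.safe_load(f)

for section, value in cfg.items():
    print(section, "->", type(value).__name__)
```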
Please refer to NUSCENES-GUIDE
- SemanticKITTI: LINK1 or LINK2 (access code: xqmi)
- For the nuScenes dataset, please refer to NUSCENES-GUIDE
Set the correct model folders for saving and loading inside config/semantickitti.yaml.
python demo_folder.py --demo-folder YOUR_FOLDER --save-folder YOUR_SAVE_FOLDER
If you want to validate with your own dataset, you need to provide labels via --demo-label-folder; otherwise this argument is optional.
python demo_folder.py --demo-folder YOUR_FOLDER --save-folder YOUR_SAVE_FOLDER --demo-label-folder YOUR_LABEL_FOLDER
python demo_folder.py --demo-folder ../dataset/sequences/00/velodyne/ --demo-label-folder ../dataset/sequences/00/labels/ --save-folder save_folder/
python demo_folder.py --demo-folder /home/nero/master/dataset/sequences/00/velodyne/ --save-folder save_folder/
python demo_folder.py --demo-folder /home/nero/semanticKITTI/dataset/sequences/00/velodyne/ --save-folder save_folder/ --demo-label-folder /home/nero/semanticKITTI/dataset/sequences/00/labels/
git clone https://github.com/PRBonn/semantic-kitti-api.git
Run ./content.py --directory dataset/ to obtain statistics about the labels inside the dataset. Adapt the semantic-kitti-api/config/semantic-kitti.yaml file beforehand, or use the sbld.yaml file in this repo at attentivecylinder3d/config/label_mapping/sbld.yaml. The train/test split defined in the YAML must match your train/test folder structure, if you have one. A sketch of how such a label mapping is typically applied follows below.
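For context, these label-mapping YAML files typically contain a learning_map dictionary that remaps raw SemanticKITTI label ids to a compact set of training classes. A minimal sketch of applying such a mapping with a lookup table; the learning_map key is taken from the standard semantic-kitti.yaml, and the file paths are placeholders:

```python
import numpy as np
import yaml

with open("semantic-kitti-api/config/semantic-kitti.yaml") as f:
    mapping_cfg = yaml.safe_load(f)

# Build a lookup table from raw label id -> training class id.
learning_map = mapping_cfg["learning_map"]
lut = np.zeros(max(learning_map.keys()) + 100, dtype=np.int64)  # small buffer for unseen ids
for raw_id, train_id in learning_map.items():
    lut[raw_id] = train_id

raw_labels = np.fromfile("dataset/sequences/00/labels/000000.label", dtype=np.uint32) & 0xFFFF
train_labels = lut[raw_labels]
```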
This network mainly builds upon Cylinder3D by Zhu et al.
If you find their work useful in your research, please consider citing their paper:
@article{zhu2020cylindrical,
title={Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation},
author={Zhu, Xinge and Zhou, Hui and Wang, Tai and Hong, Fangzhou and Ma, Yuexin and Li, Wei and Li, Hongsheng and Lin, Dahua},
journal={arXiv preprint arXiv:2011.10033},
year={2020}
}
Furthermore, this work uses the transformer blocks from CodedVTR, which in turn builds on SpatioTemporalSegmentation-ScanNet.
@inproceedings{zhao2022codedvtr,
title={CodedVTR: Codebook-based Sparse Voxel Transformer with Geometric Guidance},
author={Zhao, Tianchen and Zhang, Niansong and Ning, Xuefei and Wang, He and Yi, Li and Wang, Yu},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={1435--1444},
year={2022}
}