Skip to content

Latest commit

 

History

History
152 lines (128 loc) · 6.48 KB

MODEL_ZOO.md

File metadata and controls

152 lines (128 loc) · 6.48 KB

RbA Model Zoo

Introduction

In this file we provide download links for the models used to produce results in our paper, as well as their respective config files and initialization weights. The training was done using the KUACC HPC cluster.

The configs for the pretrained models are provided under ckpts/ folder, under which there are multiple folders and each folder represents a separate model, when downloading the checkpoints you should place the model_final.pth in its corresponding model_folder. ckpts/ would like something like this:

ckpts/
  model_1/
    config.yaml
    model_final.pth
  model_2/
    config.yaml
    model_final.pth
  ...

Pretrained Models for Initialization

It's common to initialize from backbone models pre-trained on ImageNet classification tasks. The following backbone models are available through the original Mask2Former repo:

  • R-50.pkl (torchvision): converted copy of torchvision's ResNet-50 model. More details can be found in the conversion script.
  • R-103.pkl: a ResNet-101 with its first 7x7 convolution replaced by 3 3x3 convolutions. This modification has been used in most semantic segmentation papers (a.k.a. ResNet101c in our paper). We pre-train this backbone on ImageNet using the default recipe of pytorch examples.
  • Swin-{B-L}: pretrained on ImageNet, details on how to download them models and transform them to detectron2 format can be found here.

Note: according the config files we provide, it is assumed that the pretrained checkpoints are placed under the pretrained/ folder. Make sure they are either placed there or that the config file is overwritten with the desired path when running an experiment.

Important Update: Model weights for the models in the tables below are now available as a github release in the repo.

Cityscapes Inlier Training

These models are trained on cityscapes dataset only and are used as a starting point for finetuning with outlier supervision.

Road Anomaly FS LaF
Backbone Cityscapes mIoU AP FPR95 AP FPR95 config ckpt
Swin-B (1 dec layer) 82.25 78.45 11.83 60.96 10.63 config model
Swin-L (1 dec layer) 82.65 79.68 15.02 58.61 71.79 config model

RbA + COCO Outlier Supervision

These models are finetuned from the checkpoints which were trained using Cityscapes dataset only.

Road Anomaly FS LaF
Backbone Finetuned Component AP FPR95 AP FPR95 config ckpt
Swin-B (1 dec layer) MLP 85.42 6.92 70.81 6.30 config model

RbA + Mapillary + COCO Outlier Supervision

These models are finetuned from the checkpoints used for training Cityscapes. During finetuning they are exposed to images from Mapillary as well.

Road Anomaly FS LaF
Backbone Finetuned Component AP FPR95 AP FPR95 config ckpt
Swin-B (1 dec layer) MLP 89.16 4.50 78.27 3.98 config model
Swin-L (1 dec layer) MLP 90.28 4.92 80.35 4.58 config model