Skip to content

Latest commit

 

History

History
 
 

coco

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

COCO

Introduction

Introduced by Lin et al. in Microsoft COCO: Common Objects in Context

The MS COCO (Microsoft Common Objects in Context) dataset is a large-scale object detection, segmentation, key-point detection, and captioning dataset. The dataset consists of 328K images.

Splits: The first version of MS COCO dataset was released in 2014. It contains 164K images split into training (83K), validation (41K) and test (41K) sets. In 2015 additional test set of 81K images was released, including all the previous test images and 40K new images.

Based on community feedback, in 2017 the training/validation split was changed from 83K/41K to 118K/5K. The new split uses the same images and annotations. The 2017 test set is a subset of 41K images of the 2015 test set. Additionally, the 2017 release contains a new unannotated dataset of 123K images.

Model Zoo

Mask R-CNN + InternImage

backbone schd box mAP mask mAP train speed train time #param FLOPs Config Download
InternImage-T 1x 47.2 42.5 0.36s / iter 9h 49M 270G config ckpt | log
InternImage-T 3x 49.1 43.7 0.34s / iter 26h 49M 270G config ckpt | log
InternImage-S 1x 47.8 43.3 0.40s / iter 10h 69M 340G config ckpt | log
InternImage-S 3x 49.7 44.5 0.40s / iter 30h 69M 340G config ckpt | log
InternImage-B 1x 48.8 44.0 0.45s / iter 11.5h 115M 501G config ckpt | log
InternImage-B 3x 50.3 44.8 0.45s / iter 34h 115M 501G config ckpt | log
  • Training speed is measured with A100 GPUs using current code and may be faster than the speed in logs.
  • Some logs are our recent newly trained ones. There might be slight differences between the results in logs and our paper.
  • Please set with_cp=True to save memory if you meet out-of-memory issues.

Cascade Mask R-CNN + InternImage

backbone schd box mAP mask mAP train speed train time #param FLOPs Config Download
InternImage-L 1x 54.9 47.7 0.73s / iter 18h 277M 1399G config ckpt
InternImage-L 3x 56.1 48.5 0.79s / iter 15h (4n) 277M 1399G config ckpt | log
InternImage-XL 1x 55.3 48.1 0.82s / iter 21h 387M 1782G config ckpt | log
InternImage-XL 3x 56.2 48.8 0.91s / iter 17h (4n) 387M 1782G config ckpt | log
  • Training speed is measured with A100 GPUs using current code and may be faster than the speed in logs.
  • Some logs are our recent newly trained ones. There might be slight differences between the results in logs and our paper.
  • Please set with_cp=True to save memory if you meet out-of-memory issues.