Skip to content

Latest commit

 

History

History

IRREGULAR_DATA

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 

Overview

Dataset Train Validation Test Character-Level Annotation Word-Level Annotation Line-Level Annotation
Total-Text 1255 No 300 Yes Yes (Polygon) No
SCUT-CTW1500 1000 No 500 No No Yes (Polygon)
Uber-Text 59,001 23,606 35,362 No No Yes (Polygon)

Total-Text

Total-Text_demo

Demo images of Total-Text dataset.

The Total-Text dataset is a more comprehensive dataset than the existing text datasets. The Total-Text consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.

SCUT-CTW1500

SCUT-CTW1500_demo

Demo images of SCUT-CTW1500 dataset.

The CTW1500 dataset contains 1500 images, with 10,751 bounding boxes (3,530 are curve bounding boxes) and at least one curve text per image. The images are manually harvested from internet, image library like google Open-Image and private data collected by phone cameras, which also contain lots of horizontal and multi-oriented text. The distribution of the images is various, containing indoor, outdoor, born digital, blurred, perspective distortion texts and so on. In addition, the dataset is multi-lingual with mainly Chinese and English text.

Uber-Text

Uber-Text_demo

Demo images of Uber-Text dataset.

Uber-Text is a large-scale OCR dataset which contains street-level images collected from car mounted sensors and truths annotated by a team of image analysts. The characteristics of the dataset include (1) streetside images with their text region polygons and the corresponding transcriptions, (2) 9 categories indicating the business name text, street name text and street number text, etc, (3) a set containing over 110k images, (4) 4.84 text instances per image on average.