A collection of table detection modules designed to accurately detect tables across various document types, such as invoices, scientific papers, and other complex layouts.
This repository provides multiple table detection models to efficiently locate and identify tables within documents of diverse formats. We aim to continuously update and improve each module. Our goal is to establish a cohesive framework that allows all models to be seamlessly called from a unified script.
Our modules currently include the following table detection models:
- YOLO - You Only Look Once, a real-time object detection system
- DETR - DEtection TRansformer for high-quality table detection
- LayoutLMv3 - multimodal pre-trained Transformer for document AI with unified text and image modeling
- CascadeTabNet - Cascade Mask R-CNN-based model optimized for table structure recognition
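The unified-script goal mentioned above could be sketched as a small registry of detector wrappers behind a common interface. The class, function, and registry names below are illustrative, not part of this repository:

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import List


@dataclass
class TableBox:
    # Axis-aligned table bounding box in pixel coordinates, plus confidence.
    x1: float
    y1: float
    x2: float
    y2: float
    score: float


class TableDetector(ABC):
    """Common interface every model wrapper would implement."""

    @abstractmethod
    def detect(self, image_path: str) -> List[TableBox]:
        ...


class YOLODetector(TableDetector):
    def detect(self, image_path: str) -> List[TableBox]:
        # Placeholder: a real wrapper would load a YOLO checkpoint
        # and run inference on the page image here.
        raise NotImplementedError


# One entry per supported model; the other wrappers would register here too.
REGISTRY = {"yolo": YOLODetector}


def get_detector(name: str) -> TableDetector:
    # Look up a detector class by name and instantiate it, so a single
    # script can switch models via a command-line argument.
    return REGISTRY[name.lower()]()
```

With this shape, a unified script only needs `get_detector(args.model).detect(args.image)` regardless of which backend is selected.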
Our models are primarily trained on DocLayNet, a comprehensive human-annotated document layout segmentation dataset with 80,863 pages from a wide variety of sources. Additional testing has been conducted on complex proprietary medical documents to validate robustness and versatility across domains.
Model | Train Dataset | Test Dataset | mAP50 | mAP50:95 |
---|---|---|---|---|
DETR | DocLayNet | DocLayNet | 0.94 | 0.83 |
LayoutLMv3 | DocLayNet | DocLayNet | 0.91 | 0.86 |
CascadeTabNet | DocLayNet | DocLayNet | 0.90 | 0.82 |
YOLO | DocLayNet | DocLayNet | 0.92 | 0.88 |
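The two metric columns differ in how strictly a detection must overlap the ground truth: mAP50 counts a predicted table as correct when its intersection-over-union (IoU) with a ground-truth box is at least 0.5, while mAP50:95 averages the precision over IoU thresholds from 0.50 to 0.95 in steps of 0.05, so it rewards tighter boxes. A minimal self-contained IoU sketch (illustrative, not repository code):

```python
def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union else 0.0


# A slightly shifted prediction: IoU ≈ 0.68, so it counts as a hit at the
# 0.5 threshold (mAP50) but fails the stricter thresholds above ~0.68
# that mAP50:95 also averages over.
print(iou((1, 1, 11, 11), (0, 0, 10, 10)))
```

This is why a model can lead on mAP50 yet trail on mAP50:95, as DETR and YOLO do in the table above.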
The results below are from models that were first trained on DocLayNet and then fine-tuned on proprietary data.
Model | Train Dataset | Test Dataset | mAP50 | mAP50:95 |
---|---|---|---|---|
DETR | Proprietary | Proprietary | 0.82 | 0.57 |
LayoutLMv3 | Proprietary | Proprietary | 0.91 | 0.74 |
CascadeTabNet | Proprietary | Proprietary | 0.76 | 0.52 |
YOLO | Proprietary | Proprietary | 0.90 | 0.83 |
YOLO | Proprietary | Proprietary 2 | 0.93 | 0.83 |
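For the YOLO model, the two-stage recipe behind the table above (DocLayNet pretraining, then fine-tuning on proprietary data) could look like the sketch below. The checkpoint and dataset file names are hypothetical, and the `ultralytics` calls are an assumed dependency, shown under a guard so the sketch stays importable:

```python
def finetune_args(data_yaml: str, epochs: int = 50, lr0: float = 1e-3) -> dict:
    # Hyperparameters for the fine-tuning stage. A lower initial learning
    # rate than pretraining helps preserve the DocLayNet features while
    # adapting to the new domain; values here are illustrative defaults.
    return {"data": data_yaml, "epochs": epochs, "lr0": lr0, "imgsz": 640}


if __name__ == "__main__":
    from ultralytics import YOLO  # assumed dependency

    # Hypothetical checkpoint from the DocLayNet pretraining stage.
    model = YOLO("doclaynet_pretrained.pt")
    model.train(**finetune_args("proprietary.yaml"))
```

The gap between mAP50 and mAP50:95 on the proprietary sets suggests box localization, not table recall, is where fine-tuning quality varies most between models.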
Special thanks to the following resources and frameworks that have significantly supported our development:
For questions, feedback, or collaboration opportunities, please feel free to reach out: