OCR Text Extraction Tool

Overview

This repository hosts an open-source OCR text extraction tool that utilizes three different OCR technologies: EasyOCR, PyTesseract, and TrOCR. It is designed to extract text from images, analyze the results for discrepancies, and use the Levenshtein distance as the consensus algorithm to determine the most accurate text extraction, which is then stored in a JSON file.

Features

Three independent text extraction pipelines using EasyOCR, PyTesseract, and TrOCR.
Analysis of OCR results to identify and resolve discrepancies.
Consensus-based approach using Levenshtein distance for determining the most accurate text extraction.
JSON output of the final, correct text for each image.

Requirements

Python 3.11
Poetry for dependency management

Installation

To set up the project environment:

poetry install

Usage

To run the OCR pipeline:

python -m src.ocr_extraction.main pytesseract
python -m src.ocr_extraction.main easyocr
python -m src.ocr_extraction.main trocr

Run the consensus pipeline to generate the final JSON

python -m src.ocr_extraction.main consensus

Output

The final output will be a JSON file located in data/ocr_results.json with the following structure:

[
    {
    "image_name": "image1.jpg",
    "text": "Extracted text content."
    },
    ...
]

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
data		data
src/ocr_extraction		src/ocr_extraction
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OCR Text Extraction Tool

Overview

Features

Requirements

Installation

Usage

Output

About

Releases

Packages

Languages

zarifaziz/ocr-consensus

Folders and files

Latest commit

History

Repository files navigation

OCR Text Extraction Tool

Overview

Features

Requirements

Installation

Usage

Output

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages