Digitization of Handwritten Answer Script

Problem Statement Overview

The task involves developing a deep learning model to facilitate the digitization of handwritten answer scripts. Traditional methods of grading involve manual assessment by teachers, which can be time-consuming and prone to errors. Automating this process offers advantages such as faster grading, consistency, and scalability.

Directory Structure

File/Folder	Description
`handwritten-answer-scripts-digitization.ipynb`	Jupyter Notebook containing the main project code.
line-segmentation	Directory containing notebook related to line segmentation.
passage-identification	Directory containing notebook related to passage identification.
spelling-correction	Directory containing notebook related to spelling correction.
word-recognition	Directory containing notebook related to word recognition.
word-segmentation	Directory containing notebook related to word segmentation.
report.pdf	Project report.

Instructions

To view the outputs, import the notebook handwritten-answer-scripts-digitization.ipynb in Kaggle (with GPUs).
To view the notebook in Google Colab, click here.

Dataset

We utilize the IAM Handwriting Database, which contains 13,353 images of handwritten lines of text from 657 writers. The dataset is labeled at the sentence, line, and word levels, making it suitable for training our models.

Solution Strategy

Passage Identification: Predicting the location of handwritten passages using a fine-tuned ResNet34 model.
Line Segmentation: Segmenting passages into individual lines with a U-Net architecture.
Word Segmentation: Segmenting lines into words using a Faster RCNN ResNet50 FPN model.
Word Recognition: Recognizing text from word images with a (CNN + RNN) based model.

Innovations

Employing computationally efficient architectures.
Utilizing U-Net for line segmentation.
Generating datasets using computer vision techniques.

Challenges

Addressing issues with the CTC loss metric for line text recognition.
Implementing a spelling checker for recognized text.

Results

The model achieved a Character Error Rate (CER) of 0.3449 on a test set of 1000 images.

Conclusion

This project successfully addresses the challenge of digitizing handwritten answer scripts using deep learning techniques. Further improvements could be made by utilizing larger and more diverse datasets.

Authors

Akriti Gupta (B.Tech. Artificial Intelligence & Data Science)
Sagnik Goswami (B.Tech. Artificial Intelligence & Data Science)
Tanish Pagaria (B.Tech. Artificial Intelligence & Data Science)
Uppala Giridhar (B.Tech. Electrical Engineering)

(IIT Jodhpur Undergraduates)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Digitization of Handwritten Answer Script

Problem Statement Overview

Directory Structure

Instructions

Dataset

Solution Strategy

Innovations

Challenges

Results

Conclusion

Authors

About

Releases

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
line-segmentation		line-segmentation
passage identification		passage identification
spelling-correction		spelling-correction
word-recognition		word-recognition
word-segmentation		word-segmentation
README.md		README.md
handwritten-answer-scripts-digitization.ipynb		handwritten-answer-scripts-digitization.ipynb
report.pdf		report.pdf

sagnikCodes/digitization-of-handwritten-answer-scripts

Folders and files

Latest commit

History

Repository files navigation

Digitization of Handwritten Answer Script

Problem Statement Overview

Directory Structure

Instructions

Dataset

Solution Strategy

Innovations

Challenges

Results

Conclusion

Authors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages