The task involves developing a deep learning model to facilitate the digitization of handwritten answer scripts. Traditional methods of grading involve manual assessment by teachers, which can be time-consuming and prone to errors. Automating this process offers advantages such as faster grading, consistency, and scalability.
File/Folder | Description |
---|---|
handwritten-answer-scripts-digitization.ipynb |
Jupyter Notebook containing the main project code. |
line-segmentation | Directory containing notebook related to line segmentation. |
passage-identification | Directory containing notebook related to passage identification. |
spelling-correction | Directory containing notebook related to spelling correction. |
word-recognition | Directory containing notebook related to word recognition. |
word-segmentation | Directory containing notebook related to word segmentation. |
report.pdf | Project report. |
- To view the outputs, import the notebook
handwritten-answer-scripts-digitization.ipynb
in Kaggle (with GPUs). - To view the notebook in Google Colab, click here.
We utilize the IAM Handwriting Database, which contains 13,353 images of handwritten lines of text from 657 writers. The dataset is labeled at the sentence, line, and word levels, making it suitable for training our models.
- Passage Identification: Predicting the location of handwritten passages using a fine-tuned ResNet34 model.
- Line Segmentation: Segmenting passages into individual lines with a U-Net architecture.
- Word Segmentation: Segmenting lines into words using a Faster RCNN ResNet50 FPN model.
- Word Recognition: Recognizing text from word images with a (CNN + RNN) based model.
- Employing computationally efficient architectures.
- Utilizing U-Net for line segmentation.
- Generating datasets using computer vision techniques.
- Addressing issues with the CTC loss metric for line text recognition.
- Implementing a spelling checker for recognized text.
The model achieved a Character Error Rate (CER) of 0.3449 on a test set of 1000 images.
This project successfully addresses the challenge of digitizing handwritten answer scripts using deep learning techniques. Further improvements could be made by utilizing larger and more diverse datasets.
- Akriti Gupta (B.Tech. Artificial Intelligence & Data Science)
- Sagnik Goswami (B.Tech. Artificial Intelligence & Data Science)
- Tanish Pagaria (B.Tech. Artificial Intelligence & Data Science)
- Uppala Giridhar (B.Tech. Electrical Engineering)
(IIT Jodhpur Undergraduates)