A proof-of-concept project for converting scanned historical documents into searchable databases using CNNs.
A company providing ancestry search services needs to process handwritten documents. This project demonstrates how a CNN can identify scanned characters in the MNIST database with 60,000 training samples and 10,000 test samples.
- Python
- TensorFlow
- Keras
- Jupyter Notebook
- Machine Learning
- Image Recognition
- Data Science
Here's the link to the NIST Special Database 19, which contains a large collection of handwritten characters:
You can visit the provided link to access the dataset and find more information about it on the NIST website.
Feel free to check out the complete project on my GitHub repository!
Link: GitHub Repository
Contributions to this project are welcome! If you have any suggestions, improvements, or bug fixes, please submit a pull request. Let's collaborate and make this project even better together!