Welcome to TxtfyWeb, an OCR-based text recognition system! TxtfyWeb is an OCR tool that converts scanned images into editable text and exports the result as PDFs for easy sharing and storage. It also provides text editing, so users can correct recognition errors and adjust content. Together, these features make TxtfyWeb a useful tool for students, professionals, and anyone who works with documents online.
Before you can start using this project, you need to set up your environment. Follow these steps for installation and configuration:
Install Python:
If you don't already have Python installed on your system, download it from the official Python website.
Choose the appropriate version for your operating system (recommended: Python 3.12.0).
Install Project Dependencies:
- Open your terminal or command prompt.
- Navigate to the project directory using the cd command.
- Run the following command to install the required Python libraries from the provided requirements.txt file:
pip install -r requirements.txt
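After the install finishes, a quick check like the one below can confirm that the key packages are importable. This is a minimal sketch: the module names are assumptions based on the libraries this README mentions (Django, OpenCV, and a Tesseract wrapper such as pytesseract), not a copy of requirements.txt, so adjust the list to match the actual file. The helper name missing_modules is hypothetical.

```python
import importlib.util
import sys

# Assumed module names; requirements.txt is the authoritative list.
ASSUMED_MODULES = ["django", "cv2", "pytesseract"]


def missing_modules(names):
    """Return the module names from `names` that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    print("Python version:", sys.version.split()[0])
    missing = missing_modules(ASSUMED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All assumed modules are importable.")
```

If any module is reported as missing, rerunning the pip command above inside the same Python environment is usually the first thing to try.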
Download External Executable Files:
- For Windows:
To proceed with this project, you will need two external executable files. Download them from the following links:
- For macOS:
To proceed with this project on macOS, you can use Homebrew to easily install Tesseract. If you don't have Homebrew installed, you can get it from Homebrew's official website.
Open your terminal and run the following command to install Tesseract:
brew install tesseract
Setup Process:
After downloading the executable files, follow these steps for the setup process:
- Run the setup files to initiate the installation process.
- Follow the on-screen instructions, and do not change the default installation location on your computer.
Once you've completed these installation steps, your environment should be ready to use the project. You can now launch the application and start utilizing its features as described in the project description.
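To confirm that Tesseract is reachable, a small script along these lines can help. It is only a sketch and checks the system PATH; on Windows the installer may not add Tesseract to PATH, and the project may instead rely on the default installation location (an assumption, which would explain the advice not to change it). The helper name find_tesseract is hypothetical.

```python
import shutil
import subprocess


def find_tesseract(executable="tesseract"):
    """Return the full path of the Tesseract binary on PATH, or None."""
    return shutil.which(executable)


if __name__ == "__main__":
    path = find_tesseract()
    if path is None:
        print("tesseract was not found on PATH.")
    else:
        # `tesseract --version` prints the installed version and exits.
        result = subprocess.run([path, "--version"], capture_output=True, text=True)
        # Some builds write the version banner to stderr instead of stdout.
        print((result.stdout or result.stderr).strip())
```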
git clone https://github.com/ritwikdurga/TxtfyWeb
cd TxtfyWeb
- Follow the installation process as mentioned above.
python manage.py runserver
Open http://localhost:8000 in your browser.
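If the page does not load, it can help to check from a second terminal whether the development server is answering at all. The following is a minimal sketch using only the standard library; the function name server_is_up is hypothetical, and the URL assumes the server runs on port 8000 as described above.

```python
import urllib.error
import urllib.request


def server_is_up(url="http://localhost:8000/", timeout=3.0):
    """Return True if an HTTP server answers at `url`, even with an error status."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server responded, just not with a success status.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or name lookup failure.
        return False


if __name__ == "__main__":
    print("Server reachable:", server_is_up())
```

A result of False usually means the runserver command is not running or is listening on a different port.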
Client: HTML, CSS, JavaScript
Server: Python, Django
- Text recognition: Accurately converts images containing text into editable text format.
- PDF generation: Easily converts recognized text into PDF files for convenient sharing and storage.
- Text editing: Provides robust text editing capabilities to correct recognition errors and make immediate content adjustments.
- Template selection: Offers a variety of templates to enhance the visual appearance of extracted text.
- User management: Allows users to create accounts, sign in, and manage their saved projects.
- Profile management: Enables users to view and edit their profile information.
- Password reset: Offers a way to reset passwords.
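To illustrate the text recognition feature, here is a hedged sketch of how an image could be turned into text with OpenCV and Tesseract. It is not taken from the TxtfyWeb source: the use of the pytesseract wrapper, the Otsu thresholding step, and the function names recognize_text and tidy_text are all assumptions made for illustration.

```python
def tidy_text(raw):
    """Strip trailing spaces and collapse runs of blank lines in OCR output."""
    lines = [line.rstrip() for line in raw.splitlines()]
    tidy = []
    for line in lines:
        # Skip a blank line if the previously kept line is also blank.
        if line == "" and tidy and tidy[-1] == "":
            continue
        tidy.append(line)
    return "\n".join(tidy).strip()


def recognize_text(image_path):
    """Run Tesseract on an image file and return the tidied text."""
    # Imported here so tidy_text stays usable without the OCR dependencies.
    import cv2
    import pytesseract

    image = cv2.imread(image_path)
    if image is None:
        raise FileNotFoundError(f"Could not read image: {image_path}")
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    # Otsu's method picks the binarization threshold automatically.
    _, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    return tidy_text(pytesseract.image_to_string(binary))
```

Calling recognize_text with the path of a scanned page would return a string that could then be passed to an editor or a PDF export step; how TxtfyWeb actually wires these stages together is not shown here.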
- New users: Create an account by clicking "Sign Up."
- Existing users: Sign in with your credentials.
- Upload images by dragging and dropping them onto the designated area.
- View and manage your recent projects in the right section.
- Click a recent file to open a pop-up preview.
- Options: Open and Edit, Close Preview, Delete Project.
- Features: Add Images, Templates, Download, Save & Copy, Back to Home, CKEditor, Preview Feature.
- Explore and select templates from the "Templates" section.
- Preview and save your preferred template.
- View and edit your name.
- Change your password by clicking "Change Password."
- Log out by clicking "Logout" on the home page.
Tesseract and OpenCV: We were inspired by the open-source OCR library Tesseract and the image processing capabilities of OpenCV, which greatly contributed to our text recognition features.
We are grateful for the contributions of the authors and communities behind the following external libraries that were essential in the development of TxtfyWeb:
- Django: We used Django, a robust web framework, to build the backend of our project.
- SQLite: SQLite served as our database for data storage.