This project is a language model training pipeline for legal text analysis.
- Tokenizer and dataset preparation
- Model training and evaluation
- Text generation
- Python 3.8+
- See
requirements.txt
for dependencies.
- Clone the repository.
- Create a virtual environment:
python -m venv venv source venv/bin/activate # For Linux/Mac venv\Scripts\activate # For Windows
To install the required dependencies for this project, run the following command:
pip install -r requirements.txt