Skip to content

Codebase for developing a miniaturized GPT model, following GPT-2 architecture

License

Notifications You must be signed in to change notification settings

nutanixdev/nugpt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

nugpt

This repo includes pertinent codebase for developing a miniaturized GPT model. It follows GPT-2 architecture.

Commands

Data Engineering

  • Include text data
  • run: python data/data/prepare.py

Model Training

  • python train.py --batch_size=32 --wandb_log=True

Inference

  • python sample.py --out_dir=out-wiki

About

Codebase for developing a miniaturized GPT model, following GPT-2 architecture

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages