EasyLM

Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax. EasyLM can scale LLM training up to hundreds of TPU/GPU accelerators by leveraging JAX's pjit functionality.

Building on top of Hugging Face's transformers and datasets, this repo provides an easy-to-use and easy-to-customize codebase for training large language models without the complexity found in many other frameworks.
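For context, the snippet below shows the kind of Hugging Face building blocks this refers to. It is an illustrative sketch only; the dataset name and tokenizer are arbitrary examples, not EasyLM defaults.

from datasets import load_dataset
from transformers import AutoTokenizer

# Stream a text dataset and tokenize one example (arbitrary choices for illustration).
dataset = load_dataset("openwebtext", split="train", streaming=True)
tokenizer = AutoTokenizer.from_pretrained("gpt2")
example = next(iter(dataset))
token_ids = tokenizer(example["text"])["input_ids"]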

EasyLM is built with JAX/Flax. By leveraging JAX's pjit utility, EasyLM can train large models that do not fit on a single accelerator by sharding the model weights and training data across multiple accelerators. EasyLM currently supports multi-TPU/GPU training on a single host as well as multi-host training on Google Cloud TPU Pods.
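The snippet below is a minimal, self-contained sketch of this idea using JAX's public sharding API (illustrative only, not EasyLM's actual training code); the mesh axis name, array shapes, and partition specs are arbitrary choices for the example.

import numpy as np
import jax
import jax.numpy as jnp
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P

# Build a 1D mesh over all local accelerators; the axis name is arbitrary.
mesh = Mesh(np.array(jax.devices()), axis_names=("model",))

# Shard the weight matrix's second dimension across the "model" axis and
# replicate the activations on every device.
w = jax.device_put(jnp.ones((1024, 4096)), NamedSharding(mesh, P(None, "model")))
x = jax.device_put(jnp.ones((8, 1024)), NamedSharding(mesh, P()))

@jax.jit
def forward(x, w):
    # XLA runs the matmul on the sharded pieces and keeps the output sharded.
    return x @ w

y = forward(x, w)
print(y.shape, y.sharding)

Applying the same mechanism to every weight and optimizer state lets a model that does not fit on one accelerator be spread across many.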

Currently, the following models are supported:

Koala

Koala is our new chatbot fine-tuned on top of LLaMA. If you are interested in the Koala chatbot, check out the blog post and the documentation for running it locally.

Installation

The installation method differs between GPU hosts and Cloud TPU hosts. In both cases, first clone the repository from GitHub and add it to your PYTHONPATH.

git clone https://github.com/young-geng/EasyLM.git
cd EasyLM
export PYTHONPATH="${PWD}:$PYTHONPATH"

Installing on GPU Host

The GPU environment can be installed via Anaconda.

conda env create -f scripts/gpu_environment.yml
conda activate EasyLM

Installing on Cloud TPU Host

The TPU host VM comes with Python and pip pre-installed. Simply run the following script to set up the TPU host.

./scripts/tpu_vm_setup.sh

The EasyLM documentation can be found in the docs directory.

Credits

  • The LLaMA implementation is from JAX_llama
  • The JAX/Flax GPT-J and RoBERTa implementations are from transformers
  • Most of the JAX utilities are from mlxu
  • The codebase is heavily inspired by JAXSeq
