This repository contains the exercises for the Center for Digital Trust (C4DT) workshop "Pitfalls in finetuning LLMs", held in summer 2024.
The slides for the morning presentation are here: Morning Presentation
The slides for the afternoon exercises are here: Afternoon Slides
The repository contains 3 notebooks with exercises:
- introduction.ipynb (familiarization with the tools and datasets)
- evaluation.ipynb (introduction to evaluation)
- finetuning.ipynb (pitfalls in finetuning)
It also contains 2 modules, share.py and evaluation.py, that provide the evaluation metrics, helper functions, and pre-defined variables.
The following metrics are used:
- precision, recall and F1 score for classification (see the sketch after this list)
- perplexity for text generation
- extraction, a metric introduced in [4] and used in [3] to evaluate the vulnerability of a secret to being extracted from the training data
- harmfulness, a metric introduced in [1]/[2] that uses calls to ChatGPT to judge how harmful the generated responses are
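As an illustration of the classification metrics, here is a minimal sketch assuming predictions and labels are given as lists of class ids; the actual implementation lives in evaluation.py and may differ in its details.

```python
# Minimal sketch of the classification metrics (precision, recall, F1) using
# scikit-learn. Assumption: predictions and references are lists of integer
# class labels (e.g. spam = 1, ham = 0); the workshop's own implementation
# lives in evaluation.py and may differ.
from sklearn.metrics import precision_recall_fscore_support

def classification_metrics(predictions, references):
    precision, recall, f1, _ = precision_recall_fscore_support(
        references, predictions, average="binary"
    )
    return {"precision": precision, "recall": recall, "f1": f1}

print(classification_metrics(predictions=[1, 0, 1, 1], references=[1, 0, 0, 1]))
```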
The repository also contains a notebook with the results for the models we benchmarked:
- the (Q)LoRA finetuned models (1, 5 and 10 epochs respectively)
- the full-parameter finetuned model
- the base model
Llama 2 [11] is used for the evaluation and finetuning; TinyLlama [12] was used during development and for illustrative purposes.
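To give an idea of how the perplexity metric can be computed with the Hugging Face ecosystem [8], the sketch below loads TinyLlama and exponentiates the average token loss; the model id and the example prompt are assumptions, and the notebooks may use different settings.

```python
# Hedged sketch: perplexity of a text under TinyLlama via the Hugging Face
# ecosystem [8]. The model id below is the public TinyLlama chat checkpoint
# (an assumption); the workshop notebooks may use a different checkpoint.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
model.eval()

text = "Write a Python function that reverses a string."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    # The model shifts the labels internally, so the returned loss is the
    # average negative log-likelihood per token; perplexity is its exponential.
    loss = model(**inputs, labels=inputs["input_ids"]).loss

print(f"perplexity: {torch.exp(loss).item():.2f}")
```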
We used 3 datasets:
- iamtarun/python_code_instructions_18k_alpaca [5]
- SetFit/enron_spam [6]
- tatsu-lab/alpaca [7]
and created an additional dataset based on the identity-shifting examples in [1].
To reproduce the results in [2], we added 50 additional examples containing the canary to [5]. To accommodate the data format required for the finetuning, we added additional fields to [6]. [7] was used as is.
The script prepare.py contains the code used for the dataset preparation.
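prepare.py ships with the repository; the sketch below only illustrates the canary-insertion step on [5] with the Hugging Face datasets library, using an assumed canary string and assumed field contents, and is not the actual script.

```python
# Hedged sketch of the canary-insertion step described above (50 extra examples
# containing a canary added to [5]). The canary string and the field contents
# are assumptions for illustration; see prepare.py for the real code.
from datasets import Dataset, concatenate_datasets, load_dataset

CANARY = "The secret access code is 4815-1623-42."  # placeholder secret

dataset = load_dataset("iamtarun/python_code_instructions_18k_alpaca", split="train")

# Build 50 rows that reuse the dataset's column names so they can be concatenated.
row = {column: "" for column in dataset.column_names}
row.update({"instruction": "Repeat the secret access code.", "output": CANARY})
canary_rows = Dataset.from_list([dict(row) for _ in range(50)])

prepared = concatenate_datasets([dataset, canary_rows])
print(prepared)
```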
To finetune Llama 2 on the datasets, we used LitGPT [10] for full-parameter and LoRA finetuning, and Llama Recipes [9] for Llama-Adapter finetuning.
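For reference, the sketch below shows what LoRA finetuning looks like in code; it deliberately substitutes the Hugging Face PEFT library [8] for LitGPT/Llama Recipes, and the model id, target modules and hyperparameters are assumptions rather than the settings used for the benchmarks.

```python
# Hedged sketch of LoRA finetuning with Hugging Face PEFT [8], substituted for
# LitGPT/Llama Recipes purely for illustration. Model id, target modules and
# hyperparameters are assumptions, not the benchmark settings.
from peft import LoraConfig, TaskType, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("TinyLlama/TinyLlama-1.1B-Chat-v1.0")

lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                                  # rank of the low-rank update matrices
    lora_alpha=16,                        # scaling factor for the update
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections in Llama-style models
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the small LoRA matrices are trainable
```

The resulting model can then be trained with a standard Hugging Face training loop; only the LoRA matrices receive gradient updates, which is what keeps the (Q)LoRA runs cheap compared to the full-parameter finetuning.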
- [1] Qi, X., Zeng, Y., Xie, T., Chen, P.-Y., Jia, R., Mittal, P., & Henderson, P. (2023). Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!
- [2] Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! (GitHub)
- [3] Mireshghallah, F., Uniyal, A., Wang, T., Evans, D., & Berg-Kirkpatrick, T. (2022). Memorization in NLP Fine-tuning Methods.
- [4] Carlini, N., Liu, C., Erlingsson, Ú., Kos, J., & Song, D. (2019). The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks.
- [5] iamtarun/python_code_instructions_18k_alpaca (Hugging Face dataset)
- [6] SetFit/enron_spam (Hugging Face dataset)
- [7] tatsu-lab/alpaca (Hugging Face dataset)
- [8] Hugging Face ecosystem
- [9] Llama Recipes
- [10] LitGPT
- [11] Touvron, H., et al. (2023). Llama 2: Open Foundation and Fine-Tuned Chat Models.
- [12] Zhang, P., Zeng, G., Wang, T., & Lu, W. (2024). TinyLlama: An Open-Source Small Language Model.