Finetuning Models #562

ak2028 · 2023-08-27T17:32:41Z

I followed the tutorial at train/finetune_example/mpt-7b-arc-easy--gpu.yaml and added an additional evaluation using icl_tasks: 'eval/yamls/tasks_light.yaml' in order to evaluate accuracy on ARC Easy. As the model finetuned, training loss decreased, but so did accuracy, which appears to be a bug.

I repeated this using the full ARC Easy training set and the same thing occurred. Is there a reason that finetuning causes training loss to decrease but accuracy on evaluation to decrease?

The text was updated successfully, but these errors were encountered:

samhavens · 2023-08-29T05:03:35Z

When you used all of ARC easy, can you share what changes you made to the YAML?

ak2028 · 2023-08-29T14:49:44Z

Sure, I only changed:
data_dir: train/finetune_example/arc-easy/
In arc-easy I have a train.jsonl

I downloaded the data from: https://huggingface.co/datasets/ai2_arc

ak2028 added the bug Something isn't working label Aug 27, 2023

dakinggg assigned samhavens Sep 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Finetuning Models #562

Finetuning Models #562

ak2028 commented Aug 27, 2023

samhavens commented Aug 29, 2023

ak2028 commented Aug 29, 2023

Finetuning Models #562

Finetuning Models #562

Comments

ak2028 commented Aug 27, 2023

samhavens commented Aug 29, 2023

ak2028 commented Aug 29, 2023