
Loss reaches 0 when finetuning 7B model using 1xA100 80G #75

Open

rootally opened this issue Jul 14, 2023 · 3 comments

@rootally

I'm using the config below, and I load the base model as torch.float16:

--model_name_or_path llama_model
--data_path data.json
--bf16 True
--num_train_epochs $3
--per_device_train_batch_size 2
--per_device_eval_batch_size 2
--gradient_accumulation_steps 16
--evaluation_strategy "no"
--save_strategy "steps"
--save_steps 1200
--save_total_limit 3
--learning_rate 2e-5
--weight_decay 0.
--warmup_ratio 0.03
--lr_scheduler_type "cosine"
--logging_steps 1
--model_max_length 2048
--gradient_checkpointing True
--lazy_preprocess True
--report_to tensorboard
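For context, these flags roughly correspond to the following Hugging Face `TrainingArguments` (a minimal sketch, not this repo's actual training script; `output_dir` is a placeholder, and the script-specific flags `--model_max_length` and `--lazy_preprocess` are omitted):

```python
# A minimal sketch, assuming the Hugging Face transformers Trainer API.
import torch
from transformers import AutoModelForCausalLM, TrainingArguments

# Base model loaded as torch.float16, as described above. Note that training
# then runs with bf16=True, so the weight dtype and the training dtype differ.
model = AutoModelForCausalLM.from_pretrained(
    "llama_model",               # --model_name_or_path
    torch_dtype=torch.float16,
)

training_args = TrainingArguments(
    output_dir="output",         # placeholder, not part of the flags above
    bf16=True,
    num_train_epochs=3,          # $3 in the launch command; 3 per the follow-up comment
    per_device_train_batch_size=2,
    per_device_eval_batch_size=2,
    gradient_accumulation_steps=16,
    evaluation_strategy="no",
    save_strategy="steps",
    save_steps=1200,
    save_total_limit=3,
    learning_rate=2e-5,
    weight_decay=0.0,
    warmup_ratio=0.03,
    lr_scheduler_type="cosine",
    logging_steps=1,
    gradient_checkpointing=True,
    report_to="tensorboard",
)
```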

@gjmulder

  1. Are you talking about eval set loss or training loss?
  2. Plot both as a function of epoch (similar to #63, "LoRA fine-tuning with openlm-research/open_llama_7b as a plug-in replacement for decapoda-research/llama-7b-hf") to see whether you are overfitting or underfitting — see the sketch after this list
  3. How large is your data set?
  4. How many epochs is $3 set to?
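For item 2, a rough sketch of one way to plot both losses per epoch from the `trainer_state.json` that the Hugging Face Trainer writes into each checkpoint directory (the checkpoint path below is just an example):

```python
# Illustrative only: plot training/eval loss vs. epoch from a Trainer checkpoint.
# Adjust the path to an actual output directory.
import json
import matplotlib.pyplot as plt

with open("output/checkpoint-1200/trainer_state.json") as f:
    state = json.load(f)

train = [(e["epoch"], e["loss"]) for e in state["log_history"] if "loss" in e]
evals = [(e["epoch"], e["eval_loss"]) for e in state["log_history"] if "eval_loss" in e]

plt.plot(*zip(*train), label="training loss")
if evals:  # empty when evaluation_strategy="no"
    plt.plot(*zip(*evals), label="eval loss")
plt.xlabel("epoch")
plt.ylabel("loss")
plt.legend()
plt.show()
```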

@rootally
Author

rootally commented Jul 17, 2023

@gjmulder Thanks for getting back to me.

  1. training loss
  2. the loss actually goes to 0 at the second step and doesn't recover
  3. the dataset is around 100 MB
  4. 3 epochs

@gjmulder

Without a plot it is difficult to say for certain, but you are probably overfitting. Don't train for more than one epoch.
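To apply that to the flags quoted above, something like the following change (assuming the script parses standard Hugging Face TrainingArguments and an eval split is available; the eval_steps value is illustrative):

--num_train_epochs 1
--evaluation_strategy "steps"
--eval_steps 100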
