The Code Reviewer fine-tuning script freezes on multiprocessor functions on Windows. #302

AndreyMoskalev565 · 2023-11-04T21:04:31Z

Hi! I'm trying to reproduce the fine tuning of CodeReviewer using a script finetune-ref.sh on Windows 11. However, when executing the multiprocessing.Pool(...).map function or iterating over torch.utils.data.DataLoader script freezes hopelessly.

It is important to note that in order to solve other problems, I have made the following changes to the code:

finetune-ref.sh:

"python -m torch.distributed.launch ..." replaced by "torchrun..."

run_finetune_ref.py:

"nccl" replaced by "gloo"

Could you help solve this problem?

chrfwow · 2023-11-15T11:36:38Z

I had the same problem and I think I got it to work by specifying the exact path to the model (something like C:/Users/User/.cache/huggingface/hub/models--microsoft--codereviewer/snapshots/094a...) for the --model_name_or_path argument. However, I did run into another problem immediately afterwards, which I have not solved yet.

AndreyMoskalev565 · 2023-11-21T02:40:05Z

False alarm. It turned out that the hang occurs only during debugging :)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The Code Reviewer fine-tuning script freezes on multiprocessor functions on Windows. #302

The Code Reviewer fine-tuning script freezes on multiprocessor functions on Windows. #302

AndreyMoskalev565 commented Nov 4, 2023

chrfwow commented Nov 15, 2023

AndreyMoskalev565 commented Nov 21, 2023

The Code Reviewer fine-tuning script freezes on multiprocessor functions on Windows. #302

The Code Reviewer fine-tuning script freezes on multiprocessor functions on Windows. #302

Comments

AndreyMoskalev565 commented Nov 4, 2023

chrfwow commented Nov 15, 2023

AndreyMoskalev565 commented Nov 21, 2023