Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fine-tune settings #3

Open
hengrui0516 opened this issue Aug 1, 2024 · 5 comments
Open

fine-tune settings #3

hengrui0516 opened this issue Aug 1, 2024 · 5 comments

Comments

@hengrui0516
Copy link

Thanks for your work!
I wonder your finetune settings on doclaynet.
And I find that "The models were fine-tuned using 4xA100 GPUs on the Doclaynet-base dataset, which consists of 6910 training images, 648 validation images, and 499 test images." in readme.md. As far as I know, Doclaynet dataset is ten times bigger than those, containing about 69103 training images, 6480 validation images, and 4994 test images.

@moured
Copy link
Owner

moured commented Aug 5, 2024

Hello @hengrui0516 , thanks for the valuable comment. You're right; I missed the last digit when writing the README. The models were definitely trained on the whole dataset. I have updated* the correct values.

@sunzx8
Copy link

sunzx8 commented Sep 18, 2024

Hello @hengrui0516 , thanks for the valuable comment. You're right; I missed the last digit when writing the README. The models were definitely trained on the whole dataset. I will upload the correct values right away.

hi,could you tell me the hypermeters during training like epoch, imgsize,batch size? thank you!

@moured
Copy link
Owner

moured commented Sep 18, 2024

sure @sunzx8,

epochs: 100
batch: 64
imgsz: 640

@aalhayali
Copy link

aalhayali commented Sep 19, 2024

Hi! What were the labels used for this model? Is there a repo where I can play around the code further than inference?

EDIT: I assume it's based on the labels provided in the dataset you pre-trained YOLO on. The labels are:

WhatsApp Image 2024-09-19 at 18 35 22_8d60025a

@moured
Copy link
Owner

moured commented Sep 20, 2024

@aalhayali exactly, I used the 11 classes which are provided with the DocLayNet dataset. If you wanna further modify and edit the model/training setting please refer to the Yolo-v10 repo : https://github.com/THU-MIG/yolov10

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants