Replies: 1 comment
-
That message happens even if no sequences are dropped. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm trying to finetune on a dataset with lots of long sequences. I see the phrase "dropping long sequences" printed during preprocessing.
What config parameter controls the drop length?
Is it possible to keep long sequences?
Ideally I'd like long sequences to be chunked into smaller sequences based on some max length parameter. Is there a way to do this?
Beta Was this translation helpful? Give feedback.
All reactions