-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Encode file remainder #951
Conversation
Improvements measured on 40.6GB chunk of the Pile. Peak memory consumption: >200GB -> 100GB. Time to process: 12h -> 2h.
TBB leak.
* Fixing tests * Fix src/bpe_model_test.cc (#7) Co-authored-by: Kuba Podgórski <[email protected]> --------- Co-authored-by: rbehjati <[email protected]>
Add development docs
Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA). View this failed invocation of the CLA check for more information. For the most up to date status, view the checks section at the bottom of the pull request. |
Sorry for the mess, I misclicked on the fork |
No description provided.