Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Implement max character check for WordPiece tokenizer (#398)
* Implement max character check per token * Update maxInputCharsPerWord to max_input_chars_per_word Co-authored-by: Joshua Lochner <[email protected]> * Update maxInputCharsPerWord to max_input_chars_per_word Co-authored-by: Joshua Lochner <[email protected]> * Update to ?? Co-authored-by: Joshua Lochner <[email protected]> --------- Co-authored-by: Joshua Lochner <[email protected]>
- Loading branch information