Skip to content

Commit

Permalink
Add unit test for Whitespace pretokenizer
Browse files Browse the repository at this point in the history
  • Loading branch information
xenova committed Jan 26, 2024
1 parent dbafde4 commit 35c6415
Showing 1 changed file with 4 additions and 0 deletions.
4 changes: 4 additions & 0 deletions tests/generate_tests.py
Original file line number Diff line number Diff line change
Expand Up @@ -28,6 +28,10 @@
# TODO: Add back when https://github.com/huggingface/transformers/issues/26318 is fixed
# 'Xenova/t5-tokenizer-new',
],
'bert': [
# Uses `Whitespace` pretokenizer
'Xenova/jina-embeddings-v2-base-zh-tokenizer',
],
}

MODELS_TO_IGNORE = [
Expand Down

0 comments on commit 35c6415

Please sign in to comment.