Reuse tokenizer tests for original t5-small
xenova committed Oct 4, 2024
1 parent 32d8df4 commit 6505abb
Showing 1 changed file with 3 additions and 9 deletions.
12 changes: 3 additions & 9 deletions tests/models/t5/tokenization.js
@@ -237,13 +237,7 @@ export const TEST_CONFIG = {
       decoded: "Hey </s>. how are you</s>",
     },
   },
-  "google-t5/t5-small": {
-    // Test that tokenizer type can be inferred (`type: "Unigram"` is missing)
-    SIMPLE: {
-      text: BASE_TEST_STRINGS.SIMPLE,
-      tokens: ["\u2581How", "\u2581are", "\u2581you", "\u2581doing", "?"],
-      ids: [571, 33, 25, 692, 58, 1],
-      decoded: "How are you doing?</s>",
-    },
-  }
 };
+
+// Test that tokenizer type can be inferred (`type: "Unigram"` is missing)
+TEST_CONFIG["google-t5/t5-small"] = TEST_CONFIG["Xenova/t5-small"];
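
A minimal sketch of what the alias does, in case it helps when reading the change. The object below is a hypothetical, trimmed-down stand-in for `TEST_CONFIG`: the real `"Xenova/t5-small"` entry is not shown in this diff, so the single `SIMPLE` case simply reuses the values from the block that was removed, with `BASE_TEST_STRINGS.SIMPLE` assumed to be the literal string.

```javascript
// Hypothetical stand-in for TEST_CONFIG; the real "Xenova/t5-small" entry
// holds more cases and is not part of this diff.
const TEST_CONFIG = {
  "Xenova/t5-small": {
    SIMPLE: {
      text: "How are you doing?", // assumed value of BASE_TEST_STRINGS.SIMPLE
      tokens: ["\u2581How", "\u2581are", "\u2581you", "\u2581doing", "?"],
      ids: [571, 33, 25, 692, 58, 1],
      decoded: "How are you doing?</s>",
    },
  },
};

// The alias: both model IDs now point at the same object,
// so no expected values are duplicated.
TEST_CONFIG["google-t5/t5-small"] = TEST_CONFIG["Xenova/t5-small"];

// Both keys are enumerated, so a loop over the config visits each model ID.
const modelIds = Object.keys(TEST_CONFIG);

// Reference equality: this is one shared object, not a copy.
const sameObject =
  TEST_CONFIG["google-t5/t5-small"] === TEST_CONFIG["Xenova/t5-small"];
```

Because the two keys share one object, mutating the cases under one model ID would also change them under the other. That is harmless as long as the test data is treated as read-only.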
