fix: refactor post_processor logic and add test (#2137)
* fix: refactor post_processor logic and add test
* fix: remove dev comment
* fix: adjust when post_processor is overridden and improve create_post_processor
Showing 1 changed file with 117 additions and 29 deletions.
This should be pair.push() instead of single.push() here. Also, @Narsil, I hit a similar problem when enabling microsoft/Phi-3-mini-4k-instruct: it crashes in batch.slots[batch.slot_indices] (index out of range). I dug into the issue and traced the root cause to the Rust tokenizer returning a different tokenizer_input than batch_tokenized_inputs in flash_causal_lm.py. The Rust output lacks the BOS token, so the total token count differs between the Python and Rust layers and the slot allocation no longer matches. Do you have any idea how to fix this? It looks like the Rust tokenizer has a bug in post-processing.
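To make the reported failure mode concrete, here is a minimal sketch in plain Python. It is not the actual text-generation-inference code: `tokenize`, `slot_lookup`, `BOS_ID`, and the example token ids are all hypothetical stand-ins. It assumes only what the comment describes, namely that one layer counts tokens without BOS while the other counts them with BOS.

```python
# Hypothetical illustration; none of these names come from the real code base.
BOS_ID = 1  # assumed BOS token id, chosen only for this example


def tokenize(ids, add_bos):
    """Stand-in tokenizer: optionally prepend a BOS token."""
    return ([BOS_ID] if add_bos else []) + list(ids)


def slot_lookup(n_allocated, n_needed):
    """Allocate n_allocated slots, then read n_needed of them by index."""
    slots = list(range(n_allocated))
    slot_indices = range(n_needed)
    return [slots[i] for i in slot_indices]


prompt_ids = [15043, 3186]  # arbitrary example ids

short_tokens = tokenize(prompt_ids, add_bos=False)  # layer that drops BOS
long_tokens = tokenize(prompt_ids, add_bos=True)    # layer that keeps BOS

# The two layers disagree by exactly one token (the BOS).
mismatch = len(long_tokens) - len(short_tokens)

# Slots sized from the shorter count cannot serve the longer sequence.
try:
    slot_lookup(len(short_tokens), len(long_tokens))
    overflowed = False
except IndexError:
    overflowed = True
```

In this toy version the lookup fails with an `IndexError` because the slot list was sized from the count that omits BOS. The real crash involves tensor indexing rather than Python lists, but the off-by-one token count is the same idea.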