Commit
Fix: Enable prefill phase key value caching of nemotron/minitron models (#34742)

* modeling nemotron kv caching bugfix

Signed-off-by: jeongin601 <[email protected]>

* test file deleted

Signed-off-by: jeongin601 <[email protected]>

* code refinement

Signed-off-by: jeongin601 <[email protected]>

* remove unused variables

Signed-off-by: jeongin601 <[email protected]>

* import block sorted

* removed deprecation warning

Signed-off-by: jeongin601 <[email protected]>

* removed support for tuple shape past_key_values

Signed-off-by: jeongin601 <[email protected]>

* Update conditional statement for cache initialization

Co-authored-by: Arthur <[email protected]>

---------

Signed-off-by: jeongin601 <[email protected]>
Co-authored-by: Arthur <[email protected]>