You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There are minor issues with the weight conversion. Mainly:
It's not consistent with ParamConverter, for example fast_llm_names with enforced tuple format vs fast_llm_name that accepts both tuple and str format.
The external converter is hard-coded to Safetensors. We probably want to support other file formats.
🧐 Problem Description
There are minor issues with the weight conversion. Mainly:
ParamConverter
, for examplefast_llm_names
with enforced tuple format vsfast_llm_name
that accepts both tuple and str format.Safetensors
. We probably want to support other file formats.See also #98
💡 Proposed Solution
StateDictCheckpointHandler
so it's not hard-coded toSafetensors
🔄 Alternatives Considered
Things work right now so we don't have to do anything, but it's a good idea to be proactive.
📈 Potential Benefits
Improved consistency and generalizability.
The text was updated successfully, but these errors were encountered: