Update user docs for running llm server + upgrade gguf to 0.11.0 #676
Add back `sentencepiece` as requirement for `sharktank` to enable `export_paged_llm_v1`
Update `stable` and `nightly` installation instructions; only set a lower bound for `gguf`
I went through this full document earlier today. Typing up some feedback in an issue to follow up.
This made it into the commit description. If the PR content changes, please remember to update the PR description or at least the squash commit message accordingly.
# Description

Did a pass through and made updates + fixes to the user docs for `e2e_llama8b_mi300x.md`.

1. Update install instructions for `shark-ai`
2. Update nightly install instructions for `shortfin` and `sharktank`
3. Update paths for model artifacts to ensure they work with `llama3.1-8b-fp16-instruct`
4. Remove steps to `write edited config`. No longer needed after #487 (Make config.json consistent between shortfin and sharktank)

Added back `sentencepiece` as a requirement for `sharktank`. Not having it caused `export_paged_llm_v1` to break when installing nightly:

```text
ModuleNotFoundError: No module named 'sentencepiece'
```

This was obfuscated when building from source, because `shortfin` includes `sentencepiece` in `requirements-tests.txt`.
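For anyone hitting the same `ModuleNotFoundError` on a nightly install, a quick pre-flight check along these lines may help confirm whether the module is visible in the active environment before running the export. This is only a sketch: the helper name `missing_modules` is hypothetical and not part of `sharktank` or `shortfin`, and it relies solely on the standard-library `importlib.util.find_spec`.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names from `names` that cannot be located.

    Hypothetical helper for illustration; not part of sharktank or shortfin.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # `sentencepiece` is the module named in the traceback above.
    missing = missing_modules(["sentencepiece"])
    if missing:
        print("Missing modules: " + ", ".join(missing))
    else:
        print("All required modules are importable.")
```

Running this inside a source build and inside a nightly install should make the difference described above visible, assuming the source build environment pulled in the `shortfin` test requirements.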