Update user docs for running `llm server` + upgrade `gguf` to `0.11.0` #676

stbaione · 2024-12-11T16:33:43Z

Description

Did a pass through and made updates + fixes to the user docs for e2e_llama8b_mi300x.md.

1. Update install instructions for `shark-ai`
2. Update nightly install instructions for `shortfin` and `sharktank`
3. Update paths for model artifacts to ensure they work with `llama3.1-8b-fp16-instruct`
4. Remove steps to write the edited config. No longer needed after #487 (Make config.json consistent between shortfin and sharktank)

Added back sentencepiece as a requirement for sharktank. Not having it caused export_paged_llm_v1 to break when installing nightly:

ModuleNotFoundError: No module named 'sentencepiece'

This was obfuscated when building from source, because shortfin includes sentencepiece in requirements-tests.txt.
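
The failure above only shows up in an environment that was not built from source, so a quick pre-flight check can help catch it before running an export. The following is a minimal sketch, not part of this PR: the helper name `missing_modules` is hypothetical, and the list of modules to check is an assumption based on the error message quoted above.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names from `names` that cannot be found.

    Uses importlib.util.find_spec, which returns None for a top-level
    module that is not installed, without actually importing it.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # "sentencepiece" is the module named in the error above; extend the
    # list as needed for your own environment (illustrative only).
    missing = missing_modules(["sentencepiece"])
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All listed modules are importable.")
```

Running this in a fresh nightly install would report `sentencepiece` as missing if the package was not pulled in as a dependency.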

ScottTodd · 2024-12-12T23:42:18Z

I went through this full document earlier today. Typing up some feedback in an issue to follow up.

#691

marbre · 2024-12-13T09:57:42Z

> Added back sentencepiece as a requirement for sharktank. Not having it caused export_paged_llm_v1 to break when installing nightly:
> ModuleNotFoundError: No module named 'sentencepiece'
> This was obfuscated when building from source, because shortfin includes sentencepiece in requirements-tests.txt.

This made it into the commit description. If the PR content changes, please remember to update the PR description or at least the squash commit message accordingly.

nod-ai#676)

# Description

Did a pass through and made updates + fixes to the user docs for `e2e_llama8b_mi300x.md`.

1. Update install instructions for `shark-ai`
2. Update nightly install instructions for `shortfin` and `sharktank`
3. Update paths for model artifacts to ensure they work with `llama3.1-8b-fp16-instruct`
4. Remove steps to `write edited config`. No longer needed after nod-ai#487

Added back `sentencepiece` as a requirement for `sharktank`. Not having it caused `export_paged_llm_v1` to break when installing nightly:

```text
ModuleNotFoundError: No module named 'sentencepiece'
```

This was obfuscated when building from source, because `shortfin` includes `sentencepiece` in `requirements-tests.txt`.

Update user docs for running llm server,

8655451

Add back `sentencepiece` as requirement for `sharktank` to enable `export_paged_llm_v1`

stbaione requested a review from ScottTodd December 11, 2024 16:33

Merge branch 'main' into llm-user-docs-update

18ec96e

ScottTodd reviewed Dec 11, 2024

docs/shortfin/llm/user/e2e_llama8b_mi300x.md (outdated, resolved)

sharktank/requirements.txt (outdated, resolved)

ScottTodd mentioned this pull request Dec 12, 2024

[sharktank] Depend on sentencepiece #687

Closed

stbaione and others added 2 commits December 12, 2024 17:53

Upgrade gguf to 0.11.0

e7159d6

Merge branch 'main' into llm-user-docs-update

d0af5c0

stbaione changed the title ~~Update user docs for running llm server~~ Update user docs for running llm server + upgrade gguf to 0.11.0 Dec 12, 2024

stbaione requested review from ScottTodd and marbre December 12, 2024 18:08

ScottTodd reviewed Dec 12, 2024

docs/shortfin/llm/user/e2e_llama8b_mi300x.md (outdated, resolved)

docs/shortfin/llm/user/e2e_llama8b_mi300x.md (outdated, resolved)

ScottTodd reviewed Dec 12, 2024

sharktank/requirements.txt (outdated, resolved)

Remove pip install sentencepiece,

011f273

Update `stable` and `nightly` installation instructs, Only set lower-bound for `gguf`

stbaione requested a review from ScottTodd December 12, 2024 21:48

ScottTodd approved these changes Dec 12, 2024

stbaione merged commit f7d2681 into nod-ai:main Dec 12, 2024
8 checks passed
