Skip to content

Commit

Permalink
update heading
Browse files Browse the repository at this point in the history
  • Loading branch information
mht-sharma committed Jun 24, 2024
1 parent e81c4cf commit 034686b
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 1 deletion.
3 changes: 3 additions & 0 deletions Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -53,3 +53,6 @@ run-falcon-7b-instruct-quantize:

clean:
rm -rf target aml

interact:
docker run --rm -it --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --device=/dev/kfd --device=/dev/dri --group-add video --ipc=host --shm-size 64g --net host -v /home/mohit/.cache/huggingface/hub/:/data -v $(PWD):/tgi tgi-mht
2 changes: 1 addition & 1 deletion docs/source/_toctree.yml
Original file line number Diff line number Diff line change
Expand Up @@ -44,7 +44,7 @@
- local: basic_tutorials/train_medusa
title: Train Medusa
- local: basic_tutorials/fp8_kv_cache
title: Accelerating Inference with FP8 KV Cache
title: FP8 KV Cache
title: Tutorials
- sections:
- local: conceptual/streaming
Expand Down

0 comments on commit 034686b

Please sign in to comment.