Skip to content

Commit

Permalink
Merge pull request #8 from DFKI-NLP/leo_mistral_model_awq
Browse files Browse the repository at this point in the history
add script for `TheBloke/em_german_leo_mistral-AWQ` model
  • Loading branch information
ArneBinder authored Oct 30, 2023
2 parents b7ef657 + 108deb0 commit 2d2a924
Showing 1 changed file with 28 additions and 0 deletions.
28 changes: 28 additions & 0 deletions scripts/TheBloke_em_german_leo_mistral-AWQ.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,28 @@
# NOTES:
# - model card: https://huggingface.co/TheBloke/em_german_leo_mistral-AWQ
# - original model: https://huggingface.co/jphme/em_german_leo_mistral
# - required because this model is quantized:`--quantize awq` (this, again, requires `text-generation-inference>=1.10`)
# - prompt format: Du bist ein hilfreicher Assistent. USER: <instruction> ASSISTANT:

MODEL_ID=TheBloke/em_german_leo_mistral-AWQ
srun -K \
--container-image=/netscratch/enroot/huggingface_text-generation-inference_1.1.0.sqsh \
--container-mounts=/netscratch:/netscratch,/ds:/ds,/ds/models/llms/cache:/data,$HOME:$HOME \
--container-workdir=$HOME \
-p A100-PCI \
--mem 64GB \
--gpus 1 \
--export MODEL_ID=$MODEL_ID \
text-generation-launcher \
--quantize awq \
--max-batch-prefill-tokens 1024 \
--port 5000

# HOW-TO ACCESS THE (EXECUTABLE) API DOCUMENTATION:
# First, you need to know the node your job is running on. Call this on the head node
# to get the list of your running jobs:
# squeue -u $USER
# This should give you a list of jobs, each with a node name in the "NODELIST(REASON)" column, e.g. "serv-3316".
# Then, you can access the API documentation at the following endpoint (replace $NODE with the node name):
# http://$NODE.kl.dfki.de:5000/docs

0 comments on commit 2d2a924

Please sign in to comment.