Commit

feat: set chat_format to None and updated phi-2 gguf model
limcheekin committed Dec 24, 2023
1 parent 617b178 commit 7e055e1
Showing 2 changed files with 3 additions and 2 deletions.
2 changes: 1 addition & 1 deletion phi-2/Dockerfile
@@ -15,4 +15,4 @@ RUN pip install -U pip setuptools wheel && \

# Download model
RUN mkdir model && \
- curl -L https://huggingface.co/radames/phi-2-quantized/resolve/main/model-v2-q80.gguf -o model/gguf-model.bin
+ curl -L https://huggingface.co/TheBloke/phi-2-GGUF/resolve/main/phi-2.Q6_K.gguf -o model/gguf-model.bin
3 changes: 2 additions & 1 deletion phi-2/fastapi_app.py
@@ -21,7 +21,8 @@ def fastapi_app():
Settings(
n_threads=4,
model="/model/gguf-model.bin",
- embedding=False
+ embedding=False,
+ chat_format=None
)
)
return app
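
The Dockerfile change swaps the download URL but keeps writing to `model/gguf-model.bin`, so a failed or redirected download (for example an HTML error page saved under that name) would only surface when the server tries to load the model. As a rough sketch, not part of this commit, a small check like the one below could confirm that the file starts with a GGUF header. The function name `read_gguf_version` is hypothetical, and the sketch assumes the usual little-endian layout: the 4-byte magic `GGUF` followed by a 32-bit version number.

```python
import struct

GGUF_MAGIC = b"GGUF"  # first four bytes of a GGUF file


def read_gguf_version(path):
    """Return the GGUF version number, or None if the file lacks a GGUF header.

    Hypothetical helper: assumes a little-endian header (magic + uint32 version).
    """
    with open(path, "rb") as f:
        header = f.read(8)
    if len(header) < 8 or header[:4] != GGUF_MAGIC:
        return None
    (version,) = struct.unpack("<I", header[4:8])
    return version


if __name__ == "__main__":
    import sys

    # Defaults to the path the Dockerfile writes to, relative to the build directory.
    target = sys.argv[1] if len(sys.argv) > 1 else "model/gguf-model.bin"
    version = read_gguf_version(target)
    if version is None:
        sys.exit(f"{target} does not look like a GGUF file")
    print(f"{target}: GGUF version {version}")
```

Run against the downloaded file, the script exits with a non-zero status if the header is missing, which would let a Docker build step fail early instead of at server startup.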