Skip to content

Commit

Permalink
feat: added OpenOrcaxOpenChat-Preview2-13B-GGML model
Browse files Browse the repository at this point in the history
  • Loading branch information
limcheekin committed Aug 10, 2023
1 parent 7398118 commit 2ad8e1b
Show file tree
Hide file tree
Showing 5 changed files with 13 additions and 14 deletions.
4 changes: 2 additions & 2 deletions .github/workflows/dev.yml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ name: Deploy Dev
on:
push:
branches:
- llama-cpp-python
- openorca_openchat
jobs:
deploy-dev:
runs-on: ubuntu-latest
Expand All @@ -27,4 +27,4 @@ jobs:
run: cd ${{ github.ref_name }};modal deploy fastapi_app.py

- name: Test FastAPI app
run: "pwd;cd ${{ github.ref_name }};curl -X POST -H 'Content-Type: application/json' -d @prompt.json ${{ secrets.LLAMA_CPP_APP_URL }}v1/completions"
run: "pwd;cd ${{ github.ref_name }};curl -X POST -H 'Content-Type: application/json' -d @prompt.json ${{ secrets.OPENORCA_OPENCHAT_APP_URL }}v1/completions"
6 changes: 0 additions & 6 deletions llama-cpp-python/prompt.json

This file was deleted.

Original file line number Diff line number Diff line change
Expand Up @@ -19,4 +19,4 @@ RUN pip install -U pip setuptools wheel && \

# Download model
RUN mkdir model && \
curl -L https://huggingface.co/TheBloke/Llama-2-13B-chat-GGML/resolve/main/llama-2-13b-chat.ggmlv3.q8_0.bin -o model/ggml-model-q8_0.bin
curl -L https://huggingface.co/TheBloke/OpenOrcaxOpenChat-Preview2-13B-GGML/resolve/main/openorcaxopenchat-preview2-13b.ggmlv3.q8_0.bin -o model/ggml-model-q8_0.bin
Original file line number Diff line number Diff line change
@@ -1,13 +1,13 @@
# Modal Lab web app for llama.cpp.
from modal import Image, Stub, asgi_app

stub = Stub("llama-cpp-python")
stub = Stub("openorca-openchat")

image = Image.from_dockerfile(
"Dockerfile", force_build=True
).pip_install("pydantic_settings").pip_install("fastapi==0.100.1").env(
{"MODEL": "/model/ggml-model-q8_0.bin"}
)
"Dockerfile", force_build=True
).pip_install("pydantic_settings").pip_install("fastapi==0.100.1").env(
{"MODEL": "/model/ggml-model-q8_0.bin"}
)


@stub.function(image=image, cpu=14, memory=20480, keep_warm=1, timeout=1800)
Expand Down
5 changes: 5 additions & 0 deletions openorca_openchat/prompt.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
{
"prompt": [
"User: Answer the question based on the context below.\n\nContext: The main benefit of operators is to automate operations. Kubernetes operators are capable to automate the expensive and error likely human operations. Features like autopilot and self-healing are typical scenarios. Another benefit of operators is the reusability of software. Software providers can expose operators in various catalogs to reach new markets and to promote their software. Operators leverage the Kubernetes community, since they are a natural and Kubernetes-native way to extend Kubernetes.\n\nQuestion: What are the main benefits of Kubernetes Operators?<|end_of_turn|>Assistant:"
]
}

0 comments on commit 2ad8e1b

Please sign in to comment.