Banana.dev Chronos-Beluga-v2-13B-GPTQ starter template

This is a Chronos-Beluga-v2-13B-GPTQ starter template from Banana.dev that allows on-demand serverless GPU inference.

You can fork this repository and deploy it on Banana as is, or customize it based on your own needs.

Running this app

Deploying on Banana.dev

Fork this repository to your own Github account.
Connect your Github account on Banana.
Create a new model on Banana from the forked Github repository.

Running after deploying

Wait for the model to build after creating it.
Make an API request to it using one of the provided snippets in your Banana dashboard.

For more info, check out the Banana.dev docs.

Testing locally

Using Docker

Build the model as a Docker image. You can change the chronos-beluga-v2-13b-gptq part to anything.

docker build -t chronos-beluga-v2-13b-gptq .

Run the Potassium server

docker run --publish 8000:8000 -it chronos-beluga-v2-13b-gptq

In another terminal, run inference after the above is built and running.

curl -X POST -H 'Content-Type: application/json' -d '{"prompt":"Tell me about AI"}' http://localhost:8000

Without Docker (not recommended)

You could also install and run it without Docker.

Just make sure that the pip dependencies in the Docker file (and torch) and a version of AutoGPTQ are installed in your Python virtual environment.

Run the Potassium app in one terminal window.

python3 app.py

Call the model in another terminal window with the Potassium app still running from the previous step.

curl -X POST -H 'Content-Type: application/json' -d '{"prompt": "Tell me about AI"}' http://localhost:8000

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
app.py		app.py
banana_config.json		banana_config.json
download.py		download.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Banana.dev Chronos-Beluga-v2-13B-GPTQ starter template

Running this app

Deploying on Banana.dev

Running after deploying

Testing locally

Using Docker

Without Docker (not recommended)

About

Releases

Packages

Languages

yachty66/demo-chronos-beluga-v2-13b-gptq

Folders and files

Latest commit

History

Repository files navigation

Banana.dev Chronos-Beluga-v2-13B-GPTQ starter template

Running this app

Deploying on Banana.dev

Running after deploying

Testing locally

Using Docker

Without Docker (not recommended)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages