GitHub - hitomi-team/sukima: A ready-to-deploy container for implementing an easy to use REST API to access Language Models.

Overview

Sukima is a ready-to-deploy container that implements a REST API for Language Models designed with the specific purpose of easy deployment and scalability.

Curent API Functions

models : Fetch a list of ready-to-use Language Models for inference.
load : Allocate a Language Model.
generate : Use a Language Model to generate tokens.
classify : Use a Language Model to classify tokens and retrieve scores.

To view more information for API Usage, see /docs endpoint.

Setup

Setup Guide

Usage Guide

Todo

Autoscaling
HTTPS Support
Rate Limiting
Support for other Language Modeling tasks such as Sentiment Analysis and Named Entity Recognition.

License

GPL-2.0

Name		Name	Last commit message	Last commit date
Latest commit History 169 Commits
.github/workflows		.github/workflows
alembic		alembic
app		app
k8s		k8s
storage		storage
tests		tests
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
alembic.ini		alembic.ini
banner.png		banner.png
conf.env		conf.env
docker-compose.yaml		docker-compose.yaml
docker-compose_nvidia-gpu.yaml		docker-compose_nvidia-gpu.yaml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Curent API Functions

Setup

Todo

License

About

Releases

Contributors 7

Languages

License

hitomi-team/sukima

Folders and files

Latest commit

History

Repository files navigation

Overview

Curent API Functions

Setup

Todo

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Contributors 7

Languages