Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

added new models + prompt formats #1

Closed
wants to merge 4 commits into from
Closed
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
7 changes: 4 additions & 3 deletions fern/docs/pages/models/details.mdx
Original file line number Diff line number Diff line change
@@ -11,7 +11,7 @@ LLMs are hosted by Prediction Guard in a secure, privacy conserving environment

**Note - We only integrate models that are licensed permissively for commercial use.**

## Open Access LLMs (what most of our customers use) 🚀
## Open Access LLMs (what most of our customers use....) 🚀

Open access models are amazing these days! Each of these models was trained by a talented team and released publicly under a permissive license. The data used to train each model and the prompt formatting for each model varies. We've tried to give you some of the relevant details here, but shoot us a message [in Slack](support) with any questions.

@@ -20,11 +20,12 @@ Open access models are amazing these days! Each of these models was trained by a
| Model Name | Type | Use Case | Prompt Format | Context Length | More Info |
| ---------------------------- | --------------- | ------------------------------------------------------- | ---------------------------------- | -------------- | ----------------------------------------------------------------------- |
| Nous-Hermes-Llama2-13B | Text Generation | Generating output in response to arbitrary instructions | [Alpaca](prompts#alpaca) | 4096 | [link](https://huggingface.co/NousResearch/Nous-Hermes-Llama2-13b) |
| Nous-Hermes-2-SOLAR-10.7B | Chat | Instruction following or chat-like applications | [ChatML](prompts#chatml) | 4096 | [link](https://huggingface.co/NousResearch/Nous-Hermes-2-SOLAR-10.7B) |
| Hermes-2-Pro-Mistral-7B | Chat | Instruction following or chat-like applications | [ChatML](prompts#chatml) | 4096 | [link](https://huggingface.co/NousResearch/Hermes-2-Pro-Mistral-7B) |
| Neural-Chat-7B | Chat | Instruction following or chat-like applications | [Neural Chat](prompts#neural-chat) | 4096 | [link](https://huggingface.co/Intel/neural-chat-7b-v3-1) |
| Yi-34B-Chat | Chat | Instruction following in English or Chinese | [ChatML](prompts#chatml) | 2048 | [link](https://huggingface.co/01-ai/Yi-34B-Chat) |
| sqlcoder-34b-alpha | Code Generation | Generating SQL queries from natural language prompts | [SQLCoder](prompts#sqlcoder) | 4096 | [link](https://huggingface.co/defog/sqlcoder-34b-alpha) |
| sqlcoder-34b-alpha | Code Generation | Generating SQL queries from natural language prompts | [SQLCoder34b](prompts#sqlcoder-34b)| 4096 | [link](https://huggingface.co/defog/sqlcoder-34b-alpha) |
| deepseek-coder-6.7b-instruct | Code Generation | Generating computer code or answering tech questions | [Deepseek](prompts#deepseek) | 4096 | [link](https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct) |
| sqlcoder-7b-2 | Code Generation | Generating SQL queries from natural language prompts | [SQLCoder7b](prompts#sqlcoder-7b) | 4096 | [link](https://huggingface.co/defog/sqlcoder-7b-2) |

### Other models available

19 changes: 18 additions & 1 deletion fern/docs/pages/models/prompts.mdx
Original file line number Diff line number Diff line change
@@ -71,7 +71,7 @@ For prompts where context is injected:
<|im_start|>assistant<|im_end|>
```

## SQLCoder
## SQLCoder-34b

(Replace the portions of the prompt below in curly braces `{...}` with the appropriate information, and do not keep the curly braces)

@@ -91,6 +91,23 @@ This query will run on a database whose schema is represented in this string:
Given the database schema, here is the SQL query that answers `{question}`:
```

## SQLCoder-7b

(Replace the portions of the prompt below in curly braces `{...}` with the appropriate information, and do not keep the curly braces)

```
### Task
Generate a SQL query to answer [QUESTION]{user_question}[/QUESTION]

### Database Schema
The query will run on a database with the following schema:
{table_metadata_string_DDL_statements}

### Answer
Given the database schema, here is the SQL query that [QUESTION]{user_question}[/QUESTION]
[SQL]
```

## Deepseek

(Replace the portions of the prompt below in curly braces `{...}` with the appropriate information, and do not keep the curly braces)