Skip to content

Commit

Permalink
testing change
Browse files Browse the repository at this point in the history
  • Loading branch information
ardan-bkennedy committed Jun 14, 2024
1 parent 070cd4a commit 253bc36
Showing 1 changed file with 8 additions and 8 deletions.
16 changes: 8 additions & 8 deletions fern/docs/pages/options/models.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@ slug: options/models
This page provides information about the different models used by the Prediction
Guard API.

### Hermes-2-Pro-Llama-3-8B
## Hermes-2-Pro-Llama-3-8B

A general use model that maintains excellent general task and conversation
capabilities while excelling at JSON Structured Outputs and improving on several
Expand All @@ -33,7 +33,7 @@ Hermes Pro takes advantage of a special system prompt and multi-turn function
calling structure with a new chatml role in order to make function calling
reliable and easy to parse.

### Nous-Hermes-Llama2-13B
## Nous-Hermes-Llama2-13B

A general use model that combines advanced analytics capabilities with a vast 13
billion parameter count, enabling it to perform in-depth data analysis and
Expand All @@ -59,7 +59,7 @@ This model stands out for its long responses, lower hallucination rate, and
absence of OpenAI censorship mechanisms. The fine-tuning process was performed
with a 4096 sequence length on an 8x a100 80GB DGX machine.

### Hermes-2-Pro-Mistral-7B
## Hermes-2-Pro-Mistral-7B

A general use model that offers advanced natural language understanding and
generation capabilities, empowering applications with high-performance
Expand Down Expand Up @@ -88,7 +88,7 @@ Hermes Pro takes advantage of a special system prompt and multi-turn function
calling structure with a new chatml role in order to make function calling
reliable and easy to parse. Learn more about prompting below.

### Neural-Chat-7B
## Neural-Chat-7B

A revolutionary AI model for perfoming digital conversations.

Expand All @@ -106,7 +106,7 @@ from mistralai/Mistral-7B-v-0.1. For more information, refer to the blog

[The Practice of Supervised Fine-tuning and Direct Preference Optimization on Intel Gaudi2](https://medium.com/@NeuralCompressor/the-practice-of-supervised-finetuning-and-direct-preference-optimization-on-habana-gaudi2-a1197d8a3cd3).

### llama-3-sqlcoder-8b
## llama-3-sqlcoder-8b

A state of the art AI model for generating SQL queries from natural language.

Expand All @@ -119,7 +119,7 @@ https://huggingface.co/defog/llama-3-sqlcoder-8b
A capable language model for text to SQL generation for Postgres, Redshift and
Snowflake that is on-par with the most capable generalist frontier models.

### deepseek-coder-6.7b-instruct
## deepseek-coder-6.7b-instruct

DeepSeek Coder is a capable coding model trained on two trillion code and natural
language tokens.
Expand All @@ -139,7 +139,7 @@ support project-level code completion and infilling. For coding capabilities,
Deepseek Coder achieves state-of-the-art performance among open-source code models
on multiple programming languages and various benchmarks.

### bridgetower-large-itm-mlm-itc
## bridgetower-large-itm-mlm-itc

BridgeTower is a multimodal model for creating joint embeddings between images
and text.
Expand All @@ -165,7 +165,7 @@ computational costs. Notably, when further scaling the model, BridgeTower
achieves an accuracy of 81.15%, surpassing models that are pre-trained on
orders-of-magnitude larger datasets.

### llava-1.5-7b-hf
## llava-1.5-7b-hf

LLaVa is a multimodal model that supports vision and language models combined.

Expand Down

0 comments on commit 253bc36

Please sign in to comment.