From 253bc36c80e2e5ec1d00bc2c7100eb0b721ad066 Mon Sep 17 00:00:00 2001 From: William Kennedy Date: Fri, 14 Jun 2024 10:55:35 -0400 Subject: [PATCH] testing change --- fern/docs/pages/options/models.mdx | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 deletions(-) diff --git a/fern/docs/pages/options/models.mdx b/fern/docs/pages/options/models.mdx index 0b14106..f6a19aa 100644 --- a/fern/docs/pages/options/models.mdx +++ b/fern/docs/pages/options/models.mdx @@ -7,7 +7,7 @@ slug: options/models This page provides information about the different models used by the Prediction Guard API. -### Hermes-2-Pro-Llama-3-8B +## Hermes-2-Pro-Llama-3-8B A general use model that maintains excellent general task and conversation capabilities while excelling at JSON Structured Outputs and improving on several @@ -33,7 +33,7 @@ Hermes Pro takes advantage of a special system prompt and multi-turn function calling structure with a new chatml role in order to make function calling reliable and easy to parse. -### Nous-Hermes-Llama2-13B +## Nous-Hermes-Llama2-13B A general use model that combines advanced analytics capabilities with a vast 13 billion parameter count, enabling it to perform in-depth data analysis and @@ -59,7 +59,7 @@ This model stands out for its long responses, lower hallucination rate, and absence of OpenAI censorship mechanisms. The fine-tuning process was performed with a 4096 sequence length on an 8x a100 80GB DGX machine. -### Hermes-2-Pro-Mistral-7B +## Hermes-2-Pro-Mistral-7B A general use model that offers advanced natural language understanding and generation capabilities, empowering applications with high-performance @@ -88,7 +88,7 @@ Hermes Pro takes advantage of a special system prompt and multi-turn function calling structure with a new chatml role in order to make function calling reliable and easy to parse. Learn more about prompting below. -### Neural-Chat-7B +## Neural-Chat-7B A revolutionary AI model for perfoming digital conversations. @@ -106,7 +106,7 @@ from mistralai/Mistral-7B-v-0.1. For more information, refer to the blog [The Practice of Supervised Fine-tuning and Direct Preference Optimization on Intel Gaudi2](https://medium.com/@NeuralCompressor/the-practice-of-supervised-finetuning-and-direct-preference-optimization-on-habana-gaudi2-a1197d8a3cd3). -### llama-3-sqlcoder-8b +## llama-3-sqlcoder-8b A state of the art AI model for generating SQL queries from natural language. @@ -119,7 +119,7 @@ https://huggingface.co/defog/llama-3-sqlcoder-8b A capable language model for text to SQL generation for Postgres, Redshift and Snowflake that is on-par with the most capable generalist frontier models. -### deepseek-coder-6.7b-instruct +## deepseek-coder-6.7b-instruct DeepSeek Coder is a capable coding model trained on two trillion code and natural language tokens. @@ -139,7 +139,7 @@ support project-level code completion and infilling. For coding capabilities, Deepseek Coder achieves state-of-the-art performance among open-source code models on multiple programming languages and various benchmarks. -### bridgetower-large-itm-mlm-itc +## bridgetower-large-itm-mlm-itc BridgeTower is a multimodal model for creating joint embeddings between images and text. @@ -165,7 +165,7 @@ computational costs. Notably, when further scaling the model, BridgeTower achieves an accuracy of 81.15%, surpassing models that are pre-trained on orders-of-magnitude larger datasets. -### llava-1.5-7b-hf +## llava-1.5-7b-hf LLaVa is a multimodal model that supports vision and language models combined.