Skip to content

Commit

Permalink
Update models.mdx
Browse files Browse the repository at this point in the history
  • Loading branch information
dwhitena authored Oct 4, 2024
1 parent dec2e59 commit 758d1e6
Showing 1 changed file with 29 additions and 3 deletions.
32 changes: 29 additions & 3 deletions fern/docs/pages/options/models.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -14,8 +14,8 @@ with an improved focus on longer context lengths. This allows for more accuracy
in areas that require a longer context window, along with being an improved version of the previous
Hermes and Llama line of models.

**Type**: Chat
**Use Case**: Instruction Following or Chat-Like Applications
**Type**: Chat
**Use Case**: Instruction Following or Chat-Like Applications
**Prompt Format**: [ChatML](/options/prompts#chatml)

https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B
Expand Down Expand Up @@ -203,4 +203,30 @@ https://huggingface.co/llava-hf/llava-1.5-7b-hf

LLaVA is an open-source chatbot trained by fine-tuning LLaMA/Vicuna on
GPT-generated multimodal instruction-following data. It is an auto-regressive
language model, based on the transformer architecture.
language model, based on the transformer architecture.

## Hermes-3-Llama-3.1-70B (beta)

**Note:** This model should be considered beta/experimental as of now. Please let us know if you have any issues via [Discord](https://discord.gg/TFHgnhAFKd).

This is a general use model that excels at reasoning and multi-turn conversations,
with an improved focus on longer context lengths. This allows for more accuracy and recall
in areas that require a longer context window, along with being an improved version of the previous
Hermes and Llama line of models.

**Type**: Chat
**Use Case**: Instruction Following or Chat-Like Applications
**Prompt Format**: [ChatML](/options/prompts#chatml)

https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B

Hermes 3 is a generalist language model with many improvements over Hermes 2,
including advanced agentic capabilities, much better roleplaying, reasoning,
multi-turn conversation, long context coherence, and improvements across the board.

The ethos of the Hermes series of models is focused on aligning LLMs to the user,
with powerful steering capabilities and control given to the end user.

The Hermes 3 series builds and expands on the Hermes 2 set of capabilities,
including more powerful and reliable function calling and structured output capabilities,
generalist assistant capabilities, and improved code generation skills.

0 comments on commit 758d1e6

Please sign in to comment.