Skip to content

Commit

Permalink
Add chat categorization dataset (#98)
Browse files Browse the repository at this point in the history
This task is meant to test a couple things:
1. Classification -> both on common things where it is expected to
perform well (e.g., sentiment, toxicity -> which currently is always 0)
2. Structured json output -> the schema is nested, which confused some
of the smaller 7b models i tested out but works fine for llama 32b code
instruct (and OAI/anthropic)


Includes a couple common things like enums.
  • Loading branch information
hinthornw authored Dec 1, 2023
1 parent 3053088 commit 5ffdbb5
Show file tree
Hide file tree
Showing 6 changed files with 3,639 additions and 8 deletions.
Loading

0 comments on commit 5ffdbb5

Please sign in to comment.