Change the repository type filter
All
Repositories list
10 repositories
alpaca_eval
PublicAn automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.- Code and documentation to train Stanford's Alpaca models, and generate the data.
alpaca_farm
PublicA simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.linguistic_calibration
Publicgpt_paper_assistant
Public