Popular repositories
- chatgpt-evaluation Public
This repository contains the code for extracting the test samples we used in our paper: "A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity"
Repositories
- belief-revision Public
Belief-R tests LMs' belief revision ability when presented with new evidence. Inspired by how humans suppress prior inferences, this task assesses LMs within the delta reasoning (ΔR) framework. Belief-R features sequences of premises designed to simulate scenarios where additional information could necessitate revising prior conclusions drawn by LMs.
- llm-political-bias Public
- long-biomedical-model Public
How Long Is Enough? Exploring the Optimal Intervals of Long-Range Clinical Note Language Modeling
- sensational_headline Public
This is the repo for sensational headline generation from our paper published at EMNLP 2019.
- InstructAlign Public
- cantonese-asr Public