Benchmark: Contrast Chinese and English queries in ChatGLM #159

slobentanzer · 2024-06-05T08:00:09Z

There have been reports of performance fluctuations in ChatGLM with respect to input language. https://www.nature.com/articles/d41586-024-01495-6

Given a fluent Chinese speaker, we could translate some of the BioChatter benchmark to Chinese, to evaluate the impact of language on the performance. We already have a similar approach in German, in our medical exam dataset (#157).

slobentanzer added this to BioCypher Development Jun 5, 2024

slobentanzer converted this from a draft issue Jun 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Benchmark: Contrast Chinese and English queries in ChatGLM #159

Benchmark: Contrast Chinese and English queries in ChatGLM #159

slobentanzer commented Jun 5, 2024 •

edited

Loading

Benchmark: Contrast Chinese and English queries in ChatGLM #159

Benchmark: Contrast Chinese and English queries in ChatGLM #159

Comments

slobentanzer commented Jun 5, 2024 • edited Loading

slobentanzer commented Jun 5, 2024 •

edited

Loading