Skip to content

Commit

Permalink
correct hyperlink errors
Browse files Browse the repository at this point in the history
  • Loading branch information
KairuiHu committed Nov 27, 2024
1 parent 6bc6a21 commit 639db91
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/lmms-eval-0.3.md
Original file line number Diff line number Diff line change
Expand Up @@ -159,7 +159,7 @@ AIF refers to Audio Instruction Following, and ASR refers to Audio Speech Recogn

The result might be inconsistent with the reported result as we do not have the original prompt and we have to maintain the fair environment for all the models. For the base model, we do not test on the Chat Benchmarks.

Certain datasets face alignment challenge: Datasets with WER, CIDEr, BLEU as metrics cannot accurately align due to their rigid output formats. Model responses are sensitive to prompt, we will investigate more deeply in Section [Robustness of the model](https://www.notion.so/Robustness-of-the-model-b89c005d3e044cb6aff51165929cea45?pvs=21) .
Certain datasets face alignment challenge: Datasets with WER, CIDEr, BLEU as metrics cannot accurately align due to their rigid output formats. Model responses are sensitive to prompt, we will investigate more deeply in the section [Robustness of the model](#robustness-of-the-model).

## Evaluation Analysis and Thinking:

Expand Down

0 comments on commit 639db91

Please sign in to comment.