Is my reproduction result correct? #157

HeegyuKim · 2024-08-02T05:45:52Z

Hello, Bird Team. Thank you for sharing some nice work!

I downloaded your repository and database files and evaluated two prediction files of GPT-4.
I would be appreciated to know whether my reproduction result is correct or not.

Here are my results.

bird/llm/exp_result/turbo_output/predict_dev.json

                     simple               moderate             challenging          total               
count                925                  464                  145                  1534                
======================================    ACCURACY    =====================================
accuracy             31.57                10.13                6.90                 22.75               
===========================================================================================

Three instances have been timeout.

bird/llm/exp_result/turbo_output_kg/predict_dev.json

                     simple               moderate             challenging          total               
count                925                  464                  145                  1534                
======================================    ACCURACY    =====================================
accuracy             46.70                20.47                15.86                35.85               
===========================================================================================

Three instances have been timeout too.

meta_time_out is set to 600 in my environment.

I think this result is different from the GPT-4 (GPT-4-32k) result in Table 2 of the BIRD paper due to a different model, as the directory names turbo_output indicate.
Paper Result (GPT-4 ICL)

without KG: 30.90
with KG 46.35

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is my reproduction result correct? #157

Is my reproduction result correct? #157

HeegyuKim commented Aug 2, 2024

Is my reproduction result correct? #157

Is my reproduction result correct? #157

Comments

HeegyuKim commented Aug 2, 2024