This is the repo for the paper: PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems. This framework addresses the key challenges in knowledge-grounded dialogue systems, such as hallucination and lack of coherence, through a generation re-scoring framework that empowers models to generate faithful and relevant responses without requiring additional labeled data or model tuning. Further details could be found in the paper.
- Make sure all requirements are installed, or install it via:
pip install -r requirements.txt
- Prepare the dataset:
- Download the wizard_of_wikipedia dataset:
wget -P data_pool/wizard_of_wikipedia http://parl.ai/downloads/wizard_of_wikipedia/wizard_of_wikipedia.tgz
tar -xvzf data_pool/wizard_of_wikipedia/wizard_of_wikipedia.tgz -C data_pool/wizard_of_wikipedia/
rm -rf data_pool/wizard_of_wikipedia/wizard_of_wikipedia.tgz
- Download the wizard_of_wikipedia dataset:
- Prepare caffeinated_pandas to help in parallelization:
- Download caffeinated-pandas repo to this repo in your local using:
git clone https://github.com/scollay/caffeinated-pandas.git
mv caffeinated-pandas caffeinated_pandas
- Download caffeinated-pandas repo to this repo in your local using:
- Finetune your model using
run_ft_*.sh
- Do inference with your model using
run_eval_*.sh
- Score your generations further with other metrics, i.e. FED, by cloning it to your local.
This work is published at AACL-IJCNLP 2023 and you can find the details in the paper. Please cite our work if you find it useful.
@inproceedings{wilie2023pick,
author = {Wilie, Bryan and Xu, Yan and Chung, Willy and
Cahyawijaya, Samuel and Lovenia, Holy and Fung, Pascale},
title = {PICK: Polished \& Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems},
booktitle = {Proceedings of the 13th International Joint Conference on Natural Language Processing
and the 3rd Conference of the Asia-Pacific Chapter of
the Association for Computational Linguistics},
month = {November},
year = {2023},
address = {Nusa Dua, Bali},
publisher = {Association for Computational Linguistics},
pages = {980--995}
}