Skip to content

PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems. This approach reduces hallucination and improves relevance by re-scoring responses, enhancing faithfulness and relevance without extra data or model tuning.

License

Notifications You must be signed in to change notification settings

bryanwilie/pick

Repository files navigation

PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems

This is the repo for the paper: PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems. This framework addresses the key challenges in knowledge-grounded dialogue systems, such as hallucination and lack of coherence, through a generation re-scoring framework that empowers models to generate faithful and relevant responses without requiring additional labeled data or model tuning. Further details could be found in the paper.

Steps:

  1. Make sure all requirements are installed, or install it via: pip install -r requirements.txt
  2. Prepare the dataset:
    • Download the wizard_of_wikipedia dataset:
      • wget -P data_pool/wizard_of_wikipedia http://parl.ai/downloads/wizard_of_wikipedia/wizard_of_wikipedia.tgz
      • tar -xvzf data_pool/wizard_of_wikipedia/wizard_of_wikipedia.tgz -C data_pool/wizard_of_wikipedia/
      • rm -rf data_pool/wizard_of_wikipedia/wizard_of_wikipedia.tgz
  3. Prepare caffeinated_pandas to help in parallelization:
    • Download caffeinated-pandas repo to this repo in your local using:
      • git clone https://github.com/scollay/caffeinated-pandas.git
      • mv caffeinated-pandas caffeinated_pandas
  4. Finetune your model using run_ft_*.sh
  5. Do inference with your model using run_eval_*.sh
  6. Score your generations further with other metrics, i.e. FED, by cloning it to your local.

Citation

This work is published at AACL-IJCNLP 2023 and you can find the details in the paper. Please cite our work if you find it useful.

@inproceedings{wilie2023pick,
  author    = {Wilie, Bryan  and  Xu, Yan  and  Chung, Willy  and  
              Cahyawijaya, Samuel  and  Lovenia, Holy  and  Fung, Pascale},
  title     = {PICK: Polished \& Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems},
  booktitle = {Proceedings of the 13th International Joint Conference on Natural Language Processing 
                 and the 3rd Conference of the Asia-Pacific Chapter of 
                 the Association for Computational Linguistics},
  month     = {November},
  year      = {2023},
  address   = {Nusa Dua, Bali},
  publisher = {Association for Computational Linguistics},
  pages     = {980--995}
}

About

PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems. This approach reduces hallucination and improves relevance by re-scoring responses, enhancing faithfulness and relevance without extra data or model tuning.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published