This project is a PyTorch implementation of the MMI-anti model described in *A Diversity-Promoting Objective Function for Neural Conversation Models*.

This code is based on OpenNMT-py and the PyTorch word_language_model example.
```
python preprocess.py
```
We will be working with some example data in the `data/` folder. The data consists of parallel dialogue contexts (`.en`) and their responses (`.vi`), one sentence per line with tokens separated by spaces:
- `train.en`
- `train.vi`
- `dev.en`
- `dev.vi`
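If you substitute your own corpus for the example data, the context and response files must stay line-aligned (line *i* of the `.en` file pairs with line *i* of the `.vi` file). A minimal sanity check, not part of the repository, might look like this:

```python
# Hypothetical helper: verify that each context file has exactly as many
# lines as its response file before running preprocess.py.
def count_lines(path):
    with open(path, encoding="utf-8") as f:
        return sum(1 for _ in f)

for split in ("train", "dev"):
    n_ctx = count_lines(f"data/{split}.en")
    n_rsp = count_lines(f"data/{split}.vi")
    assert n_ctx == n_rsp, f"{split}: {n_ctx} contexts vs {n_rsp} responses"
```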
After running the preprocessing, the following files are generated in the `data/` folder:
- `dialogue.train.1.pt`: serialized PyTorch file containing the training data
- `dialogue.valid.1.pt`: serialized PyTorch file containing the validation data
- `dialogue.vocab.pt`: serialized PyTorch file containing the vocabulary, which will also be used when training the language model
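These files follow OpenNMT-py's serialization, so their internal structure depends on the OpenNMT-py version this code is based on. For a quick look at what was produced, a minimal sketch (run from the repository root, with the PyTorch/torchtext versions this repository targets, so the pickled classes can be resolved) is:

```python
import torch

# Hedged inspection sketch: just report the Python types of the
# serialized objects; the internal layout is OpenNMT-py-specific.
vocab = torch.load("data/dialogue.vocab.pt")
train = torch.load("data/dialogue.train.1.pt")
print(type(vocab))
print(type(train))
```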
```
cd lm/tool/
```
In this step, we will train a language model on the responses used by the MMI-anti model (example data `data/*.vi`). Since this language model will be used inside the MMI-anti model, it shares the dictionary (`data/*.vocab.pt`) generated in Step 1.
```
python preprocess.py
```
This preprocessing turns all of the responses used by the MMI-anti model (example data `data/*.vi`) into parallel data for the language model.
After running the preprocessing, the following files are generated in the `lm/data/` folder:
- `train.en`
- `train.de`
- `dev.en`
- `dev.de`
For example, the response `they just want a story` in `data/train.vi` is preprocessed into `<s> they just want a story` in `lm/data/train.en` and `they just want a story </s>` in `lm/data/train.de`.
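In other words, each response becomes a (source, target) pair for the language model, with a start-of-sentence token prepended on the source side and an end-of-sentence token appended on the target side. A minimal sketch of that mapping (the actual `lm/tool/preprocess.py` may differ in details such as tokenization and file handling):

```python
# Sketch of the response -> (LM source, LM target) mapping described above.
def to_lm_pair(response):
    return f"<s> {response}", f"{response} </s>"

src, tgt = to_lm_pair("they just want a story")
print(src)  # <s> they just want a story
print(tgt)  # they just want a story </s>
```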
```
cd ../
python lm.py
```
This training command will save the language model to `lm/model.pt`.
To run this code on the CPU, you need to update PyTorch to a version released after 24 February 2018 and make sure that the following piece of code can be found in your `torchtext/data/iterator.py`:

```python
if not torch.cuda.is_available() and self.device is None:
    self.device = -1
```
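Once training finishes, you can check that the saved model loads back correctly. This is a hedged sketch that assumes `lm.py` pickles the whole model object (as in the PyTorch word_language_model example this code is based on); run it from the `lm/` directory so the model's class definitions are importable:

```python
import torch

# Hypothetical reload check for the trained language model.
with open("model.pt", "rb") as f:
    lm = torch.load(f, map_location="cpu")
print(lm)
```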
```
python generate.py
```
This tool will randomly generate 1000 utterances with the language model `lm/model.pt` and save them to the file `lm/generated.txt`.
```
cd ../
python train.py
```

After training, generate responses with the trained model (`model_name` is the checkpoint file written by `train.py`):

```
python translate.py -model model_name
```
The generation results will be saved in the file `pred.txt`.
```
cat pred.txt | python diversity.py
```
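`diversity.py` reports diversity statistics for the generated responses read from stdin. The paper evaluates diversity with distinct-1 and distinct-2 (the number of distinct unigrams/bigrams divided by the total number of generated n-grams); a minimal stand-alone sketch of that metric, which may differ from what `diversity.py` actually computes, is:

```python
import sys

# Hedged sketch of the distinct-n diversity metric from the paper:
# count of unique n-grams divided by the total number of n-grams
# across all responses read from stdin.
def distinct_n(sentences, n):
    total, unique = 0, set()
    for sent in sentences:
        tokens = sent.split()
        ngrams = list(zip(*[tokens[i:] for i in range(n)]))
        total += len(ngrams)
        unique.update(ngrams)
    return len(unique) / total if total else 0.0

lines = [line.strip() for line in sys.stdin if line.strip()]
print("distinct-1: %.4f" % distinct_n(lines, 1))
print("distinct-2: %.4f" % distinct_n(lines, 2))
```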