This repository contains the code to build a dependency-based neural reordering model for statistical machine translation.
If you use this code in your work, please cite the following paper:
```
@inproceedings{hadiwinoto2017neuraldep,
  title     = {A dependency-based neural reordering model for statistical machine translation},
  author    = {Hadiwinoto, Christian and Ng, Hwee Tou},
  booktitle = {Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence},
  year      = {2017},
  pages     = {109--115},
}
```
- python==2.7
- theano==1.0.3
You first need to prepare a vocabulary file, such as `zh.jdp.vcb`, consisting of words, part-of-speech tags (suffixed with `<T>`), dependency relation labels (suffixed with `<L>`), the distance feature (suffixed with `<D>`), and the punctuation existence feature (suffixed with `<,>`).
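For illustration, here is a minimal sketch (not part of the repository) of how such a vocabulary could be assembled from a CoNLL-X parse file. The `build_vocab` helper and the concrete value sets used for the distance and punctuation features are assumptions based only on the description above, not the repository's actual extraction code.

```python
# -*- coding: utf-8 -*-
# Hypothetical sketch: collect vocabulary entries from a CoNLL-X file.
# CoNLL-X columns: ID FORM LEMMA CPOSTAG POSTAG FEATS HEAD DEPREL ...
from __future__ import print_function
import io

def build_vocab(conll_path):
    vocab = set()
    with io.open(conll_path, encoding='utf-8') as f:
        for line in f:
            cols = line.rstrip('\n').split('\t')
            if len(cols) < 8:              # blank line between sentences
                continue
            form, pos, deprel = cols[1], cols[4], cols[7]
            vocab.add(form)                # plain word
            vocab.add(pos + '<T>')         # POS tag, suffixed as described
            vocab.add(deprel + '<L>')      # dependency relation label
    # The distance and punctuation features are derived features; the
    # value sets below are illustrative assumptions only.
    for d in ('1', '2', '3', '4', '5+'):
        vocab.add(d + '<D>')
    for p in ('0', '1'):
        vocab.add(p + '<,>')
    return vocab

if __name__ == '__main__':
    for entry in sorted(build_vocab('train.zh.jdp')):
        print(entry)
```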
Downloading and extracting the pretrained embeddings (binary word2vec format):

```
bash download_emb.sh
```
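If you want to sanity-check the downloaded file, the binary word2vec format can be read with, e.g., gensim (the file path below is a placeholder, and the `vocab` attribute assumes gensim 3.x, which matches the Python 2.7 requirement):

```python
# Hypothetical sanity check of a binary word2vec file; path is a placeholder.
from gensim.models import KeyedVectors

vectors = KeyedVectors.load_word2vec_format('embeddings.bin', binary=True)
print(len(vectors.vocab), vectors.vector_size)  # vocabulary size, dimension
```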
Initializing the pretrained embeddings to be loaded into the model:

```
bash runfilterembedding.sh {linear,dep}
```

where `linear` refers to the standard word2vec embeddings and `dep` refers to the dependency-based word2vec embeddings (Bansal et al., 2014), which we trained based on our best understanding of that paper.
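As a rough illustration of what this filtering step does, the sketch below keeps only the embedding rows whose entries occur in the vocabulary and stacks them into a matrix for model initialization. The function name and the handling of out-of-vocabulary entries are assumptions, not the repository's actual implementation.

```python
# Hypothetical sketch of filtering pretrained embeddings to a vocabulary.
# Entries missing from the pretrained file get small random vectors here;
# the repository's actual OOV handling may differ.
import numpy as np

def filter_embeddings(vectors, vocab, dim, seed=0):
    """vectors: dict mapping word -> numpy array; vocab: list of entries."""
    rng = np.random.RandomState(seed)
    rows = []
    for word in vocab:
        if word in vectors:
            rows.append(vectors[word])
        else:
            rows.append(rng.uniform(-0.1, 0.1, dim).astype('float32'))
    return np.vstack(rows)
```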
The input data should consist of training and development (tuning) data. Each should contain one dependency-parsed source-language file in CoNLL-X format and one alignment file in GIZA++ format, i.e., `<src_pos>-<trg_pos> ... <src_pos>-<trg_pos>`. Please refer to the example shown for the training data, namely `train.zh.jdp` for the source-language file and `train.align` for the alignment file, and similarly for the development data. We cannot release the training data due to license restrictions.
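For reference, a minimal sketch of reading one such alignment line into source-to-target position pairs (assuming 0-based positions, as in GIZA++ output):

```python
# Hypothetical helper: parse one GIZA++-style alignment line, e.g. "0-0 1-2 2-1",
# into a list of (source_position, target_position) pairs (0-based).
def parse_alignment_line(line):
    pairs = []
    for token in line.split():
        src, trg = token.split('-')
        pairs.append((int(src), int(trg)))
    return pairs

print(parse_alignment_line('0-0 1-2 2-1'))  # [(0, 0), (1, 2), (2, 1)]
```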
The training instances should then be extracted from the training and development data (replace the example files with your real, reasonably sized data):

```
bash runprepare-swap.train.sh
bash runprepare-swap.dev.sh
```
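As a rough illustration of what a reordering (swap) instance captures: for a pair of source words, the model learns whether their translations appear in the same or the swapped order on the target side. A minimal sketch, assuming the label is derived by comparing aligned target positions (this is an assumption based on the script names, not the repository's exact extraction logic):

```python
# Hypothetical swap-label derivation for two source positions (src_i < src_j).
# alignment: list of (src, trg) pairs; returns 'swap', 'in-order', or None.
def swap_label(alignment, src_i, src_j):
    trg_i = [t for s, t in alignment if s == src_i]
    trg_j = [t for s, t in alignment if s == src_j]
    if not trg_i or not trg_j:
        return None                  # one of the words is unaligned
    if min(trg_j) < min(trg_i):
        return 'swap'                # target order reverses source order
    return 'in-order'

print(swap_label([(0, 0), (1, 2), (2, 1)], 1, 2))  # 'swap'
```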
Training is done by calling the Python program as shown in the following script:

```
bash runtrain.gen.sh <GPU1> <GPU2> <RANDOM_SEED>
```
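For example (assuming the GPU arguments follow Theano 1.0 device naming such as `cuda0`; the seed is arbitrary, so adapt both to your setup):

```
bash runtrain.gen.sh cuda0 cuda1 42
```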