Can transformer model reproduce WMT14 English-German BLEU score? #9

SkyAndCloud · 2018-09-05T06:57:38Z

Hi, thank you for the great work and the excellent documentation. I have a question after reading your transformer example, which uses the WMT2017 English-German corpus: have you tested Marian's performance with this example on the WMT2014 English-German corpus, and does it achieve a BLEU score equivalent to the one reported in the original Transformer paper? I think this point is very important, because only in this way can you prove that your transformer implementation is correct, and it also matters for research use.
Thanks!

emjotde · 2018-09-25T03:26:04Z

Sorry for missing this issue. Surely there are other ways we can prove that our implementation is correct, for instance by winning the WMT2018 shared task on news translation for English-German:

https://arxiv.org/abs/1809.00196

SkyAndCloud · 2018-09-25T16:21:32Z

Well, I have read your paper and here are my questions:

I think the transformer example you provided is equivalent to the Transformer base model, while you used the big model for WMT2018. Could you share your training config for the big model?
I'm new to neural language modeling and was confused after reading the FAQ entry about training an lm-transformer on a monolingual corpus. Could you provide an example of training a transformer-style language model? Just the config is fine. Thanks for your great work!

snukky · 2018-10-01T08:45:49Z

A config file for training Transformer Big is mentioned here: "Weird results with Transformer-Big settings", marian-dev#298 (comment)
A toy example for training a Transformer-based LM is in our regression tests: https://github.com/marian-nmt/marian-regression-tests/blob/master/tests/training/lm/test_lm-transformer.sh
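
For readers following the base-versus-big distinction raised in this thread, here is a minimal sketch of the two model sizes as reported in the original Transformer paper (Vaswani et al., 2017) for English-German. The class and field names are hypothetical and chosen for illustration only; this is not Marian's config format and not the config linked above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TransformerSize:
    """Hypothetical container for the sizes listed in the original paper."""
    layers: int     # encoder depth and decoder depth
    d_model: int    # embedding / hidden dimension
    d_ff: int       # inner dimension of the feed-forward sublayer
    heads: int      # number of attention heads
    dropout: float  # residual dropout rate used for English-German

    @property
    def head_dim(self) -> int:
        # Each head works on an equal slice of the model dimension.
        return self.d_model // self.heads


# Values from the paper's model-variation table (base and big rows).
BASE = TransformerSize(layers=6, d_model=512, d_ff=2048, heads=8, dropout=0.1)
BIG = TransformerSize(layers=6, d_model=1024, d_ff=4096, heads=16, dropout=0.3)

for name, size in (("base", BASE), ("big", BIG)):
    print(name, size, "head_dim =", size.head_dim)
```

The big model doubles the width and the head count while keeping the depth and the per-head dimension unchanged, which is why a base-sized example is not expected to match big-model scores.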

SkyAndCloud · 2018-10-01T11:17:38Z

Thanks a lot.

SkyAndCloud closed this as completed Oct 1, 2018
