Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can transformer model reproduce WMT14 English-German BLEU score? #9

Closed
SkyAndCloud opened this issue Sep 5, 2018 · 4 comments
Closed

Comments

@SkyAndCloud
Copy link

Hi, thank you for great work and awesome documents. I have a question after I read your transformer example which is used on WMT2017 English-German corpus that if you have tested marian's performance using this example on WMT2014 English-German corpus and achieve equivalent BLEU score as reported in transformer original paper? I think this point is very important because only via this can you prove your transformer implementation is correct and it is also important for research use.
Thanks!

@emjotde
Copy link
Member

emjotde commented Sep 25, 2018

Sorry for missing this issue. Surely there are other ways we can prove that our implementation is correct, for instance by winning the WMT2018 shared task on news translation for English-German:

https://arxiv.org/abs/1809.00196

@SkyAndCloud
Copy link
Author

Well, I have read your paper and here are my questions:

  1. I think transformer example you provided is equivalent to transformer base model while you use big model in WMT2018, could you share your training config when using big model?
  2. I'm new to neural language modeling and feel confused after reading FAQ about train lm-transformer on monolingual corpus. Could you provide an example of training a transformer style language model? Just config is OK. Thanks for your great work!

@snukky
Copy link
Member

snukky commented Oct 1, 2018

  1. A config file for training Transformer Big has been mentioned here: Weird results with Transformer-Big settings marian-dev#298 (comment)
  2. A toy example for training Transformer-based LM is in our regression tests: https://github.com/marian-nmt/marian-regression-tests/blob/master/tests/training/lm/test_lm-transformer.sh

@SkyAndCloud
Copy link
Author

Thanks a lot.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants