Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

blue score #15

Open
jigyasa06 opened this issue Oct 7, 2018 · 2 comments
Open

blue score #15

jigyasa06 opened this issue Oct 7, 2018 · 2 comments

Comments

@jigyasa06
Copy link

hi i followed all your steps, took your dataset and done all the pre processing and tokenization but i did not got the same results as you got. I got BLUE score as 0. can you please tell me why is it so? @Avmb @rsennrich

@jigyasa06
Copy link
Author

jigyasa06 commented Oct 7, 2018

output_code_documentation.txt

PLease see this file one's this is code documentation file after using translate.py
output_code_generation.txt

this is code generation
and i want to know in the nmt_output you have written that you have done post-processing what is that all about? @rsennrich @Avmb
Please let me know as soon as possible

@Avmb
Copy link
Collaborator

Avmb commented Nov 14, 2018

Hi,
it's difficult for us to determine what went wrong.

My suggestion is that you first train a translation model between natural languages, for instance English to German. There are tutorials to do this for Nematus http://data.statmt.org/wmt17_systems/training/ or Marian https://github.com/marian-nmt/marian-examples .

Once managed to set up a system, do translation and get BLEU scores, you can swap in the code-docstring-corpus files and re-run your pipeline.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants