Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[distsim] LIN-PROX has no nouns (German, CONLL corpus) #291

Open
gilnoh opened this issue Nov 6, 2013 · 1 comment
Open

[distsim] LIN-PROX has no nouns (German, CONLL corpus) #291

gilnoh opened this issue Nov 6, 2013 · 1 comment

Comments

@gilnoh
Copy link
Member

gilnoh commented Nov 6, 2013

After successful generation and redis-conversion; the lexical resource based on Lin proximity funtions for the German.

However, the resource has no (or almost no) nouns. I tried with various common German nouns, but couldn't generate any match.

For the moment, I do not know how I can iterate over all rules (or all entries), so this is just a suspects. But it is quite likely that LIN-PROX has only ADV, V and ADJ.

Is this normal for LIN-PROX? or something gone wrong?

@gilnoh
Copy link
Member Author

gilnoh commented Nov 6, 2013

You can reproduce this with the following intermediate size corpus: (1/30th of SDEWAC)
http://www.cl.uni-heidelberg.de/~noh/sdewac_part01.mstparsed.utf8.conll.gz

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant