
RevitaAI/universal-lemmatizer

 
 


universal-lemmatizer

A neural sequence-to-sequence model for lemmatization, built on the OpenNMT and PyTorch libraries.
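To illustrate what treating lemmatization as a sequence-to-sequence problem can look like, here is a minimal sketch. It assumes the word form is split into characters, optionally followed by morphological tags, and that the lemma is generated character by character. The function names and the tag format are hypothetical and are not taken from this repository's code; tokens are assumed to contain no whitespace.

```python
def encode_source(word, tags=()):
    """Return the source side: space-separated characters, then optional tags.

    Hypothetical helper for illustration only.
    """
    return " ".join(list(word) + list(tags))


def encode_target(lemma):
    """Return the target side: the lemma as space-separated characters."""
    return " ".join(lemma)


def decode_target(sequence):
    """Join a space-separated character sequence back into a lemma string."""
    return "".join(sequence.split(" "))


if __name__ == "__main__":
    # The tag strings below are illustrative, not a format required by the model.
    print(encode_source("cats", ["UPOS=NOUN", "Number=Plur"]))
    print(encode_target("cat"))
    print(decode_target("c a t"))
```

A sequence-to-sequence toolkit would then be trained on pairs of such source and target lines, and its output decoded back into a lemma string.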

The Universal Lemmatizer is part of the Turku-neural-parser-pipeline (https://turkunlp.github.io/Turku-neural-parser-pipeline/), which provides pre-trained models for more than 50 languages. It achieved state-of-the-art lemmatization results in the CoNLL-18 Shared Task on Parsing Universal Dependencies; see the TurkuNLP entry at http://universaldependencies.org/conll18/results-lemmas.html.

Requires a Python 3 environment with PyTorch installed.
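As a quick sanity check of this requirement, the following sketch reports whether the interpreter is Python 3 and whether the `torch` package can be found. The helper name `check_environment` is hypothetical and is not part of this repository; the check does not verify any particular PyTorch version.

```python
import importlib.util
import sys


def check_environment():
    """Report whether the interpreter is Python 3 and whether PyTorch is importable."""
    return {
        "python3": sys.version_info.major == 3,
        # find_spec returns None when the package cannot be located
        "pytorch": importlib.util.find_spec("torch") is not None,
    }


if __name__ == "__main__":
    status = check_environment()
    print("Python 3:", "ok" if status["python3"] else "missing")
    print("PyTorch:", "ok" if status["pytorch"] else "missing")
```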

Running the lemmatizer with pre-trained models

See Turku-neural-parser-pipeline documentation at https://turkunlp.github.io/Turku-neural-parser-pipeline/.

Training new models

See documentation at https://turkunlp.org/Turku-neural-parser-pipeline/training.

