Skip to content

Linguistic Analysis Pipeline

rzanoli edited this page May 22, 2015 · 6 revisions

The Linguistic Analysis Pipeline (LAP) is a collection of annotation components for Natural Language Processing (NLP) where component integration is based on the Apache UIMA framework, and this is achieved by using the DKPro tool.
The table below reports the available pipelines, which language they can be used with (LG) and if they are ready for use or have to be installed (INSTL; "YES" if it needs to be installed, "NO" otherwise). EDAs that are currently using those pipelines are reported too (EDA) whereas the field description (DESC) contains links to the pipeline pages.

| Pipeline | LG | INSTL | EDA | |----------------------| :------: | :--------: | :---------: | :---------: | | OpenNLP | ENG,ITA,GER | NO | BIUTEE,EditDistance,TIE | | TextPro | ITA | YES | EditDistance | | TreeTagger | ENG,ITA,GER | YES | EditDistance,TIE,P1EDA,AdArte | | CLaRK | BUL | NO | EditDistance |

* We use an asterisk to highlight those pipelines that are available for a specific EDA only. Other EDAs cannot use it. It could be due to the fact that the pipeline does not implement the expected interface yet.

Clone this wiki locally