module__org.bibliome.alvisnlp.modules.ccg.CCGParser


Robert Bossy edited this page Jul 27, 2017 · 1 revision

org.bibliome.alvisnlp.modules.ccg.CCGParser

Synopsis

Syntactic parsing with the CCG Parser.

Description

org.bibliome.alvisnlp.modules.ccg.CCGParser applies the CCG Parser to sentences specified as annotations in the sentenceLayerName layer. Sentence words are specified by annotations in the wordLayerName layer. For each sentence, only words entirely included in the sentence are considered; WoSMig and SeSMig should create these layers with the appropriate annotations. Additionally, CCGParser uses the POS tag of each word, read from the posFeatureName feature.
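The containment rule above can be illustrated with a minimal sketch. It assumes that annotations carry character offsets with an exclusive end; the `Span` class and the `words_in_sentence` helper are hypothetical names used only for illustration and are not part of AlvisNLP/ML.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    """Hypothetical stand-in for an annotation: character offsets, end exclusive."""
    start: int
    end: int


def words_in_sentence(sentence, words):
    """Keep only the words whose span lies entirely inside the sentence span."""
    return [w for w in words
            if sentence.start <= w.start and w.end <= sentence.end]


sentence = Span(0, 10)
words = [Span(0, 3), Span(4, 10), Span(8, 12)]  # the last word crosses the boundary
kept = words_in_sentence(sentence, words)
print(kept)
```

In this sketch a word that straddles the sentence boundary is dropped rather than truncated, which is one reading of "entirely included".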

org.bibliome.alvisnlp.modules.ccg.CCGParser creates a relation named relationName in each section and a tuple in this relation for each dependency. This relation is ternary:

sentenceRole: the first argument is the sentence in which the dependency was found;
headRole: the second argument is the head word of the dependency;
dependentRole: the third argument is the dependent word of the dependency.

org.bibliome.alvisnlp.modules.ccg.CCGParser adds to each dependency tuple a feature linkageNumberFeature with the number of the linkage to which the tuple belongs, and a feature dependencyLabelFeature with the label of the dependency.
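As a rough picture of what one dependency tuple holds, the sketch below models it as a plain dictionary. This is not the AlvisNLP/ML data model: `make_dependency` is a hypothetical helper, the role and feature names are taken from the parameter defaults listed on this page (sentence, head, dependent, label), and the words and the `nsubj` label are made-up sample values. The linkage number is left out because no parameter on this page documents its feature name.

```python
def make_dependency(sentence, head, dependent, label,
                    sentence_role="sentence", head_role="head",
                    dependent_role="dependent", label_feature="label"):
    """Build a dict standing in for one ternary dependency tuple (illustration only)."""
    return {
        # The three arguments of the tuple, keyed by role name.
        "arguments": {
            sentence_role: sentence,
            head_role: head,
            dependent_role: dependent,
        },
        # Features attached to the tuple; only the dependency label is modelled here.
        "features": {label_feature: label},
    }


# Sample values, not actual parser output.
dep = make_dependency("The protein binds DNA", "binds", "protein", "nsubj")
print(dep["arguments"]["head"], dep["features"]["label"])
```

Overriding the keyword arguments mirrors what changing sentenceRole, headRole, dependentRole or labelFeatureName would do to the names used in the output.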

Parameters

executable

Optional

Type: ExecutableFile

Path to the CCG Parser executable.

parserModel

Optional

Type: InputDirectory

Path to the parser model file.

superModel

Optional

Type: InputDirectory

Path to the CCG supertagger model file.

constantRelationFeatures

Optional

Constant features to add to each relation created by this module.

constantTupleFeatures

Optional

Constant features to add to each tuple created by this module.

stanfordMarkedUpScript

Optional

Type: InputFile

Path to the markedup script for Stanford tagset output. See Biomedical parsing for CCG.

stanfordScript

Optional

Type: ExecutableFile

Post-processing script for Stanford tagset output. See Biomedical parsing for CCG.

dependentRole

Default value: dependent

Type: String

Name of the role that denotes the dependent word in the dependency tuple.

documentFilter

Default value: true

Type: Expression

Process only documents that satisfy this filter.

formFeatureName

Default value: form

Type: String

Name of the feature containing the word surface form.

headRole

Default value: head

Type: String

Name of the role that denotes the head word in the dependency tuple.

internalEncoding

Default value: UTF-8

Type: String

Character encoding of CCG tools input and output.

labelFeatureName

Default value: label

Type: String

Name of the feature containing the dependency label.

lpTransformation

Default value: false

Whether to translate the output into the LP tag set.

maxRuns

Default value: 1

Maximum number of CCG runs.

maxSuperCats

Default value: 500000

Maximum number of supercats before the parse explodes (cited from CCG documentation).

posFeatureName

Default value: pos

Type: String

Name of the feature containing the word POS tag.

relationName

Default value: dependencies

Type: String

Name of the relation containing dependencies.

sectionFilter

Default value: boolean:and(true, boolean:and(nav:layer:sentences(), nav:layer:words()))

Type: Expression

Process only sections that satisfy this filter.

sentenceFilter

Default value: true

Type: Expression

Process only sentences that satisfy this filter.

sentenceLayerName

Default value: sentences

Type: String

Name of the layer containing sentence annotations.

sentenceRole

Default value: sentence

Type: String

Name of the role that denotes the sentence to which a dependency tuple belongs.

wordLayerName

Default value: words

Type: String

Name of the layer containing word annotations.
