You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
For a long time PyDelphin has included on MRS, DMRS, and EDS a slot for the 'identifier' ('ident' in the original DTDs) field, which is basically unused. The field only gets filled in if it is encoded in a representation that is read in. There are few comments about it in the LKB code. Here's one from lingo/lkb/src/rmrs/dtd-notes.txt:
ident is an attribute on rmrs's to identify which utterance they
belong with. The HoG currently uses a wrapper around the RMRS, with
identifying information there instead. Hinoki uses the ident
identifier but may switch to a wrapper, in which case ident may be
removed. In any case it is optional.
In PyDelphin, not all codecs can handle identifiers (the PENMAN ones don't, nor do any EDS ones). These identifiers could be useful for, e.g., exporting a corpus of *MRS representations which encode which items they came from.
It seems like the appropriate form of the identifier may depend on the task. In some cases, just an i-id from a profile would be enough, while for others a parse-id and result-id may be needed to distinguish among multiple MRSs from one item.
The text was updated successfully, but these errors were encountered:
For a long time PyDelphin has included on MRS, DMRS, and EDS a slot for the 'identifier' ('ident' in the original DTDs) field, which is basically unused. The field only gets filled in if it is encoded in a representation that is read in. There are few comments about it in the LKB code. Here's one from
lingo/lkb/src/rmrs/dtd-notes.txt
:(note that XMT uses the HoG strategy)
And in
lingo/lkb/src/tsdb/lisp/redwoods.lisp
, it seems to be formatted using a few other fields:And in
lingo/lkb/src/rmrs/dmrs.lisp
, it (as far as I can tell) uses the first column of a [incr tsdb()] file:In PyDelphin, not all codecs can handle identifiers (the PENMAN ones don't, nor do any EDS ones). These identifiers could be useful for, e.g., exporting a corpus of *MRS representations which encode which items they came from.
It seems like the appropriate form of the identifier may depend on the task. In some cases, just an
i-id
from a profile would be enough, while for others aparse-id
andresult-id
may be needed to distinguish among multiple MRSs from one item.The text was updated successfully, but these errors were encountered: