Skip to content

Commit

Permalink
Incremented version to 0.2.2
Browse files Browse the repository at this point in the history
  • Loading branch information
bill-baumgartner committed Jul 3, 2019
1 parent 2279144 commit 532c670
Show file tree
Hide file tree
Showing 2 changed files with 8 additions and 2 deletions.
6 changes: 6 additions & 0 deletions CHANGES.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,9 @@
# Release v0.2.2

* Updated all Document Readers to validate spans as they are imported. Specifically, discontinuous spans are validated in two ways. One, if a discontinuous span contains adjacent component spans, e.g. \[35..43\]\[44..52\], or component spans are are separated by only whitespace, then the component spans are combined, e.g. \[35..52\]. Second, if the discontinuous span contains a component span that is nested in another component span, e.g. \[78..92\]\[88..92\], then the nested span is removed, e.g. \[78..92\].
* Revised the CoNLLCoref Document Writer to exclude two annotation types that are included in the CRAFT coreference annotations, but that should not be included in the CoNLL-Coref 2011/12 file format, namely 'nonreferential pronoun' and 'partonymy relation'.
* Added discontinuous span validation for the CoNLLCorefDocumentWriter. Mapping spans to token boundaries can cause instances of nested discontinuous spans, so the validation code for discontinuous spans was added to the CoNLL-Coref document writer. There was a case in 16628246.xml (coreference annotations) where "7.5 dbc embryos" was annotated as "7" .. "5 dbc embryos". In this case the "7" maps to the "7.5" token and the "5" also maps to the "7.5" token, so the final annotation had two instances of the "7.5" token span. Seems like the original annotation might be faulty, i.e. the "7" .. "5" split, but that's the way it is, so a fix was required.

# Release v0.2.1

* Revised treebank-to-dependency conversion to output CoNLL-X format
Expand Down
4 changes: 2 additions & 2 deletions pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@
<modelVersion>4.0.0</modelVersion>
<groupId>edu.ucdenver.ccp</groupId>
<artifactId>file-conversion</artifactId>
<version>0.2.1</version>
<version>0.2.2</version>

<properties>
<project.build.sourceEncoding>UTF-8</project.build.sourceEncoding>
Expand All @@ -28,7 +28,7 @@
<dependency>
<groupId>edu.ucdenver.ccp</groupId>
<artifactId>datasource-fileparsers</artifactId>
<version>0.7.1</version>
<version>0.7.3</version>
</dependency>
<dependency>
<groupId>edu.ucdenver.ccp</groupId>
Expand Down

0 comments on commit 532c670

Please sign in to comment.