Releases: JULIELab/jcore-base
v2.6.1
Bug Fixes
- [bug] Fix a bug in XML mapper where empty elements are skipped over #171
- [bug] jcore-lingpipe-gazetteer-ae: Repair tests #110
Miscellaneous Changes
- [closed] Switch from Travis to GitHub Actions #122
- [closed] XMLDBReader: Add check for documents unchanged in the XMI table #119
- [closed] Update all test to JUnit 5 #118
- [closed] XMI DB Reader & DB Writer: Take care of XmiMetaData #89
- [closed] PMC reader check section offset #57
- [closed] JSBD: Set document structure borders #37
v2.6.0
New components
- [new component] Add GNP Multiplier for PMC DB Reader #152
- [new component] Add GNP multipliers for XMI and XML DB multiplier readers #150
- [new component] Add a GNormPlus UIMA component #147
- [new component] Add reader for NLM-Gene #145
- [new component] Add an MMAX2 reader. #136
- [new component] Add GNormPlus BioC XML reader #131
- [new component] Add a GNormPlus BioC Writer #129
- [new component] Add a PMC DB Reader #128
- [new component] Add a consumer to write relations to Neo4j #113
- [new component] Add an embedding writer #102
- [new component] Add a Flair embedding adding component #101
- [new component] Add a FLAIR NER component #100
General Enhancements
- [enhancement] Add an annotation remover. #154
- [enhancement] Add a truncation size parameter to PMC XML multipliers #153
- [enhancement] Allow regular expression feature value filters for EntityConsumer #146
- [enhancement] Add H2-based annotation source for AnnotationAdder #138
- [enhancement] Add a feature-based format to the annotation adder #137
- [enhancement] Let GNP BioC writer add Gene annotations. #134
- [enhancement] Implement max XMI ID retrieval for the BioC GNormPlus Reader #133
- [enhancement] Add a mechanism to avoid mirror subset reset for updated JeDIS document whose text hasn't changed #130
- [enhancement] PMCReader: Add option to omit bibliographical references #127
Bug Fixes
- [bug] Fix a bug where half of the DBMultiplierReader IDs were not actually processed #144
- [bug] ESConsumer: Fix bugs with cache index when resource file is updated #126
- [bug] Fix Bug in JSBD where Cut Away Types can still be included in the begin of a sentence. #125
- [bug] JSBD: Fix bug that occurs when cut away types and sentence delimiter types are given #121
Miscellaneous Changes
- [closed] Map payloads to features for AnnotationAdder #124
- [closed] ES Consumer: Provide persistent Map and Addon terms providers #123
- [closed] Add a flow controller that creates a flow based on a CAS annotation #120
- [closed] JCoreOverlapIndex: Return the list instead of a stream #117
- [closed] lingpipe gazetteer: fix offset issues #116
v2.5.0
v2.4.0
This release sees quite a few bug fixes and streamline in various components.
Most importantly, there are a few new components:
jcore-annotation-adder-ae
: Reads annotations from external sources and adds them to the CAS. Helpful for the integration of annotations that could not be created from within UIMA.jcore-db-checkpoint-ae
: This is a JeDIS component. It helps to set checkpoints within a pipeline that are stored in the database and thus provide collection processing progression information. This component can also mark the end of a pipeline, setting documents that have reached this point in a pipeline tois_processed
and removing theis_in_process
flag.jcore-bc2gm-reader
: Allows to read the original format used for the data of the BioCreative II Gene Mention challenge.jcore-bc2gmformat-writer
writes CAS annotation in the BC2GM format.jcore-lingscope-ae
wraps the Lingscope code as a UIMA AE. The component needs to be further configured regarding which models to use, possibly requiring downloads from the original project. One concrete configuration has been made available for JCoRe in the jcore-projects repository.jcore-mutationfinder-ae
wraps the MutationFinder algorithm as a UIMA component. Finds single point gene mutations via regular expressions in text and normalizes them.jcore-ppd-writer
writes CAS annotation data into the piped format (that is, tokens that are annotated with meta data, separated by the pipe character: The|DET blue|ADJ house|NN.|.). This format is used be the JNET named entity recognizer for training.
v2.3.8.1
This is the first bugfix release for version 2.3.8, hence 2.3.8.1. JeDIS components have reached version 2.3.9.
This is a critical bugfix release for JeDIS. There was a bug in the jcore-xmi-splitter
that could destroy data consistence in the database when adding annotation after the base document was already written.
v2.3.8
v2.3.3
v2.3.0
This update sees a lot of cleaning (e.g. descriptor versions) as well as new components. Amongst others, the JeDIS components that realize a UIMA-based annotation warehouse around a Postgres database have been added. Those consist of the jcore-xmi-db-writer
, the jcore-xmi-db-reader
and the jcore-xmi-splitter
, where the latter is at at the core at the two former components and found in the JCoRe Dependencies repository.
For more new components, browse the repository list associated with this release.
JCoRe 2.2.0 - The medical & DH Update
This release features a new reader (jcore-dta-reader) and changes so that medical models (namely FraMed) could be trained with jsbd and jtbd.
The MST Parser was dismissed due to ongoing problems with our implementation and the trove library.
We're planning on having a release 2.3.0 in short time that incorporates MATE Tools as new dependency parser.
If you're planning on using a dependency parser in the time between the two releases please refer to JCoRe 2.1.0.
JCoRe Base 2.1.0
v2.1.0 changed to version 1.5.0