Releases: axa-group/Parsr
v0.6: Merge branch 'develop'
Changes
- Added Jupyter Notebook
- Improved Headings detection module (significant reduction in false positives)
- Improved Table detection module
- Improved Paragraph detection module
- Improved Git README
- Several GUI & Server bug fixes
v0.5: Merge branch 'develop'
Changes
- New List detection Module (bullet and numeric type list)
- Improved Link detection module for the pdfminer extractor
- Improved Heading detection module (font usage ratio used to detect headings)
- Markdown exporter updated to export tables using standard Markdown syntax instead of HTML syntax
- Improved overall output accuracy
- Several GUI improvements
Dependencies
- Added GraphicsMagick for GUI thumbnail generation
v0.4
Changes
- Greatly improved LinesToParagraph module.
- Greatly improved Headings detection module.
- Promotion of pdfminer as the primary PDF extractor, plus related output cleaning.
- Improved text redundancy/duplication detection and treatment.
- Leaner docker implementation for faster deploys.
- Several Vue UI improvements (demo/vue-viewer), including text inspector, forward and next buttons, and more.
- Several bugfixes in markdown export, including more flexible tables with rowspans and colspans.
- Windows deployment improvements under both bare-metal and docker flavors.
v0.3
v0.2
Changes
- New responsive Vue-based UI for visualising output under demo/vue-viewer
- New pdf extractor option: pdfminer
- New image extractor option: google-vision
- Multilingual documentation (starting with Chinese for now)
- Better naming of the different areas of the pipeline (input, output, ...) to make the code easier to understand.
- Improved header detection
- Externalized default configuration files for modules (module/abc/defaultConfig.json)
- Several bugfixes