Marker v1!
This is the release of marker v1, a complete rewrite from scratch.
- 2x faster due to a new layout model
- Consistent internal schema for blocks and pages
- Modular architecture with processors and renderers that can easily be overridden
- JSON chunk and markdown output
- Lots of units tests
- Much higher output quality
What's Changed
- feat: API server file upload support by @tjbck in #332
- Upgrade line joining by @iammosespaulr in #344
- Surya Layout model and batch multiplier updates by @iammosespaulr in #335
- Initial document skeleton by @VikParuchuri in #345
- Add PDF Provider by @iammosespaulr in #346
- Add Layout Merging by @iammosespaulr in #348
- Vik v2 by @VikParuchuri in #349
- Layout Merging fixes and tests by @iammosespaulr in #350
- Vik v2 by @VikParuchuri in #351
- Decouple Span from Line by @iammosespaulr in #352
- Vik v2 by @VikParuchuri in #353
- Add simple line and span renderer, add blocktype class by @VikParuchuri in #357
- Add markdown renderer, swap how ids are named by @VikParuchuri in #358
- Fix markdown output by @VikParuchuri in #359
- Add OCR Builder by @iammosespaulr in #356
- Output images, clean up other output formats by @VikParuchuri in #362
- Vik v2 by @VikParuchuri in #364
- Cleanup and speed up tests by @iammosespaulr in #363
- Add CI tests by @iammosespaulr in #366
- Add debug utils, fix output quality issues by @VikParuchuri in #367
- Allow Overriding Node Classes by @iammosespaulr in #368
- Reorganize tests by @VikParuchuri in #369
- Minor debugging and misc fixes by @iammosespaulr in #370
- Chunk JSON output by @VikParuchuri in #371
- Vik v2 by @VikParuchuri in #372
- Add code processor, fix issues with structure by @VikParuchuri in #375
- Add Line merging across Pages and Columns by @iammosespaulr in #373
- PDF Converter Initialization refactor + Tests by @iammosespaulr in #379
- Wire up convert_single by @VikParuchuri in #380
- Fix tests by @VikParuchuri in #381
- Add Docstrings for Processors, Builders and Converters and
-l
to list them from theconvert.py
CLI + Misc Fixes by @iammosespaulr in #382 - Fix broken text by @VikParuchuri in #383
- Fix marker app by @VikParuchuri in #384
- Fix marker server by @VikParuchuri in #385
- Misc Bugfixes by @iammosespaulr in #386
- Vik v2 by @VikParuchuri in #387
- Update tests by @iammosespaulr in #388
- Additional Fixes by @iammosespaulr in #390
- Vik v2 by @VikParuchuri in #391
- Marker v2 by @VikParuchuri in #392
- Improve comparison performance by @VikParuchuri in #394
- Dev by @VikParuchuri in #395
New Contributors
Full Changelog: v0.3.10...v1.0.0