-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merge music branch to develop. #56
Commits on Sep 1, 2023
-
Configuration menu - View commit details
-
Copy full SHA for fc73bc9 - Browse repository at this point
Copy the full SHA fc73bc9View commit details
Commits on Sep 7, 2023
-
Add translator dictionary, defining translation from internal shorten…
…ed labels (model output) to more verbose format usable by `export_music.py`.
Configuration menu - View commit details
-
Copy full SHA for bda025a - Browse repository at this point
Copy the full SHA bda025aView commit details
Commits on Sep 13, 2023
-
Configuration menu - View commit details
-
Copy full SHA for fe9aceb - Browse repository at this point
Copy the full SHA fe9acebView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2da0f24 - Browse repository at this point
Copy the full SHA 2da0f24View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2f661d2 - Browse repository at this point
Copy the full SHA 2f661d2View commit details
Commits on Sep 14, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 5373283 - Browse repository at this point
Copy the full SHA 5373283View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1b9b676 - Browse repository at this point
Copy the full SHA 1b9b676View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7cc5293 - Browse repository at this point
Copy the full SHA 7cc5293View commit details -
Configuration menu - View commit details
-
Copy full SHA for fae66ce - Browse repository at this point
Copy the full SHA fae66ceView commit details -
Add exporting music directly in
parse_folder.py
usingconfig.ini
……to define settings and `page_parser.py` to create music exporter object of `music/export_music/ExportMusicPage`.
Configuration menu - View commit details
-
Copy full SHA for b02ebb2 - Browse repository at this point
Copy the full SHA b02ebb2View commit details
Commits on Sep 18, 2023
-
Configuration menu - View commit details
-
Copy full SHA for c07ba6a - Browse repository at this point
Copy the full SHA c07ba6aView commit details -
Add sorting music regions "in reading order" using
y_min
of boundin……g box around polygon.
Configuration menu - View commit details
-
Copy full SHA for 0afb958 - Browse repository at this point
Copy the full SHA 0afb958View commit details
Commits on Oct 3, 2023
-
Remove
RegionCategory
andLineCategory
enums hard-coded in `layou……t.py`. Get names only from Yolo `result.names`.
Configuration menu - View commit details
-
Copy full SHA for 9371467 - Browse repository at this point
Copy the full SHA 9371467View commit details -
Configuration menu - View commit details
-
Copy full SHA for b733c74 - Browse repository at this point
Copy the full SHA b733c74View commit details
Commits on Oct 25, 2023
-
Remove music exporter option from
parse_folder.py
and make `export_……music.py` a stand-alone script.
Configuration menu - View commit details
-
Copy full SHA for b40e479 - Browse repository at this point
Copy the full SHA b40e479View commit details
Commits on Oct 27, 2023
-
Add option to have more LineCroppers and ORC engines. Set every other…
… text Layout engine to work only with 'text' lines.
Configuration menu - View commit details
-
Copy full SHA for 7b93d16 - Browse repository at this point
Copy the full SHA 7b93d16View commit details -
Configuration menu - View commit details
-
Copy full SHA for 50418dd - Browse repository at this point
Copy the full SHA 50418ddView commit details
Commits on Nov 3, 2023
-
Add PageLayout splitting enabling running multiple layout parsers eac…
…h with its own setting and set of categories to work with.
Configuration menu - View commit details
-
Copy full SHA for 8853046 - Browse repository at this point
Copy the full SHA 8853046View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5ce86d1 - Browse repository at this point
Copy the full SHA 5ce86d1View commit details -
Merge branch 'develop' into music
# Conflicts: # pero_ocr/core/layout.py
Configuration menu - View commit details
-
Copy full SHA for 16e8ce3 - Browse repository at this point
Copy the full SHA 16e8ce3View commit details
Commits on Dec 1, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 3e2fcb8 - Browse repository at this point
Copy the full SHA 3e2fcb8View commit details -
Configuration menu - View commit details
-
Copy full SHA for a49409c - Browse repository at this point
Copy the full SHA a49409cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 94bcae8 - Browse repository at this point
Copy the full SHA 94bcae8View commit details -
Configuration menu - View commit details
-
Copy full SHA for d688bc0 - Browse repository at this point
Copy the full SHA d688bc0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 65eacb5 - Browse repository at this point
Copy the full SHA 65eacb5View commit details
Commits on Dec 7, 2023
-
Configuration menu - View commit details
-
Copy full SHA for bb88217 - Browse repository at this point
Copy the full SHA bb88217View commit details
Commits on Dec 14, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 3354a5b - Browse repository at this point
Copy the full SHA 3354a5bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 59dd330 - Browse repository at this point
Copy the full SHA 59dd330View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3c3322e - Browse repository at this point
Copy the full SHA 3c3322eView commit details -
New config section parsing and other changes after code review.
- in layout_helpers.py add searching for a region by ID - in page_parser.py change log error (missing crop) back to raise exception - in page_parser.py change naming convention for config section names to {section_name}_\d+. All following examples are possible (with warnings) for OCR section: OCR, OCR_0, OCR_0_asdf...
Configuration menu - View commit details
-
Copy full SHA for 27a82eb - Browse repository at this point
Copy the full SHA 27a82ebView commit details
Commits on Dec 22, 2023
-
Add image_size to Yolo engine. Add
config_get_list
to get list of c……ategories instead of json string. - `image_size` can be either int or (int, int) according to ultralytics docu: https://docs.ultralytics.com/modes/predict/#inference-arguments
Configuration menu - View commit details
-
Copy full SHA for ae516f4 - Browse repository at this point
Copy the full SHA ae516f4View commit details -
Store box confidence (in LayoutExtractorYOLO) to RegionLayout and exp…
…ort to page_xml region custom.
Configuration menu - View commit details
-
Copy full SHA for e5dd2b8 - Browse repository at this point
Copy the full SHA e5dd2b8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 529234e - Browse repository at this point
Copy the full SHA 529234eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 036daf3 - Browse repository at this point
Copy the full SHA 036daf3View commit details -
Add translating short music output to original encoding.
- `TransformerEngineLineOCR` now translates all outputs according to `music_dictionary` dictionary in ocr.json (or omr.json), if exists. - Move `MusicTranslator` to a separate file for nicer imports everywhere.
Configuration menu - View commit details
-
Copy full SHA for 697990c - Browse repository at this point
Copy the full SHA 697990cView commit details
Commits on Dec 29, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 893baca - Browse repository at this point
Copy the full SHA 893bacaView commit details -
Add
CATEGORIES
option to sorters and delete therefore unused functi……on `sort_regions_in_reading_order`
Configuration menu - View commit details
-
Copy full SHA for e2a26de - Browse repository at this point
Copy the full SHA e2a26deView commit details
Commits on Jan 4, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d59884a - Browse repository at this point
Copy the full SHA d59884aView commit details
Commits on Jan 15, 2024
-
- print more informative warning message - add `__init__.py` to enable python package functionality for future.
Configuration menu - View commit details
-
Copy full SHA for 0a1a0c0 - Browse repository at this point
Copy the full SHA 0a1a0c0View commit details
Commits on Jan 16, 2024
-
Delete unused function from
layout
, music integration.- delete `layout.py/PageLayout.get_regions_of_category` as its functionality was substituted by `layout_helpers.py/split_page_layout_by_categories`. For sorting regions use (Naive|Smart)RegionSorter during `parse_folder` execution resulting in regions being sorted in xml output. - in `music_exporter.py` make translator optional argument.
Configuration menu - View commit details
-
Copy full SHA for faf0496 - Browse repository at this point
Copy the full SHA faf0496View commit details
Commits on Jan 18, 2024
-
Configuration menu - View commit details
-
Copy full SHA for dd4bbc8 - Browse repository at this point
Copy the full SHA dd4bbc8View commit details -
Change config categories, line_categories, add decoder filter.
- `LINE_CATEGORIES`: for detected region also create a TextLine for future transcription. - `CATEGORIES`: categories to save in `LayoutExtractorYolo`, delete others. (if `None` or `[]`, save all) - disable YOLO detection results printing - filter categories for `PageDecoder`
Configuration menu - View commit details
-
Copy full SHA for 6fc82e4 - Browse repository at this point
Copy the full SHA 6fc82e4View commit details -
Rename
MusicTranslator
for to more generalOutputTranslator
and e……verything around it. - substitution dictionary needs reversed format now. Translate key to values (to be more obvious). Example follows: - SSemantic music, Model output (now keys of dictionary): >2 + kGM + B3z + C4z + |..." - Semantic music (now values of dictionary): clef-G2 + keySignature-GM + note-B3_eighth + note-C4_eighth + barline...")
Configuration menu - View commit details
-
Copy full SHA for 53ee27a - Browse repository at this point
Copy the full SHA 53ee27aView commit details
Commits on Jan 22, 2024
-
Add option for rendering region categories for non-text regions.
- enable by adding `--output-render-category` argument
Configuration menu - View commit details
-
Copy full SHA for 68e7892 - Browse repository at this point
Copy the full SHA 68e7892View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8ac485e - Browse repository at this point
Copy the full SHA 8ac485eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7032389 - Browse repository at this point
Copy the full SHA 7032389View commit details
Commits on Jan 23, 2024
-
Configuration menu - View commit details
-
Copy full SHA for ef5f36f - Browse repository at this point
Copy the full SHA ef5f36fView commit details -
Add confidence estimation to PageOCR directly after detection. Update…
… TextLine if new confidence is higher. - move `output_substitution` to PageOCR after confidence estimation. - add `TextLine.get_labels` for easier confidence estimation. - add `get_line_confidence_median` to save literally one line of code later...
Configuration menu - View commit details
-
Copy full SHA for 8c738ee - Browse repository at this point
Copy the full SHA 8c738eeView commit details
Commits on Jan 26, 2024
-
Add
PageOCR.get_line_confidence
solving problem of wrong confidence…… estimation. Problem was that confidence was estimated using `confidence_estimation.py/get_line_confidence` without cutting `log_probs` according to `line.logit_coords`. New method solves it.
Configuration menu - View commit details
-
Copy full SHA for 949829a - Browse repository at this point
Copy the full SHA 949829aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0b6e933 - Browse repository at this point
Copy the full SHA 0b6e933View commit details -
Update readme, remove translator.Semantic_to_SSemantic.json because i…
…t was moved to ocr/omr.json (OCR engine config jon).
Configuration menu - View commit details
-
Copy full SHA for 092b04f - Browse repository at this point
Copy the full SHA 092b04fView commit details -
Improve translation of symbols in
output_translator.py
. Return orig……inal symbol if not found in dictionary.
Configuration menu - View commit details
-
Copy full SHA for cc47eef - Browse repository at this point
Copy the full SHA cc47eefView commit details
Commits on Jan 31, 2024
-
Add
atomic
option toOutputTranslator
+ output substitution toggl……es to `PageOCR` config section. - `SUBSTITUTE_OUTPUT` ('yes' or 'no'): enables (or disables) output substitution (enabled by default) - `SUBSTITUTE_OUTPUT_ATOMIC` ('yes' or 'no'): if 'yes' and any symbol cannot be translated, return original line. if 'no', line may be partially translated.
Configuration menu - View commit details
-
Copy full SHA for c7d90a1 - Browse repository at this point
Copy the full SHA c7d90a1View commit details
Commits on May 29, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d9430bc - Browse repository at this point
Copy the full SHA d9430bcView commit details
Commits on May 30, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2f84712 - Browse repository at this point
Copy the full SHA 2f84712View commit details -
Configuration menu - View commit details
-
Copy full SHA for eddf0e3 - Browse repository at this point
Copy the full SHA eddf0e3View commit details -
Add config parameter
UPDATE_TRANSCRIPTION_BY_CONFIDENCE
Parameter sets if PageOCR should update to new line: - every time (false) - only if better confidence (true) Applies in case of rerunning OCR on previously transcribed line)
Configuration menu - View commit details
-
Copy full SHA for 70a7e35 - Browse repository at this point
Copy the full SHA 70a7e35View commit details
Commits on Jun 17, 2024
-
Add ALTO baseline (export + import) in two options (float or points)
- Versions older than 4.2 defines baseline as a simple float. (that's where the original baseline comes from) - version 4.2 and never defines baseline as a PointsType string with recommend format: "x1,y1 x2,y2 ..."
Configuration menu - View commit details
-
Copy full SHA for 9dcd33f - Browse repository at this point
Copy the full SHA 9dcd33fView commit details
Commits on Jun 19, 2024
-
Add ALTO versions (options how to export baseline) + both baseline im…
…port options. - Versions older than 4.2 defines baseline as a simple float. (baseline is exported as mean of all Y baseline points) - version 4.2 and never defines baseline as a PointsType string with recommend format: "x1,y1 x2,y2 ..."
Configuration menu - View commit details
-
Copy full SHA for 1a46c00 - Browse repository at this point
Copy the full SHA 1a46c00View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2141002 - Browse repository at this point
Copy the full SHA 2141002View commit details -
Configuration menu - View commit details
-
Copy full SHA for 34c6584 - Browse repository at this point
Copy the full SHA 34c6584View commit details -
Configuration menu - View commit details
-
Copy full SHA for 82b3e70 - Browse repository at this point
Copy the full SHA 82b3e70View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2aed4bc - Browse repository at this point
Copy the full SHA 2aed4bcView commit details
Commits on Jun 20, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 4efdbab - Browse repository at this point
Copy the full SHA 4efdbabView commit details -
Configuration menu - View commit details
-
Copy full SHA for 134f51a - Browse repository at this point
Copy the full SHA 134f51aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 25f555c - Browse repository at this point
Copy the full SHA 25f555cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 26b0c1a - Browse repository at this point
Copy the full SHA 26b0c1aView commit details
Commits on Jun 25, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 3ce8bbc - Browse repository at this point
Copy the full SHA 3ce8bbcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3ec7902 - Browse repository at this point
Copy the full SHA 3ec7902View commit details
Commits on Jun 27, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 520e3ae - Browse repository at this point
Copy the full SHA 520e3aeView commit details -
Set category filter fallback to
[]
for backward compatibility.Old XMLs on input don't have category => line.category = None, OCR (and others) have to be set to `[]` by default to process ALL PAGES.
Configuration menu - View commit details
-
Copy full SHA for 9b414a0 - Browse repository at this point
Copy the full SHA 9b414a0View commit details -
1) Remove `ultralytics` and `music21` from dependencies for the whole projest. the user will have to install them when needed. 2) Import `ultralytics` only when needed, so it doesn't create import error for specific numpy versions. Ultralytics has this dependency right now: "numpy>=1.23.5,<2.0.0". See current at [github.com/ultralytics/ultralytics/blob/main/pyproject.toml](https://github.com/ultralytics/ultralytics/blob/69cfc8aa228dbf1267975f82fcae9a24665f23b9/pyproject.toml#L67)
Configuration menu - View commit details
-
Copy full SHA for a110f0e - Browse repository at this point
Copy the full SHA a110f0eView commit details
Commits on Jul 11, 2024
-
Configuration menu - View commit details
-
Copy full SHA for f5a7a51 - Browse repository at this point
Copy the full SHA f5a7a51View commit details
Commits on Aug 5, 2024
-
Fix bugs according to Pull request comment.
In `smart_sorter.py`: - if less then to engines filtered, return original page_layout and not only the split one. In `music structures.py`: - change type of `lengths` to numpy array, fix min_length to take from numbers and not names. - ensure `encoded_group` is not None before appending it to the voice. full comment: [pero-ocr/pull/56/#issuecomment-2245202776](#56)
Configuration menu - View commit details
-
Copy full SHA for cc9b0ae - Browse repository at this point
Copy the full SHA cc9b0aeView commit details -
Configuration menu - View commit details
-
Copy full SHA for 62af812 - Browse repository at this point
Copy the full SHA 62af812View commit details -
Add regions to splitting by category. If
region.category
set, move ……whole region to positive or negative (ignore categories of lines inside the region)
Configuration menu - View commit details
-
Copy full SHA for b60196f - Browse repository at this point
Copy the full SHA b60196fView commit details
Commits on Aug 6, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 4d2ddaa - Browse repository at this point
Copy the full SHA 4d2ddaaView commit details
Commits on Aug 27, 2024
-
Disable exporting midi lines if no notes on the line.
Export multirest as a simple default 'whole' rest.
Configuration menu - View commit details
-
Copy full SHA for 7c4251e - Browse repository at this point
Copy the full SHA 7c4251eView commit details
Commits on Sep 24, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d9c64cd - Browse repository at this point
Copy the full SHA d9c64cdView commit details
Commits on Oct 16, 2024
-
Simplify splitting page layouts to allow backwards (only look at regi…
…on category, None = 'text')
Configuration menu - View commit details
-
Copy full SHA for fa1a897 - Browse repository at this point
Copy the full SHA fa1a897View commit details
Commits on Oct 25, 2024
-
Add IndexError to catch expression when calculating transcription con…
…fidence -- in case when there are no logits (i.e. logits.shape[0] == 0) the confidence cannot be calculated.
Configuration menu - View commit details
-
Copy full SHA for f5f2f42 - Browse repository at this point
Copy the full SHA f5f2f42View commit details
Commits on Nov 5, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 747e491 - Browse repository at this point
Copy the full SHA 747e491View commit details