Commit

feat(tasks): Drop mojibake fix as unlikely to be needed
flooie committed May 29, 2024
1 parent f9c0b3d commit 8d2dcbf
Showing 1 changed file with 0 additions and 3 deletions.
3 changes: 0 additions & 3 deletions doctor/tasks.py
@@ -647,9 +647,6 @@ def extract_recap_pdf(
             if page_needs_ocr(page, page_text):
                 extracted_by_ocr = True
                 page_text = extract_with_ocr(page, strip_margin=strip_margin)
-            if "e" not in page_text:
-                # It's a corrupt PDF from ca9. Fix it.
-                page_text = fix_mojibake(page_text)
             content += f"\n{page_text}"
     content = remove_excess_whitespace(content)
     return content, extracted_by_ocr
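
For context, the three deleted lines read as a cheap heuristic: English text almost always contains the letter "e", so a page without one was assumed to be mis-encoded and was passed through `fix_mojibake`. The sketch below illustrates only that guard. `maybe_repair` and `shift_back` are hypothetical names, and `shift_back` is a toy stand-in that assumes a one-code-point shift; it is not the real `fix_mojibake`, whose implementation this commit does not show.

```python
from typing import Callable


def maybe_repair(page_text: str, repair: Callable[[str], str]) -> str:
    """Apply `repair` only when the page contains no letter "e".

    Mirrors the guard deleted in this commit; `repair` stands in for
    fix_mojibake, which is not reproduced here.
    """
    if "e" not in page_text:
        return repair(page_text)
    return page_text


def shift_back(text: str) -> str:
    """Hypothetical repair: undo a +1 shift of every code point."""
    return "".join(chr(ord(ch) - 1) for ch in text)


# Simulate a garbled page by shifting every character up by one code point.
garbled = "".join(chr(ord(ch) + 1) for ch in "the court")

print(maybe_repair(garbled, shift_back))      # garbled input is repaired
print(maybe_repair("the court", shift_back))  # clean input passes through
```

One property of the guard worth noting: an empty page also contains no "e", so it is handed to the repair function as well.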
