pypdf2 to pypdf (#1684)
### What problem does this PR solve?

Switches the PDF parser from PyPDF2 to pypdf to address "pypdf and PyPDF2
possible Infinite Loop when a comment isn't followed by a character" (#59).

### Type of change

- [x] Refactoring
KevinHuSh authored Jul 24, 2024
1 parent 7e60800 commit 100b316
Showing 4 changed files with 4 additions and 1 deletion.
2 changes: 1 addition & 1 deletion deepdoc/parser/pdf_parser.py
@@ -23,7 +23,7 @@
from PIL import Image, ImageDraw
import numpy as np
from timeit import default_timer as timer
-from PyPDF2 import PdfReader as pdf2_read
+from pypdf import PdfReader as pdf2_read

from api.utils.file_utils import get_project_base_directory
from deepdoc.vision import OCR, Recognizer, LayoutRecognizer, TableStructureRecognizer
1 change: 1 addition & 0 deletions requirements.txt
@@ -79,3 +79,4 @@ word2number==1.1
xgboost==2.1.0
xpinyin==0.7.6
zhipuai==2.0.1
+pypdf==4.3.0
1 change: 1 addition & 0 deletions requirements_arm.txt
@@ -153,3 +153,4 @@ groq==0.9.0
wikipedia==1.4.0
Bio==1.7.1
arxiv==2.1.3
+pypdf==4.3.0
1 change: 1 addition & 0 deletions requirements_dev.txt
@@ -138,3 +138,4 @@ groq==0.9.0
wikipedia==1.4.0
Bio==1.7.1
arxiv==2.1.3
+pypdf==4.3.0
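
For reviewers who want to confirm that no `PyPDF2` imports remain after a swap like this one, a check along the following lines may help. It is only a sketch and not part of this commit: the helper name `find_legacy_imports` is hypothetical, and it assumes the source under inspection parses as valid Python. It uses the standard-library `ast` module to report the line numbers of import statements that still reference the old package.

```python
import ast


def find_legacy_imports(source: str, legacy: str = "PyPDF2") -> list[int]:
    """Return line numbers of import statements that reference `legacy`.

    Hypothetical helper for illustration; the match is case-sensitive,
    so "pypdf" is not confused with "PyPDF2".
    """
    hits = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            # `import PyPDF2` or `import PyPDF2.generic as g`
            names = [alias.name for alias in node.names]
        elif isinstance(node, ast.ImportFrom):
            # `from PyPDF2 import PdfReader`; module is None for `from . import x`
            names = [node.module or ""]
        else:
            continue
        if any(n == legacy or n.startswith(legacy + ".") for n in names):
            hits.append(node.lineno)
    return sorted(hits)


# The two import lines from deepdoc/parser/pdf_parser.py, before and after.
before = "from PyPDF2 import PdfReader as pdf2_read\n"
after = "from pypdf import PdfReader as pdf2_read\n"

print(find_legacy_imports(before))  # [1]
print(find_legacy_imports(after))   # []
```

Because the change keeps the `pdf2_read` alias, call sites are untouched by the diff; a scan like this covers only import statements, not behavioral differences between the two libraries.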
