Extracts highlights from PDF documents as a summary
The pdf can be highlighted using any of the popular tools like Adobe Acrobat, Foxit reader etc
The summary is saved in the same directory as the pdf file named as <filename>_summary.txt
pip install PyMuPDF
Make it executable
chmod +x pdf_highlight_extractor.py
./pdf_highlight_extractor.py
Enter the path to the PDF file: c:\documents\example.pdf
==============================
**** Title: The Linux Programming Interface *****
Extracted Highlights:
==============================
📝 **Page 55**
------------------------------
Portable Operating System Interface)
📝 **Page 66**
------------------------------
Process scheduling:
📝 **Page 66**
------------------------------
Memory management:
*** Saved highlights successfully to: c:\documents\example_summary.txt ***