Skip to content

Latest commit

 

History

History
2 lines (2 loc) · 304 Bytes

README.md

File metadata and controls

2 lines (2 loc) · 304 Bytes

pdfwordfrequencycounter

This is a super crappy script I wrote that parses through Greek PDFs and, poorly, calculates the most common 750 words. It somewhat ignores articles, θα, να, and some other word fragments, but sucks at it. It was enough for my purposes so maybe it will help someone else.