Comb - Making unsearchable pdfs searchable using OCR Check it out (if my ec2 instance is up!): http://www.combpdf.com Forking Dependencies on Ubuntu 12.04: tesseract (OS X also) pytesser (OS X also) django 1.6.2 (OS X also) imagemagick (OS X also) leptonica