Reproduction on the guide written by Dennis Couzin.
bash compile.sh
cd extract python3 pdf.py > ../ocr/pdf_output.txt python3 ocr.py > ../ocr/tesseract_output.txt