Matt McWilliams 625dfb4bba | ||
---|---|---|
extract | ||
ocr | ||
original | ||
NOTES_ON_OPTICAL_PRINTER_TECHNIQUE.md | ||
README.md | ||
compile.sh | ||
style.css |
README.md
NOTES ON OPTICAL PRINTER TECHNIQUE
Reproduction on the guide written by Dennis Couzin.
PDF Dependencies
- pandoc
bash compile.sh
Text extraction dependencies
- Python3.7
- OpenCV 2
- Tesseract
- PIL
cd extract
python3 pdf.py > ../ocr/pdf_output.txt
python3 ocr.py > ../ocr/tesseract_output.txt