Optical Character Recognition -- typically the art of teaching a computer to read printed text (provided as scanned images).
[...]
Three principal open-source engines:
- GOCR (appears to have a Tcl/Tk frontend)
- Ocrad (GNU)
- Tesseract OCR (originally Hewlett-Packard, but now released as open source)
Recommended proprietary packages: