OCR performance is poor

Steps to reproduce

Matterhorn is configured to download Leptonica 1.66 and Tesseract 3.00.
I downloaded Leptonica 1.67 and Tesseract 3.01 directly instead, along with the latest Tesseract English dictionary (reference: http://code.google.com/p/tesseract-ocr/wiki/ReadMe).

The text extraction is now much better than it was a few months ago.
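
To compare extraction quality between the two Tesseract versions, the same image can be run through each install from the command line. The following is only a minimal sketch, assuming a `tesseract` binary on the PATH and the standard `tesseract <image> <outputbase> -l <lang>` invocation; the helper names and the file names `frame.tif` and `out` are hypothetical and not part of Matterhorn.

```python
import subprocess
from pathlib import Path


def build_tesseract_command(image_path, output_base, lang="eng"):
    """Build the argument list for: tesseract <image> <outputbase> -l <lang>."""
    return ["tesseract", str(image_path), str(output_base), "-l", lang]


def run_ocr(image_path, output_base, lang="eng"):
    """Run tesseract on one image and return the recognized text.

    Tesseract writes its result to <output_base>.txt, which is read back here.
    Raises CalledProcessError if the tesseract process exits with an error.
    """
    subprocess.run(build_tesseract_command(image_path, output_base, lang), check=True)
    return Path(f"{output_base}.txt").read_text(encoding="utf-8")
```

Calling `run_ocr("frame.tif", "out")` once against each installed version and diffing the two results gives a rough before/after comparison for a single slide image.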

Status

Assignee

Matjaz Rihtar

Reporter

Tobias Wunden

Severity

Performance

Tags (folksonomy)

None

Components

Fix versions

Affects versions

1.3

Priority

Critical