Tesseract parser is logging warn statement "Page 0"

Steps to reproduce

Steps to reproduce:

1. Ingest a recording with text extraction turned on.

Actual Results:

In many cases, "Page 0" is written to the logs

Activity

Show:
Lukas Rohner
October 10, 2013, 9:30 AM

tesseract version 3.01 is mostly showing a Page 0 error:

=> tesseract 179903_0.tif test
Tesseract Open Source OCR Engine v3.01 with Leptonica
Page 0
Garbage result of merge? Left Ragged (120,187)->(-20,768) w=162 s=0, sort key=139548, boxes=9, partners=0

tesseract version 3.02.02 doesn't show the Page 0 error.

=> tesseract 179903_0.tif test
Tesseract Open Source OCR Engine v3.02.02 with Leptonica

So I propose to ignore this Page 0 error because text-extracting is working with Page 0 output.

Greg Logan
October 10, 2013, 8:31 PM

Merged into 1.4.x with rev 15922.

Fixed and reviewed

Assignee

Lukas Rohner

Reporter

Tobias Wunden

Severity

Operations

Tags (folksonomy)

Components

Fix versions

Affects versions

Priority

Minor
Configure