Discussion:
[tesseract-ocr] Getting a blank tessinput.tif file
Ashish Goel
9 years ago
Hello All,

I am trying to do OCR on a bunch of images. I am getting some failures, and I
want to analyse them.
So, to do that, I am trying to get the tessinput.tif file so that I can
find out what input actually goes to tesseract.

I am passing "-c tessedit_write_images 1" to tesseract to
generate the tessinput.tif file.
Tesseract does generate the tessinput.tif file, but the file is blank (0 bytes).
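
For anyone reproducing this, here is a minimal Python sketch of the check described above. It is not from the original post. It assumes the `-c NAME=VALUE` form that the tesseract command line documents for config variables (the post writes the value after a space; the thread does not establish whether that matters), and it assumes tessinput.tif is written to the current working directory. The helper names `build_command`, `is_blank` and `run_and_check` are hypothetical.

```python
import os
import subprocess


def build_command(image_path, output_base):
    """Build a tesseract command line that asks for the debug input image.

    Assumption: config variables are passed as "-c NAME=VALUE".
    """
    return ["tesseract", image_path, output_base,
            "-c", "tessedit_write_images=1"]


def is_blank(path):
    """Return True if the file is missing or has a size of 0 bytes."""
    return not os.path.exists(path) or os.path.getsize(path) == 0


def run_and_check(image_path, output_base, workdir="."):
    """Run tesseract in workdir and report whether tessinput.tif is blank.

    Assumption: tessinput.tif is written to the working directory.
    """
    cmd = build_command(os.path.abspath(image_path), output_base)
    subprocess.run(cmd, cwd=workdir, check=False)
    return is_blank(os.path.join(workdir, "tessinput.tif"))
```

Calling `run_and_check("page.png", "out")` (with a hypothetical `page.png`) returns True when the debug image is missing or empty, which is the symptom reported here.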

Did I do anything wrong?
I downloaded tesseract 3.04 and leptonica 1.73 and compiled both.

The versions reported by tesseract -v are:

tesseract 3.04.00
leptonica-1.73
libjpeg 8b (libjpeg-turbo 1.2.0) : libpng 1.2.46 : zlib 1.2.3.4
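
A small Python sketch (not part of the original post) for reading this output: it splits the library line on " : " and reports which image libraries the build lists. The function name `linked_libraries` is hypothetical, and the parsing assumes the line layout shown above. For the output quoted here, no libtiff entry is listed.

```python
def linked_libraries(version_output):
    """Map library name to version for lines of the form 'a 1 : b 2 : c 3'."""
    libs = {}
    for line in version_output.splitlines():
        if " : " not in line:
            continue  # skip the tesseract and leptonica version lines
        for entry in line.split(" : "):
            parts = entry.strip().split()
            if len(parts) >= 2:
                # first token is the library name, second is its version
                libs[parts[0]] = parts[1]
    return libs


reported = """tesseract 3.04.00
leptonica-1.73
libjpeg 8b (libjpeg-turbo 1.2.0) : libpng 1.2.46 : zlib 1.2.3.4"""

libs = linked_libraries(reported)
print(sorted(libs))       # library names found in the output
print("libtiff" in libs)  # whether TIFF support is listed
```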


Any help will be greatly appreciated...

Regards,
Ashish
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+***@googlegroups.com.
To post to this group, send email to tesseract-***@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/f244961f-009c-40a7-8908-3e3bda490519%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Zdenko Podobný
9 years ago
Your leptonica build supports only a limited number of image formats. What
kind of image are you trying to process?

Zdenko
ashish goel
9 years ago
I am trying to process a PNG image. Will it work if I convert my PNG to
TIFF before running OCR?
Ashish Goel
9 years ago
Hey Zdenko,
I also tried converting my image to tif/tiff, but it still did not give me
a good tessinput.tif.
I found that libtiff was missing in my environment, so I installed
libtiff4-dev and recompiled leptonica.

Now my version shows up as:

tesseract 3.04.00
leptonica-1.73
libjpeg 8b (libjpeg-turbo 1.2.0) : libpng 1.2.46 : libtiff 3.9.5 : zlib 1.2.3.4

but still tessinput.tif is blank.

Is there anything else that I can try so that I can get tessinput.tif?

Thanks
Ashish
Zdenko Podobný
9 years ago
What OS are you using?

Zdenko
Ashish Goel
9 years ago
Ubuntu 12.04
Zdenko Podobný
9 years ago
Is there a reason why you do not use the leptonica shipped by Ubuntu?
It is difficult to tell from your description where the problem is. I
think the best approach, when custom-compiled software gives you any
problem, is to use the software packaged by your distribution...

Zdenko
Ashish Goel
9 years ago
Zdenko,

Thanks for your reply. I will try with the standard distro packages and let
you know if it works.

Ashish