Discussion:
[tesseract-ocr] Training OCR
MedCo
2015-03-27 17:06:29 UTC
Permalink
Hello,

I need to get text out of small bitmap files. I used tesseract for it and
it works ok. The problem is that the text we have is not generic, it's
comprised of different upper case, lower case characters and some special
symbols.
This will need lot of OCR training.

When I run it to recognize images, that will need to be on through a dll or
command prompt so my automated script can feed in the image and get text
out of it. Tesseract functionality works good through command prompt, but
training part seems very challenging.

Is there any OCR available which is easy to train, may be training with
some GUI interface?

thanks,
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+***@googlegroups.com.
To post to this group, send email to tesseract-***@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/9307a03a-06de-4289-8a85-42d737bf3da7%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Quan Nguyen
2015-03-28 23:46:54 UTC
Permalink
There are several training tools available:

https://code.google.com/p/tesseract-ocr/wiki/AddOns
Post by MedCo
Hello,
I need to get text out of small bitmap files. I used tesseract for it and
it works ok. The problem is that the text we have is not generic, it's
comprised of different upper case, lower case characters and some special
symbols.
This will need lot of OCR training.
When I run it to recognize images, that will need to be on through a dll
or command prompt so my automated script can feed in the image and get text
out of it. Tesseract functionality works good through command prompt, but
training part seems very challenging.
Is there any OCR available which is easy to train, may be training with
some GUI interface?
thanks,
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+***@googlegroups.com.
To post to this group, send email to tesseract-***@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/fe357530-26bf-4d22-a3d8-4ba4cbc1ec52%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Loading...