I follow the steps in this tutorial http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3 and gives me error on the part
mftraining-F-U font_properties unicharset nobatch.tr
The error I get is:
Failed to load font_properties
Reading nobatch.tr ...
nobatch has no defined properties.
!"Missing font_properties entry is a fatal error!":Error:Assert failed:in file mftraining.cpp, line 281
Segmentation fault
What you should do is to read superscripts and subscripts. Any help or advice from any OCR to do this is appreciated.