Posted by: Anonymous Coward
on January 04, 2006 03:07 AM
I see nothing has changed with respect to open source OCR in the past year. I had looked into it about a year ago. What I read at the time was so discouraging that I did not even test any open source OCR. I concluded that one of the Windows-based products was the only way to go. Thanks for the insight into why it is such a difficult task. It looks more grim than I had suspected. Your accuracy statistics are really discouraging. Some of the Windows products claim 98-99% accuracy. Even that is problematic with uncommon formatting.
I am curious about those book scanning projects, where books are scanned to be made available online. Do they just make images available as the final product, or do any of them try to do OCR?