Forum: Ruby on Rails OCR project in GSoC

Arulalan (Guest)
on 2009-03-24 18:50
(Received via mailing list)
Hi all,

I am planning to do a GSoC project on OCR (Optical Character
Recognition).

The idea is this: when a full page of text is scanned from a book, it
should open in OpenOffice as an editable word-processor document, so
that the scanned page can be edited. I plan to convert scanned letters
into words for Tamil and English, and I will try to support a few more
languages as well. I think this OCR project can be done using RMagick,
and I am confident I can complete it.

That is my idea. I would be grateful if any of you could offer
suggestions and guide me on this.

Thanks,

Arulalan.
Harold (Guest)
on 2009-03-24 20:21
(Received via mailing list)
There are many ways to accomplish this, but none of them is easy...

There's ai4r's backpropagation neural net implementation, with a
simple OCR example at http://ai4r.rubyforge.org/neuralNetworks.html

There's also GNU Ocrad, which I've never used:
http://www.gnu.org/software/ocrad/
I also just found http://gtamilocr.sourceforge.net/, which does OCR for
Tamil characters as well.

I'd be glad to hear other suggestions...
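
To give a feel for what character classification involves, here is a minimal pure-Ruby sketch. Note that it is not ai4r's backpropagation network: it swaps in a much simpler technique, nearest-template matching by Hamming distance. The 3x5 glyph bitmaps and the names TEMPLATES, to_bits, hamming and classify are all made up for illustration. A real OCR system would first need to segment the scanned page into individual glyphs.

```ruby
# Toy 3x5 glyph bitmaps (hypothetical data); '#' is ink, '.' is background.
TEMPLATES = {
  "I" => %w[### .#. .#. .#. ###],
  "L" => %w[#.. #.. #.. #.. ###],
  "T" => %w[### .#. .#. .#. .#.]
}.freeze

# Flatten rows of '#'/'.' characters into an array of 0/1 pixels.
def to_bits(rows)
  rows.join.chars.map { |c| c == "#" ? 1 : 0 }
end

# Count the pixels at which two equally sized bitmaps differ.
def hamming(a, b)
  a.zip(b).count { |x, y| x != y }
end

# Return the label of the template closest to the given glyph.
def classify(rows)
  bits = to_bits(rows)
  TEMPLATES.min_by { |_label, tmpl| hamming(bits, to_bits(tmpl)) }.first
end

# A "T" with one pixel missing from its stem.
noisy_t = %w[### .#. .#. ... .#.]
puts classify(noisy_t)
```

The noisy glyph differs from the "T" template in one pixel and from the others in more, so it is still classified as "T". A trained network plays the same role as classify here, but learns the decision from examples instead of comparing against fixed templates.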
Juan José Vidal (Guest)
on 2009-03-24 22:53
(Received via mailing list)
Hi,

What about Google's Tesseract?

http://code.google.com/p/tesseract-ocr/
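
One way to try Tesseract from Ruby is to shell out to its command-line tool. This is only a rough sketch, assuming the tesseract binary is on the PATH and that the language data for the requested language (for example "eng" or "tam") is installed. The helper names tesseract_command and ocr, and the default output name "ocr_out", are made up for this example.

```ruby
# Build the argument list for the tesseract CLI:
#   tesseract IMAGE OUTPUTBASE -l LANG
# tesseract writes the recognised text to OUTPUTBASE.txt.
def tesseract_command(image_path, output_base, lang: "eng")
  ["tesseract", image_path, output_base, "-l", lang]
end

# Run OCR on an image and return the recognised text,
# or nil if tesseract is missing or exits with an error.
def ocr(image_path, output_base: "ocr_out", lang: "eng")
  ok = system(*tesseract_command(image_path, output_base, lang: lang))
  ok ? File.read("#{output_base}.txt") : nil
end

# Usage: ruby ocr.rb scanned_page.png [lang]
if __FILE__ == $PROGRAM_NAME && ARGV[0]
  puts ocr(ARGV[0], lang: ARGV[1] || "eng")
end
```

Passing the arguments to system as separate strings avoids shell quoting problems with file names that contain spaces. The resulting plain text could then be opened and edited in OpenOffice.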


Harold wrote:
> There are many ways to accomplish this, none of them are easy...