Wednesday, September 30, 2009

OCR with Google Docs

At last, Optical Character Recognition (OCR) online with Google Docs.

  • Go to http://googlecodesamples.com/docs/php/ocr.php
  • Use the link to sign in to your Google account.
  • Click browse to find your .jpg, .gif, or .png file to be converted.
  • Click 'Start Import'
  • Your image will now open as text in a Google Doc

My testing showed a fairly accurate conversion with very few mistakes. Because I was doing this in Firefox a squiggly red line appeared under the mistakes and a right click quickly fixed those. (Not sure if it's Firefox or Google Docs that puts the squiggly red line under those misspelled words? Might be both...)

One feature that's not available yet which I'd love to see is the ability to import PDF's that hold scanned images of documents. For some reason I seem to have a ton of those. At the moment, the only way that I can find to import those is to click on each page in the PDF. Copy it to the clipboard. Paste it into Paint.Net and then save it as an image. I then import that image into the doc. This is only practical if your PDF has a few pages. If it has many then this is just not workable.

 

 

 

No comments:

Post a Comment