Google converts scans, PDFs to Docs

Optical character recognition extended for PDFs, images.

Google has introduced a new tool that converts text from Adobe PDF files or high-resolution images into Google Docs documents.

Developed by Google Australia software engineer Jaron Schaeffer, the tool extends an experimental optical character recognition (OCR) feature that was launched last September.

OCR is a means of converting scanned images of handwritten, faxed or otherwise printed documents into electronic files that may be digitally searched, translated, edited and stored.

Google's OCR tool integrates with its translation capabilities, allowing scanned files to be translated to and from English, French, Italian, German and Spanish.

It may be accessed through the Google Docs upload page.

In a blog post this morning, Schaeffer described using the tool to convert a colleague's family chronicles to digital form, allowing them to be continued in Google Docs.

OCR technology is also used in Google Books, which allows users to search the full text of "millions of books" that Google has scanned and converted to online text.

