Recent Blog Posts
-
msbnc.com "Knows a Trend When it Sees One"
Nov 23 20094:11 pm EDT -
Windows 7 Spin May Be on the Money
Nov 23 20098:44 am EDT -
Mapping Company Raises Millions
Nov 20 20094:09 pm EDT -
Facebook Valuations Are All Over the Map
Nov 20 200911:30 am EDT -
The Future of Tech, 2010 Edition
Nov 20 20099:13 am EDT -
Automatic Pancake-Making Machine Attracts $2 Million in Capital
Nov 19 20094:53 pm EDT -
Apple Talk of Microsoft's Annual Meeting
Nov 19 20091:27 pm EDT -
There Is Still Hope for the News Business
Nov 19 200911:50 am EDT -
The Google Phone May Be Near
Nov 18 20094:10 pm EDT -
Amazon Grocery Service Goes Mobile with iPhone
Nov 18 20099:13 am EDT
Links
- Engadget

- Pandora

- GigaOM

- USA TODAY Tech

- Todd Bishop's Microsoft Blog

- Somewhat Frank's tech conference list

- BuzzTracker Tech

- The Long Tail

- Tom Foremski

- Roger McGuinn's Folk Den

- John Battelle's SearchBlog

- Mark Cuban's blog

- SciTech Daily

- Romenesko

- Kevin Maney's site

- Steven Johnson

- Marc Andreessen

- TechCrunch

- Fred Wilson

- paidContent

- Spiedies, mmmm

Google Is Now Scanning Documents
Google has begun to index documents posted online that contain images of text using Optical Character Recognition (OCR) technology, it announced yesterday on its blog.
Previously only docs converted to PDFs with text were indexed and included in results. Since scanned docs are only a picture of text, they are typically more difficult to interpret, and the pages can include wrinkle, smudges or stains.
This advancement opens up a whole new collection of information, including many government and academic documents once hidden from the public searches.
The news comes a few days after Google settled its book-scan suit, giving it the go-ahead to continue its book search project.
By Chris Snyder for Wired.comAlso on Wired.com:
DHS: Scour Blogs to Stop Bombs
Google Yahoo Deal Crumbling, Report
Now Official: No One In Tech Can Defend McCain
Subscribe to Wired magazine






