Robotic Speed Reading

from the useful...-if-it-weren't-for-copyrights dept

As librarians are realizing the benefits of digitizing all of their works, they’re also discovering what a huge process it is to manually scan books. Character recognition software has certainly improved to the point that it’s quite useful, but the issue of manually turning pages is a huge problem. Along comes a robot to the rescue. Apparently, two different companies have designed special book scanning machines that turn the pages for you as they scan – scanning up to 1,000 pages an hour. This, of course, would be much much much more useful if we didn’t have draconian copyright laws that meant putting most books in this machine is a criminal act. Update: Oddly, the original story has disappeared from the site just a day after it came out, so here’s a slightly different version.


Rate this comment as insightful
Rate this comment as funny
You have rated this comment as insightful
You have rated this comment as funny
Flag this comment as abusive/trolling/spam
You have flagged this comment
The first word has already been claimed
The last word has already been claimed
Insightful Lightbulb icon Funny Laughing icon Abusive/trolling/spam Flag icon Insightful badge Lightbulb icon Funny badge Laughing icon Comments icon

Comments on “Robotic Speed Reading”

Subscribe: RSS Leave a comment
2 Comments
Michael Ward (profile) says:

scanning machines / Rocky Mountain News

The page comes up blank for me, too, and with a couple of minutes poking around I still couldn’t find a link elsewhere on their site.
This -may- be the newspaper’s take on the story of a few days ago about the scanner installed at Stanford, currently converting the CSLI archives into digital form. Other interesting projects include Benetech.org in Palo Alto, scanning books for the blind and limited-vision, and the venture MIT and HP put together to digitize the MIT Press backlist, using auto scanners and digital character recognition with a high claimed accuracy.
Our experience at Hidden Knowledge, in digitizing public-domain books for publication as e-books, has been that commercial OCR software is still inadequate and not getting better very fast. OTOH, as storage becomes cheaper and cheaper it becomes more reasonable to store digital images of the pages, say monochrome at 600 dpi.
Mike

Add Your Comment

Your email address will not be published. Required fields are marked *

Have a Techdirt Account? Sign in now. Want one? Register here

Comment Options:

Make this the or (get credits or sign in to see balance) what's this?

What's this?

Techdirt community members with Techdirt Credits can spotlight a comment as either the "First Word" or "Last Word" on a particular comment thread. Credits can be purchased at the Techdirt Insider Shop »

Follow Techdirt

Techdirt Daily Newsletter

Techdirt Deals
Techdirt Insider Discord
The latest chatter on the Techdirt Insider Discord channel...
Loading...