How Google's High Speed Book Scanner De-Warps Pages



Posted by ScuttleMonkey on Friday May 15, 2009 @04:17PM from the onto-dewarping-brains-next dept.

Hugh Pickens writes "Patent 7,508,978, awarded to Google, shows how the company has already managed to scan more than 7 million books. Google's system uses two cameras and infrared light to automatically correct for the curvature of pages in a book. By constructing a 3D model of each page and then 'de-warping' it afterward, Google can present flat-looking pages online without having to slice books up or mash them onto a flatbed scanner. Stephen Shankland writes that the 'sophistication of the technology illustrates that would-be competitors who want to feature their own digitized libraries won't have a trivial time catching up to Google.' First, a book is placed on a flat surface, while above it, an infrared projector displays a special mazelike pattern onto the pages. Next, two infrared cameras photograph the infrared pattern from different perspectives. 'The images can be stereoscopically combined, using known stereoscopic techniques, to obtain a three-dimensional mapping of the pattern,' according to the patent. 'The pattern falls on the surface of (the) book, causing the three-dimensional mapping of the pattern to correspond to the three-dimensional surface of the page of the book.'"
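The "known stereoscopic techniques" quoted from the patent are not spelled out in the summary. As a rough illustration only, the textbook relation for a rectified camera pair is depth = focal length × baseline / disparity. The sketch below assumes that model; `depth_from_disparity`, the focal length, the baseline, and the pixel coordinates are all made-up values for illustration, not anything taken from Google's system.

```python
def depth_from_disparity(x_left, x_right, focal_px, baseline_mm):
    """Distance (mm) from the cameras to a pattern feature seen at column
    x_left in the left image and x_right in the right image of a
    rectified stereo pair."""
    disparity = x_left - x_right
    if disparity <= 0:
        raise ValueError("feature must have positive disparity")
    return focal_px * baseline_mm / disparity


# Hypothetical setup: 1000 px focal length, cameras 200 mm apart.
# The same feature of the projected pattern is matched in both images
# at several points across the page.
matches = [(520.0, 270.0), (530.0, 272.0), (545.0, 275.0)]
depths = [depth_from_disparity(xl, xr, 1000.0, 200.0) for xl, xr in matches]
```

Repeating this for every matched feature of the pattern would give a cloud of depths across the page, which is the kind of surface map the de-warping step needs.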

This discussion has been archived. No new comments can be posted.


More importantly (Score:2, Funny)

by Anonymous Coward writes:

Does it run on Linux? Does it work for scanning porn?
- Re: (Score:2)
  
  by moderatorrater ( 1095745 ) writes:
  
  Yes and yes, although only scanning in porn magazines instead of actually using it in the porn itself would be a very unimaginative way to use this technology...
  - Re: (Score:2)
    
    by K. S. Kyosuke ( 729550 ) writes:
    
    Only a real weirdo would want to see those curves flattened, don't you think?
IMPORTANT QUESTIONS (Score:2, Funny)

by space_jake ( 687452 ) writes:

I wonder how ass curvature comes out with that scanner.
- Re:IMPORTANT QUESTIONS (Score:4, Funny)
  
  by Anonymous Coward writes: on Friday May 15, 2009 @04:28PM (#27972145)
  
  The same way as your face.
  
- Re: (Score:2)
  
  by zehaeva ( 1136559 ) writes:
  
  like it says, flat
note to self: (Score:2, Funny)

by circletimessquare ( 444983 ) writes:

do NOT sit on the copier machine with pants down at google hq
Patent!!??!! (Score:5, Funny)

by aashenfe ( 558026 ) writes: on Friday May 15, 2009 @04:27PM (#27972131) Journal

When is the patent office going to quit giving patents for obvious techniques? :)

- Re:Patent!!??!! (Score:4, Funny)
  
  by sopssa ( 1498795 ) writes: <sopssa@email.com> on Friday May 15, 2009 @04:35PM (#27972253) Journal
  
  So why didn't you do it or patent it before?
  
  - Re: (Score:2, Informative)
    
    by aashenfe ( 558026 ) writes:
    
    Simple, I was trying to be funny. Notice the smiley :)
  - - Re:Patent!!??!! (Score:5, Informative)
      
      by Dewin ( 989206 ) writes: on Friday May 15, 2009 @04:57PM (#27972511)
      
      I believe the pattern barcode scanners use is simply trying to look for the barcode in several different directions, but I could be wrong.
      I also believe there's either rudimentary correction for common types of distortion (i.e. on cylindrical objects) or just wide enough tolerances to allow it to work anyways.
      
      - Re:Patent!!??!! (Score:4, Informative)
        
        by profplump ( 309017 ) writes: <zach-slashjunk@kotlarek.com> on Friday May 15, 2009 @05:35PM (#27972963)
        
        It's just wide tolerances. The whole UPC-scanning system was designed so that the output from the light return sensor could be read directly (ignoring some minor gain control/etc.) as a digital data stream, with the clock rate determined by the horizontal scan rate. There's no reason to do distortion correction because it's not reading an image in the first place, it's just reading a series of high/low signal returns as serial data. I'm sure you could build a more complicated system that does 2-D or 3-D imaging and distortion correction, but it's way more work than is necessary to read a linear UPC.
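To make the "series of high/low signal returns" idea concrete, here is a minimal sketch of reading a one-dimensional reflectance trace as run lengths rather than as an image. It is not a UPC decoder: the threshold, the helper names `run_lengths` and `module_widths`, and the choice of the narrowest run as the unit width are all assumptions made for illustration.

```python
from itertools import groupby


def run_lengths(samples, threshold=0.5):
    """Threshold a 1-D reflectance trace and return (level, length) runs.
    level is 1 for a dark bar (low reflectance), 0 for a light space."""
    bits = [1 if s < threshold else 0 for s in samples]
    return [(level, len(list(group))) for level, group in groupby(bits)]


def module_widths(runs):
    """Express each run as a rounded multiple of the narrowest run."""
    unit = min(length for _, length in runs)
    return [(level, round(length / unit)) for level, length in runs]


# Toy trace: space, 1-wide bar, 2-wide space, 3-wide bar, space.
trace = [0.9] * 3 + [0.1] * 3 + [0.9] * 6 + [0.1] * 9 + [0.9] * 3
runs = run_lengths(trace)
widths = module_widths(runs)
```

Because only the relative widths of the runs matter in this sketch, a uniformly stretched or compressed trace yields the same result, which is one way to picture the wide tolerances described above.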
        
    - - Re: (Score:2)
        
        by WCguru42 ( 1268530 ) writes:
        
        they patented this new 3D barcode system.
        Really, I've never seen a 3D barcode on anything. maybe 2D, but definitely no 3D barcodes on my packages.
      - Re: (Score:2)
        
        by SBrach ( 1073190 ) writes:
        
        If you are thinking of 2d data matrices, many people use this technology. Linky. [wikipedia.org]
- Re: (Score:2, Interesting)
  
  by Anonymous Coward writes:
  
  I hate patents as much as anyone else, but:
  1) This isn't so obvious, and requires some fairly complex math
  2) It is pretty complex (in the way it functions), enough that i would actually consider this patent-worthy.
  But, there is some "prior art" of such functions in the visible range for scanning bodies IIRC.
  I believe this was meant to be funny, and i shall accept incoming whooshes of air with joy.
  Have at you.
  note: i still hate patents though.
  I can't see why they would benefit from patenting this method...
  I
  - Re: (Score:2)
    
    by javaxjb ( 931766 ) writes:
    
    I hate patents as much as anyone else, but: 1) This isn't so obvious, and requires some fairly complex math 2) It is pretty complex (in the way it functions), enough that i would actually consider this patent-worthy.
    I would add that at least this patent is not solely a software patent; it has a hardware component.
  - - Re: (Score:2)
      
      by WCguru42 ( 1268530 ) writes:
      
      Then how come nobody else has used it to scan books? All the photocopies of books I've gotten throughout the year seem to belie your statement of this being obvious.
- Re:Patent!!??!! (Score:5, Informative)
  
  by Timmmm ( 636430 ) writes: on Friday May 15, 2009 @05:51PM (#27973099)
  
  You jest, but this technique *has* been around for years. I remember when digital cameras first became available there was a product that could perform a 3D scan by projecting a pattern onto the object and using an offset picture. I think the pattern came on a slide - that's how long ago it was! Here's a whole wikipedia page about the scanning technique: http://en.wikipedia.org/wiki/Structured_Light_3D_Scanner [wikipedia.org]
  This picture is especially good: http://en.wikipedia.org/wiki/File:6-seat.jpg [wikipedia.org]
  Anyway after reading the patent abstract, it isn't about the 3D scanning at all, it appears to be about an algorithm to find the fold once you've already got the point cloud. I would have thought that was fairly trivial. A possible approach would be to take the radon transform of the height map and find the smallest value that's roughly in the middle.
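A simplified version of the fold-finding idea can be sketched as follows. It assumes the spine runs straight up the image, so a single projection of the Radon transform (plain column sums of the height map) is enough; a full Radon transform would also search over angles. The function name, the `search_fraction` parameter, and the toy data are hypothetical, and nothing here is taken from the patent.

```python
def find_fold_column(height_map, search_fraction=0.5):
    """Return the index of the column with the lowest summed height,
    searching only the central `search_fraction` of the columns."""
    n_cols = len(height_map[0])
    # Summing each column is the Radon projection along vertical lines.
    column_sums = [sum(row[c] for row in height_map) for c in range(n_cols)]
    margin = int(n_cols * (1 - search_fraction) / 2)
    candidates = range(margin, n_cols - margin)
    return min(candidates, key=lambda c: column_sums[c])


# Toy height map (mm above the table): two bulging pages with a valley
# at the spine. Every row is identical for simplicity.
row = [5, 9, 12, 10, 4, 1, 4, 10, 12, 9, 5]
height_map = [row[:] for _ in range(4)]
fold = find_fold_column(height_map)
```

Restricting the search to the middle of the image keeps the low outer edges of the pages from being mistaken for the fold.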
  
  - Re: (Score:3, Funny)
    
    by retchdog ( 1319261 ) writes:
    
    Whoa, "radon transform"? Hold on a second, wiz-kid. Does that use poisonous gas or something? It's certainly not mathematics, because that means stuff like "three times four".
    - Re:Patent!!??!! (Score:4, Informative)
      
      by petermgreen ( 876956 ) writes: <plugwash.p10link@net> on Friday May 15, 2009 @07:25PM (#27974043) Homepage
      
      It certainly is mathematics and it's not that hard to understand either. Basically it is the mathematical equivalent of what a hard field tomograph does.
      Consider a function of two values and consider those values to be 2D coordinates. Consider also that the function is zero outside of a defined area.
      Now consider that there are an infinite number of infinitely long straight lines passing through that area and each can be defined by two parameters, an angle and an offset from the origin in the direction perpendicular to the line.
      Along each of those lines an integral can be calculated. Those integrals form the radon transform of the function (with each integral being identified by the two parameters).
      Not really that complicated, the trickiest bit is probably deciding how best to approximate the line integrals from your limited number of data points.
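Written out, the description above corresponds to the standard definition below. The symbols are choices made here for illustration: f is the function of two coordinates, θ is the angle of the line's normal, s is the offset from the origin perpendicular to the line, and t is the position along the line.

```latex
\[
  (\mathcal{R}f)(\theta, s) \;=\; \int_{-\infty}^{\infty}
    f\bigl(s\cos\theta - t\sin\theta,\; s\sin\theta + t\cos\theta\bigr)\,dt
\]
```

Each pair (θ, s) picks out one straight line, and the value of the transform at that pair is the integral of f along that line.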
      
      - Re: (Score:3, Insightful)
        
        by retchdog ( 1319261 ) writes:
        
        I almost feel bad. I know what a radon transform is and I've taken a class on inverse problems.
        My point was just that the common view of what is mathematics is rather anemic and quick to give engineering credit to relatively simple ideas. I suspect that the patent office has similar fallacious thinking.
- Re: (Score:2)
  
  by AHuxley ( 892839 ) writes:
  
  The Russians (iirc) had a cute trick too. A tiny spy cam with two lights pointing down on the page. When the two dots were joined, the camera was at the right distance and the spy got a quality image of a page.
So... (Score:5, Interesting)

by fuzzyfuzzyfungus ( 1223518 ) writes: on Friday May 15, 2009 @04:28PM (#27972143) Journal

How long before some particularly vengeful luddite publisher starts printing on treated paper stock that has an IR visible pattern, calculated to confuse these scanners, printed on it?

They've been making "anti-copy paper" designed to defeat optical scanning for years now, surely something similar in the IR band could be effected...

- Re: (Score:3, Insightful)
  
  by Anonymous Coward writes:
  
  Maybe those books are less important to commit to a digital scan ;-)
- Re:So... (Score:5, Insightful)
  
  by twistedsymphony ( 956982 ) writes: on Friday May 15, 2009 @04:43PM (#27972335) Homepage
  
  they could probably do it in the visible spectrum as well, it would just take twice as long because they can't map and scan at the same time.
  
  Failing that there are alternative methods that might work as well.
  
- Re: (Score:2)
  
  by Chyeld ( 713439 ) writes:
  
  Why? Just as you said, they already have anti-copy paper. If you don't want someone to be able to copy your book, simply print using that (of course, that will cause your costs to skyrocket). It's not as if the IR block would prevent the copy, it'd just mean the copy looks like crap (thus potentially impacting your image as a publisher).
- Re: (Score:2)
  
  by bmwm3nut ( 556681 ) writes:
  
  Then you just do phase-locked detection. In the IR with current cheap detectors you can modulate in the kHz without any problem. I wouldn't be surprised if they do that now. In my lab we look for changes in an IR signal that are about 10^8 times smaller than the background IR radiation. It's not a hard problem to solve.
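The idea of picking a modulated signal out of a much larger background can be sketched in software as a lock-in measurement: multiply the detector samples by in-phase and quadrature references at the modulation frequency and average. This is only a toy under stated assumptions; the sample rate, the 1 kHz modulation, the amplitudes, and the name `lock_in_amplitude` are invented here, and a real system would likely do this in analog or dedicated hardware.

```python
import math


def lock_in_amplitude(samples, sample_rate_hz, ref_hz):
    """Estimate the amplitude of the component of `samples` at ref_hz by
    multiplying with cosine and sine references and averaging."""
    n = len(samples)
    i_sum = q_sum = 0.0
    for k, s in enumerate(samples):
        phase = 2 * math.pi * ref_hz * k / sample_rate_hz
        i_sum += s * math.cos(phase)
        q_sum += s * math.sin(phase)
    return 2 * math.hypot(i_sum, q_sum) / n


# Hypothetical scene: a 1 kHz modulated IR pattern of amplitude 0.01
# riding on a constant background 1000 times larger.
rate, f_mod, n = 50_000, 1_000, 5_000  # 0.1 s of samples = 100 full cycles
signal = [10.0 + 0.01 * math.sin(2 * math.pi * f_mod * k / rate)
          for k in range(n)]
estimate = lock_in_amplitude(signal, rate, f_mod)
```

Averaging over a whole number of modulation cycles makes the constant background cancel, so a static IR pattern printed on the paper would not show up at the modulation frequency.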
- Re: (Score:2)
  
  by BitZtream ( 692029 ) writes:
  
  Two things, first off, they just use something else to accomplish the same thing. If you can read it, something else can as well. It may not be as fast, it may take some time and money to develop and optimize but that amount of time and money is probably pretty trivial to Google.
  Second, Google doesn't care about any book that can do that at this time, they are going after old works currently, that aren't being produced by anyone anyway, so nothing they are going after right now is going to be affected by
- Shhh! Don't Give Them Ideas! (Score:2)
  
  by Arccot ( 1115809 ) writes:
  
  If the publishers see this article, the next book I want to read is going to be written in CAPTCHAs!
  The really hard ones without an audio guide!
- Re: (Score:2)
  
  by nurb432 ( 527695 ) writes:
  
  Would it matter with the 100s of millions of books that are already there that they have to go through first?
  Wish i had that at home, would love to scan a lot of my stuff but refuse to cut it.
- Re: (Score:2)
  
  by russotto ( 537200 ) writes:
  
  How long before some particularly vengeful luddite publisher starts printing on treated paper stock that has an IR visible pattern, calculated to confuse these scanners, printed on it?
  Before one does it? Not long. Before any significant amount of product is produced using it? Probably forever, on cost and particularly cost/benefit issues. Besides, if the protected product produced was particularly interesting to those wanting to scan it, they could almost certainly modify the scan system to accommodate
- - Re:So... (Score:4, Interesting)
    
    by fuzzyfuzzyfungus ( 1223518 ) writes: on Friday May 15, 2009 @05:11PM (#27972683) Journal
    
    I have to hope that any publisher hip enough to read Slashdot for tech advice(rather than relying on glossy advertisements from "security" vendors in the latest issue of Monetizing The Everloving Fuck Out of Your Precious, Precious IP magazine) wouldn't do anything that stupid. I wouldn't bet on it, though.
    
    With respect to the foolishness over "copy protection" it is interesting to consider the possible application of the old line "the worse, the better." [wikipedia.org] The idea is that, in order for a bad situation to change, it must get worse, so that the cost of tolerating it becomes unbearably high. As long as DRM and anti-copy paper, and macrovision and all the others cause relatively limited customer displeasure and support calls, there will be little incentive to change, and things will remain as they are. If you can drive the content guys to ever more intrusive measures, things might actually get bad enough to spur a blowback.
    
Patent? Prior Art? (Score:3, Insightful)

by mveloso ( 325617 ) writes: on Friday May 15, 2009 @04:29PM (#27972163)

Wasn't this a Sci-Fi movie staple back in the 80s? They used it for body and object scanning, not books...but still.

- Re: (Score:2)
  
  by MobileTatsu-NJG ( 946591 ) writes:
  
  Why did they run an OCR on a body scan? :D
  - Re: (Score:3, Funny)
    
    by SomeJoel ( 1061138 ) writes:
    
    To read the tattoos.
The New Bell Labs? (Score:5, Interesting)

by ObsessiveMathsFreak ( 773371 ) writes: <obsessivemathsfreak.eircom@net> on Friday May 15, 2009 @04:30PM (#27972177) Homepage Journal

I've read many comments over the years about the old Bell Labs and how a huge amount of pioneering research came out of them over the course of their existence, i.e. before they got axed.
It would seem that Google Labs is performing somewhat the same function, albeit more oriented towards software rather than physical research.

- - Re: (Score:3, Interesting)
    
    by Anonymous Coward writes:
    
    Bell Labs did basic research that most of the time didn't have any current commercial applications and maybe never will.
    Google's projects all have current commercial applications. I don't know of anything they do that is for pure research and to add to humanity's knowledge.
    Doesn't Google have something called the 20% policy or something like that? Where Google engineers devote 20% of their time to non-Google projects?
    Not exactly basic research, but not necessarily commercial applications.
    The closure of Bell Labs is
    - Re: (Score:3, Informative)
      
      by mattack2 ( 1165421 ) writes:
      
      I can't find proof in a quick search, but I do remember others posting responses here recently (possibly Anonymous Cowards) to people mentioning the 20% time with things like (paraphrase) "that will be useful for Google". In other words, the implication (or at least my inference) was that while they are technically "non-Google", the intent was that eventually they would be Google projects or the projects would be killed off.
      I have no first hand knowledge of that, however.
      The small paragraph http://en.wikip [wikipedia.org]
Mostest importanly... (Score:4, Interesting)

by Anonymous Coward writes: on Friday May 15, 2009 @04:32PM (#27972207)

...who's flipping the pages?

- Re: (Score:3, Funny)
  
  by Anonymous Coward writes:
  
  I heard from some guy, somewhere, that on weekends the Oompa Loompas do it.
- Re: (Score:2, Informative)
  
  by Bob Wehadababyitsabo ( 629809 ) writes:
  
  There are automatic page turning machines that use puffs of air and a stylus to move through a book.
Obvious question... (Score:2)

by jwriney ( 16598 ) writes:

That's cool and all that, but who (or what) flips the pages?
--riney
- Re:Obvious question... (Score:4, Funny)
  
  by Captain Spam ( 66120 ) writes: on Friday May 15, 2009 @04:42PM (#27972323) Homepage
  
  That's cool and all that, but who (or what) flips the pages?
  Interns.
  
  - You laugh, but look at this (Score:5, Interesting)
    
    by langelgjm ( 860756 ) writes: on Friday May 15, 2009 @04:52PM (#27972453) Journal
    
    That's modded funny, but take a look at this. [google.com]
    Maybe they use automated page turning machines for normal books, and turn pages by hand for older/more fragile works?
    
    - Re:You laugh, but look at this (Score:5, Funny)
      
      by StikyPad ( 445176 ) writes: on Friday May 15, 2009 @05:12PM (#27972691) Homepage
      
      Now THAT'S a page turner.
      Ba dum dum. Thanks, I'll be here all week! Try the veal, and don't forget to tip your waitress!
      
    - Re: (Score:2)
      
      by Anenome ( 1250374 ) writes:
      
      Dear god, what sort of hideous Lovecraftian monstrosity is pictured turning pages there??? O_O I can't tell if those are fingers or flippers. Burn it with fire!
    - - Re: (Score:2)
        
        by lithis ( 5679 ) writes:
        
        I think the strange appearance of the hands is due to the hand moving while being scanned. I remember, in high school, moving my hand inside a scanner while it was being scanned, causing all sorts of fun distortions: wavy fingers, extremely long fingers, etc.
- Re: (Score:3, Informative)
  
  by ebingo ( 533762 ) writes:
  
  There are scanners that flip pages themselves like this one: http://www.youtube.com/watch?v=UyB5c3S4vzc&feature=related [youtube.com] but I've seen somewhere (can't remember where though) a video of a scanner that was faster and didn't use vacuum to flip pages. It was quite a lot less noisy.
  - Re: (Score:3, Interesting)
    
    by trb ( 8509 ) writes:
    
    This is another one.
    http://www.treventus.com/index_en.html [treventus.com]
    http://www.youtube.com/watch?v=hlOQuuLYavY [youtube.com]
Unnecessary? (Score:2)

by sexconker ( 1179573 ) writes:

Can't you just calculate the 3D model of the page based on known stuff?
Make a generic flattener filter that takes in page height and length, as well as page number.
Manually tweak the output a bit for the first and last pages, and then intermediary pages can all be calculated with much more accuracy than you need.
Hell, with this method any book "scanned" (using a camera from overhead) could be processed. Let those college kids who love Google so much run their books through your filters (and do the manual
- Re:Unnecessary? (Score:4, Interesting)
  
  by MaWeiTao ( 908546 ) writes: on Friday May 15, 2009 @04:44PM (#27972349)
  
  Pages lie differently from the front to the back of the book, and books are bound differently. So you can't use a generic model and expect it to be accurate in most cases.
  I actually think this is really cool because it seems to account for any scenario, including folded pages, I would assume. Although, I suppose that in extreme bends it might not be perfect, but certainly they just need to ensure that pages are adequately flat. It automates the entire process.
  I wonder if they've built an automated page-turning mechanism; I would assume they have. Just drop in a book and let the machine go to town on it.
  
  - Re: (Score:2)
    
    by againjj ( 1132651 ) writes:
    
    Another poster [slashdot.org] shows that at least one book has been imaged with hand page turning [google.com]. Two pages of fingers.
  - Re: (Score:2)
    
    by sexconker ( 1179573 ) writes:
    
    I know they do - that's why you take a pic of the first and last pages first, and adjust those.
    The 3D model for all pages is the same - a sheet of paper of a certain length and width.
    The lay of the paper will be between the two extremes of the first and last pages.
    Effectively, you can define the lay of the paper as a simple curve in the x/y plane.
    The last page will be a flat line, the first page will be the most eccentrically curved.
    Page 3(4) lays almost identical to page 1(2).
    The curve is just a little fla
Isn't that all known? (Score:2, Insightful)

by Toonol ( 1057698 ) writes:

The technique is old, many years old. What is google's patent for? The use of a decades-old technique ON BOOKS?
- Re: (Score:2)
  
  by geekoid ( 135745 ) writes:
  
  Cite needed.
  You really don't understand what a patent is, do you?
  Hint: you don't patent ideas.
  - Re:Isn't that all known? (Score:4, Interesting)
    
    by Toonol ( 1057698 ) writes: on Friday May 15, 2009 @06:49PM (#27973693)
    
    "Looker."
    
    Building 3D computer models by stereoscopic analysis of projected light patterns is at least twenty years old. In fact the summary mentions that they use an established technique.
    
    As for your second comment... that's kind of my point. Since the technique is not new, the equipment is not new, what did google do that was new? Perhaps there is some actual invention in the process somewhere; but I don't have enough faith in the patent process to unquestioningly ASSUME that there is.
    
What are the chances... (Score:4, Interesting)

by Shaterri ( 253660 ) writes: on Friday May 15, 2009 @04:41PM (#27972305)

...that Google licenses this to scanner manufacturers and we see this at a consumer level at some point in the future? I know I'd pay good money for a book scanner that doesn't need to have a 'book edge' (which you already have to pay through the nose for)...

- Re: (Score:2)
  
  by againjj ( 1132651 ) writes:
  
  This is not about the imager per se. It is about the way to take images and post process them afterwards. Basically, they take three pictures, one in visible light and two in infrared, and then use the two in infrared to create a stereoscopic image and correct the image in visible light so it is not warped. From the patent, it does look like the imager is a camera, and not a scanner, since the description talks about a book resting on a platform with cameras above it. I do notice the patent makes no men
- - Re: (Score:2)
    
    by Wesley Felter ( 138342 ) writes:
    
    This type of image processing requires obscene amounts of memory and CPU time to do.
    That's OK; have you heard of Winmodems? Just imagine a cheap scanner that does all the correction in software on your PC.
Butt what about... (Score:2, Funny)

by radiumhahn ( 631215 ) writes:

Imagine what this technology could do for coworkers who like to photocopy their butts!
- Re:Butt what about... (Score:4, Funny)
  
  by DRACO- ( 175113 ) writes: on Friday May 15, 2009 @04:57PM (#27972509) Homepage Journal
  
  Is this what the graphics department means when they talk about bump mapping?
  Karma burn.
  
  - Re: (Score:3, Funny)
    
    by K. S. Kyosuke ( 729550 ) writes:
    
    That would be "bum mapping", obviously.
Why is this a big deal? (Score:5, Insightful)

by MBoffin ( 259181 ) writes: on Friday May 15, 2009 @04:48PM (#27972403) Homepage

I don't see why this is such a showstopper for other book scanning projects. Right off the top of my head I can think of three methods of dewarping book scans that have nothing to do with Google's methods. While Google's method is definitely quite interesting and seems like a great solution, it is by no means whatsoever the only way of accomplishing this.

- Re: (Score:3, Insightful)
  
  by BitZtream ( 692029 ) writes:
  
  No one said it's a big deal, it's simply a 'neat' way to accomplish the goal. As geeks we are generally interested in these neat ideas.
  No one said Google was evil for patenting it.
  No one said Google now has a monopoly on book scanning.
  No one really said anything other than 'this is how they do it' and we all said 'neat'.
cool, but not patent-worthy (Score:4, Insightful)

by Chirs ( 87576 ) writes: on Friday May 15, 2009 @04:56PM (#27972495)

This is useful and interesting, but doesn't seem particularly novel.
Projecting a known pattern onto a surface or using multiple cameras to determine the shape of a surface have been around for quite a while, so adding it to an OCR system doesn't seem like a big deal.

- Re: (Score:2)
  
  by geekoid ( 135745 ) writes:
  
  Yes it is you clueless N00b.
  It's the mechanism and how they do it that's patented, not the idea.
  If you patent something that turns widgets over, I can still patent something else that turns widgets over, as long as it does it DIFFERENTLY.
  Seriously people, it's pretty simple.
  Yes the Patent office needs to be tuned, but there is nothing wrong with the patent. In fact, what you seem to suggest would make the system completely unusable.
  Idiot.
But can they remove finger-scans and hand-scans? (Score:3, Interesting)

by waterbear ( 190559 ) writes: on Friday May 15, 2009 @05:49PM (#27973077)

De-warping sounds useful, but there are problems that it probably won't solve --
Like the operator who scans a book page with his/her fingers or hand stuck between the page and the scanner-glass. For example, the dreaded 'New York Hand' or its fingers can be seen occupying the place of part of the text or figures on many pages of books scanned for Google-Books from the New York Public Library. On some pages, the impression of the fingers is clear enough to show the rings worn by the Hand that was doing the scanning. :(
It will take more than a de-warping patent to solve that one .....
-wb-

- Re: (Score:2)
  
  by petermgreen ( 876956 ) writes:
  
  On that note has anyone tried the option on google books to report unreadable pages and if so do they do anything about it?
Seems like overkill (Score:2)

by SpinyNorman ( 33776 ) writes:

A typical book page has text on it in parallel lines, which can be used to correct for curvature, formatted into straight-edged rectangles, which can be used to correct for skew. Who needs another grid?
If a page doesn't have suitable text on it (e.g. a graphic), then just assume it's warped the same as the previous page (the one it's lying on top of).
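The text-line approach suggested here can be sketched very roughly: if the baseline of a line of text has already been detected as one row index per image column, each column can be shifted vertically until that baseline is straight. This only handles vertical displacement, not the horizontal compression near the spine, and it assumes baseline detection is solved elsewhere. `flatten_by_baseline` and the toy image are hypothetical.

```python
def flatten_by_baseline(image, baseline):
    """Shift each column of `image` (a list of rows) vertically so the
    detected text baseline, given as one row index per column, becomes
    horizontal at its mean height. Vacated pixels are filled with 0."""
    n_rows, n_cols = len(image), len(image[0])
    target = round(sum(baseline) / n_cols)
    out = [[0] * n_cols for _ in range(n_rows)]
    for c in range(n_cols):
        shift = target - baseline[c]
        for r in range(n_rows):
            src = r - shift
            if 0 <= src < n_rows:
                out[r][c] = image[src][c]
    return out


# Toy 5x3 image: one "text line" whose ink slopes downward to the right.
image = [
    [0, 0, 0],
    [1, 0, 0],
    [0, 1, 0],
    [0, 0, 1],
    [0, 0, 0],
]
flat = flatten_by_baseline(image, baseline=[1, 2, 3])
```

After the shift, the ink from all three columns sits on the same row, which is the sense in which parallel text lines can stand in for a projected grid.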
Wood chipper? (Score:3, Funny)

by mkcmkc ( 197982 ) writes: on Friday May 15, 2009 @06:30PM (#27973511)

This is way better than my idea, which was to throw the book into a wood chipper, scan the results, and then algorithmically reassemble them...

OCR (Score:3, Interesting)

by 12357bd ( 686909 ) writes: on Saturday May 16, 2009 @04:10AM (#27977119)

Google should return to the open source community a decent OCR app+engine. Tesseract+OCRopus are just too little, and it's already too late.
Windows already has decent OCR abilities, any HP scanner comes with decent image to page-document software. It's a shame that Google, which has been built upon open source and has maybe the best OCR technology in the world, hasn't returned a competitive and free OCR solution for Linux.

- Playing Catch-up (Score:2)
  
  by krog ( 25663 ) writes:
  
  ... The sophistication of the technology illustrates that would-be competitors who want to feature their own digitized libraries won't have a trivial time catching up to Google.
  Especially with that shiny new patent.
  - Re:Playing Catch-up (Score:5, Insightful)
    
    by jsnipy ( 913480 ) writes: on Friday May 15, 2009 @04:33PM (#27972227) Journal
    
    but to be honest this is at least a worthy patent
    
    - Re:Playing Catch-up (Score:5, Informative)
      
      by fuzzyfuzzyfungus ( 1223518 ) writes: on Friday May 15, 2009 @04:47PM (#27972387) Journal
      
      Obviously it was worthy enough to be issued; but I don't know how worthy it is in the broader sense.
      
      Notably, for instance, there has been a fair bit of interest, for some years, in using digital cameras in concert with projectors, either for automatic keystone/distortion correction, for projectors that aren't perfectly aligned with the projection surface, or for automatic coordination of multiple projectors illuminating the same surface, without laborious manual tiling adjustment. This is, in essence, an equivalent problem(inferring a surface's geometry based on pictures of a known image projected upon it).
      
      The IEEE has held "Projector-Camera systems" workshops since 2003 [procams.org], and somebody was obviously working on it before that. I'm not saying that Google's patent falls into asshole troll territory or anything; but the notion of doing surface geometry inference based on known image projection isn't nearly as novel as it might seem.
      
      - Re: (Score:3, Insightful)
        
        by poetmatt ( 793785 ) writes:
        
        This may be a projector thing, but they are doing something of physical manipulation. It would be pretty much appropriate to be patented. The whole thing is physically transformative. Meanwhile, if someone made their own version using something different, it too, would be patentable/improvement patent, which is how the patent system is supposed to work.
        To be clear, I'm saying the system as a whole should be patentable (infrared), but not the software used to decode it.
      - That reminds me (off topic) (Score:3)
        
        by DCstewieG ( 824956 ) writes:
        
        Totally off topic here but I'll risk it.
        It really bothers me that neither Rock Band nor Guitar Hero can auto-calibrate the audio lag using the microphone. There's absolutely no reason I can see that they can't "listen" for the calibration beeps with the mic to get a perfect reading.
        
        Re: (Score:2, Informative)
        
        by Anonymous Coward writes:
        
        Uhhh doesn't Rock Band 2 do that with a miniature microphone (and light sensor) built into the revised guitar?
        
        Re: (Score:2)
        
        by DCstewieG ( 824956 ) writes:
        
        Woah! I own the damn thing and I had no idea. In my defense I play drums 99% of the time. :) Thanks!
      - Re: (Score:2)
        
        by PopeRatzo ( 965947 ) * writes:
        
        Obviously it was worthy enough to be issued; but I don't know how worthy it is in the broader sense.
        Just when I thought you were going to make an interesting point on the worthiness of patents "in the broader sense" as you put it, it turns out you were just rooting for someone else.
        Your comment didn't turn out to be all that "worthy".
        In the broader sense, that is.
      - Re:Playing Catch-up (Score:5, Funny)
        
        by BikeHelmet ( 1437881 ) writes: on Friday May 15, 2009 @06:06PM (#27973253) Journal
        
        This is actually what I envisioned for a book scanner, years ago.
        But unlike Google, I...
        1) Never built it.
        2) Am not facing lawsuits from overzealous sue-happy publishers.
        Seems like a good defensive patent to have.
        
      - Re:Playing Catch-up (Score:5, Interesting)
        
        by Anonymous Coward writes: on Friday May 15, 2009 @06:36PM (#27973557)
        
        This trick has been used for 20 years in astronomy. You shine a really powerful laser of known metrics into the sky and measure the atmospheric distortion suffered by the beam.
        Then you take those numbers and calculate what it would take to even out the beam, and you feed THAT set of numbers to a telescope with adaptive optics which will then correct for the atmospheric distortion. Bingo, suddenly your telescope is able to take sharp images without having the air screw it up.
        The technique is very effective and results in ground-based telescopes that rival anything the Hubble can do. Plus they are easier to fix.
        I want to say this is called Guidestar but I am not sure.
        Anyway the similarity to Google's process is simply that you shine a light or image of known value on something unknown and look at how the image now deviates from what you expect. A little math and suddenly you know exactly the shape of the unknown object. Brilliant.
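
        The "little math" can be sketched roughly as follows. This is a minimal illustration, not Google's actual method: it assumes two rectified cameras (so depth follows the textbook relation Z = f * B / d) and it flattens a single page cross-section by arc length. The function names and the sample numbers are hypothetical.

        ```python
        import math


        def depth_from_disparity(focal_px, baseline_m, disparity_px):
            """Depth of a pattern point seen by two rectified cameras.

            Uses the standard stereo relation Z = f * B / d, where f is the
            focal length in pixels, B the camera baseline in metres, and d the
            horizontal shift (disparity) of the point between the two images.
            """
            if disparity_px <= 0:
                raise ValueError("disparity must be positive")
            return focal_px * baseline_m / disparity_px


        def flatten_profile(xs, zs):
            """Map samples of a curved page cross-section onto a flat page.

            xs are horizontal positions and zs the measured heights of the page
            surface. Each sample's flattened coordinate is the cumulative arc
            length along the curve, i.e. where it would sit if the paper were
            pulled flat without stretching.
            """
            flat = [0.0]
            for i in range(1, len(xs)):
                step = math.hypot(xs[i] - xs[i - 1], zs[i] - zs[i - 1])
                flat.append(flat[-1] + step)
            return flat


        # Hypothetical numbers: 1000 px focal length, 10 cm baseline, 50 px shift.
        depth = depth_from_disparity(1000, 0.1, 50)

        # A page that bulges up by 4 units halfway across a 6-unit span.
        flat_positions = flatten_profile([0, 3, 6], [0, 4, 0])
        ```

        The bulging page covers 6 units as seen from above but 10 units once flattened, which is why text near the spine looks squashed in an uncorrected photo.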
        
        
        Re:Playing Catch-up (Score:5, Informative)
        
        by tomz16 ( 992375 ) writes: on Friday May 15, 2009 @11:25PM (#27975913)
        
        It's simply called adaptive optics (AO). In AO, a guidestar is a natural isolated point-like star that is close to your science object (what you are trying to look at). If a laser is used to excite the sodium layer to create an artificial reference, it's called a "laser guidestar".
        Anyway, this "trick" is completely different from adaptive optics in both the mathematics and implementation.
        
    - Re:Playing Catch-up (Score:5, Interesting)
      
      by ushering05401 ( 1086795 ) writes: on Friday May 15, 2009 @04:55PM (#27972491) Journal
      
      Word.
      I was involved in evaluating rare books back around the turn of the century.
      I can personally attest that representatives of online book search companies were attempting to buy up one-of-a-kind pieces for destructive scanning.
      There was one dealer in possession of a somewhat flawed, but well-examined Shakespeare folio who had to put the kibosh on a reputation-making deal because he found out the buyer was going to slice the piece out of its binding for scanning.
      I turned down a much smaller offer on a much less significant, but still very cool, two hundred year old angler's guide (with hand colored plates and original binding) for the same reason.
      Quality scans without destruction can only help raise the profile of rare books and the value they offer society - not simply for their content, but as tangible examples of the evolution of the art of communication.
      
      - Re: (Score:3, Funny)
        
        by jwhitener ( 198343 ) writes:
        
        If you were a rare book expert during the turn of the century, why isn't your slashdot ID smaller?
        ;)
    - Re: (Score:3, Informative)
      
      by Pinky's Brain ( 1158667 ) writes:
      
      Really? Structured light to find 3D geometry is old hat ... the optical and signal processing part of book scanning seem pretty easy, making the mechanical part for page flipping robust seems a lot harder to me.
      - Well said. (Score:2)
        
        by Jartan ( 219704 ) writes:
        
        I'd also much rather hear how they managed the page flipping. Even with a lot of these machines they'd still have to achieve an impressive flipping rate without damaging the page being scanned.
  - Re: (Score:2)
    
    by petermgreen ( 876956 ) writes:
    
    Especially with that shiny new patent.
    Couldn't they just build and operate the scanner somewhere outside the patent's coverage area?
  - Re: (Score:2)
    
    by Jartan ( 219704 ) writes:
    
    I kind of doubt the patent will stop any competitors. It should be trivial to achieve the same result with dozens of different methods.
    I'm kind of surprised they used that method in fact. There should have been several that allowed them to scan the books without even requiring them to fully flip open and lay flat each page. With so many books to scan speed must have been important.
    I guess this method worked because the device was so cheap that they could just make a lot of scanners.
- Re:Why? (Score:4, Insightful)
  
  by vertinox ( 846076 ) writes: on Friday May 15, 2009 @04:45PM (#27972359)
  
  Ok, is it just me, but wouldn't it be easier to just cut the spine off the book instead of developing a whole new way of scanning it?
  With 7 million books, the manpower and time saved for them to cut the spine off would be worth it.
  Also, they can resell the books if needed or give them to charity after they are done.
  Kind of would be a waste of paper to tear that many books apart.
  
  - Re: (Score:2)
    
    by StikyPad ( 445176 ) writes:
    
    Kind of would be a waste of paper to tear that many books apart.
    Yep, and it would take a lot of spine.
    Oh man, I'm like a card catalog of puns today!
  - Re: (Score:2)
    
    by againjj ( 1132651 ) writes:
    
    I am willing to bet that they do that with cheap books (ones they buy), but not with expensive ones (ones they borrow). One certainly can't remove the spines of books in libraries or other collections.
- Re: (Score:2)
  
  by chill ( 34294 ) writes:
  
  Keep in mind, the majority of the books they are scanning are old, out-of-print and copyright expired texts. They aren't something you can pop over to Amazon and order another one of. So the bulk ARE old and/or valuable.
- Re: (Score:2)
  
  by moderatorrater ( 1095745 ) writes:
  
  Why use two separate processes for the two categories of book instead of using the same process for both, especially if this process cuts down on manpower and book damage?
- Re: (Score:2)
  
  by F34nor ( 321515 ) writes:
  
  Read "Rainbow's End" by Vernor Vinge
- Re: (Score:3, Insightful)
  
  by AndrewNeo ( 979708 ) writes:
  
  I really don't think the libraries that Google was scanning at would have appreciated that too much..
- Re:Why? (Score:5, Informative)
  
  by ChaosDiscord ( 4913 ) * writes: on Friday May 15, 2009 @05:11PM (#27972681) Homepage Journal
  
  Google is mostly scanning books borrowed from university libraries. Librarians get cranky if you borrow a book and return a stack of loose sheets of paper.
  
- Re: (Score:3, Interesting)
  
  by Chyeld ( 713439 ) writes:
  
  Only if Google refused to license it. Google isn't Microsoft or Intel; I doubt they'd go that route.
  In fact, since Google has paid for the innovation of this tech, including the R&D for it, patenting it and then allowing companies to license it reduces the barrier since companies that couldn't have paid for the research now have the technique available to them.
- Re:As a writer, I did not give my permission to co (Score:4, Interesting)
  
  by geekoid ( 135745 ) writes: <dadinportlandNO@SPAMyahoo.com> on Friday May 15, 2009 @05:36PM (#27972967) Homepage Journal
  
  Cough, you don't have to. I can copy your book all gad damn day long and have not violated your rights or the copyright code.
  The moment I try to distribute them, then it's a copyright violation.
  It's called copyright, because the only reason one would copy it was to distribute it.
  Backup really wasn't an issue then like it is now.
  
  - Re:As a writer, I did not give my permission to co (Score:4, Informative)
    
    by The Empiricist ( 854346 ) writes: on Friday May 15, 2009 @06:58PM (#27973781)
    
    Cough, you don't have to. I can copy your book all gad damn day long and have not violated your rights or the copyright code.
    The moment I try to distribute them, then it's a copyright violation.
    Be sure to check out the exclusive rights in copyrighted works [cornell.edu] before making blanket assertions on what is and is not legal under copyright law. The exclusive rights granted by copyright include both reproduction and distribution. There are lots of exceptions to these exclusive rights, but an interpretation that completely eviscerates the exclusive right to reproduce a work is not supported by the Copyright Act.
    
- Re: (Score:2)
  
  by zehaeva ( 1136559 ) writes:
  
  Good luck with asking that guy with the 1st edition of [insert incredibly rare 200 year old book here]. I'm sure he'll let you take it and butcher it.
