Spammers Solving Difficult AI Problems With An Underground X Prize

from the fascinating dept

Slashdot points us to an interview with Luis von Ahn (who we're a big fan of), where he talks about how spammers who are frustrated by various types of CAPTCHA tests have set up their own sort of "innovation prize," offering up somewhere in the range of $500,000 for software that can automatically pass CAPTCHA and reCAPTCHA reading tests (the things where you have to fill in a series of letters to sign up for a service or post a comment). As von Ahn points out: "If [the spammers] are really able to write a programme to read distorted text, great -- they have solved an AI problem." It is, effectively, an "X Prize" for optical character recognition. Not that we like to encourage spammers, but it is rather fascinating how the underground business seems to mirror the above ground innovation world as well.


Reader Comments (rss)

(Flattened / Threaded)

  •  
    identicon
    Bob V, Apr 22nd, 2009 @ 4:35am

    You know I'd almost pay for software that could effectively do this. I'm not colorblind, my vision is good, I have a good monitor but somehow about 25% of the time when i type them in I get it wrong.

     

    reply to this | link to this | view in chronology ]

  •  
    identicon
    Xanthir, FCD, Apr 22nd, 2009 @ 4:42am

    Awesome

    For those of you who don't know, reCAPTCHA is a very popular captcha program that uses images of words that couldn't be read properly by the OCR software used by various book digitizing programs. The fact that the OCR software failed means that it's unlikely any other similar software will succeed, which is what makes it such a great captcha.

    The neat thing is that it forces you to decipher *two* words, one of which is already known and one which is not. If you get the known word right, you pass. Once enough people give the same answer for a particular unknown word, though, that information is passed back upstream to the book digitizers.

    If the spammers can solve this reliably, it means that they've made a great advancement in the field of OCR which can be passed back to the book digitizers for great benefits. And it still won't defeat reCAPTCHA unless the new software is *perfect* - there will still be words that can't be read by the new software.

     

    reply to this | link to this | view in chronology ]

    •  
      identicon
      Anonymous Coward, Apr 22nd, 2009 @ 5:00am

      Re: Awesome

      however, if you were to write a program and seed it with correct answers and have it consistently send the wrong answers back to recaptcha you may be able to trick their servers into accepting wrong answers and then build up your set of solved captchas simply by using a reasonably small set of seeds. ... but then again Im not working on the recaptcha so they may already have things in place to prevent this, but there are a ton of sites using it, and i feel like it is almost entirely automated, so this type of hack could potentially ruin their model.

       

      reply to this | link to this | view in chronology ]

    •  
      icon
      WarOtter (profile), Apr 22nd, 2009 @ 6:04am

      Re: Awesome

      Any links to where this is in use?

       

      reply to this | link to this | view in chronology ]

  •  
    identicon
    Anonymous Coward, Apr 22nd, 2009 @ 5:55am

    wait till someone does it, patents it, then gets upset when nonspammers use it and starts suing

     

    reply to this | link to this | view in chronology ]

  •  
    identicon
    Anonymous Coward, Apr 22nd, 2009 @ 5:57am

    DRM effect

    Capcha's may end up having a DRM-like effect as they get harder and harder for humans to to read. They will be keeping out legitimate users, but they may end up allowing easy access to the illicit visitors using CapchaBots.

     

    reply to this | link to this | view in chronology ]

  •  
    icon
    TX CHL Instructor (profile), Apr 22nd, 2009 @ 6:05am

    CAPCHA

    I thought CAPCHA had been handily defeated already by offering free porn, and serving up an image capture of the capcha.
    --
    www.chl-tx.com (Thanks, BHO, for the wonderful stimulus you have given *MY* business)

     

    reply to this | link to this | view in chronology ]

  •  
    identicon
    Wesha, Apr 22nd, 2009 @ 8:38am

    Solved! Gimme $500,000!

     

    reply to this | link to this | view in chronology ]

  •  
    identicon
    Adam, Apr 22nd, 2009 @ 8:53am

    We can *totally* trust them to give the award!

    No doubt these same honorable folks who manage botnets and such will be men and women of honor. In a worst case scenario, they'll offer one MILLION dollars in Free V1agra to the winner instead of cash :P

     

    reply to this | link to this | view in chronology ]

  •  
    identicon
    Matt Bennett, Apr 22nd, 2009 @ 11:43am

    I'm not sure I'd trust whoever was offering the reward to actually pay me.

     

    reply to this | link to this | view in chronology ]


Add Your Comment

Have a Techdirt Account? Sign in now. Want one? Register here
Get Techdirt’s Daily Email
Save me a cookie
  • Note: A CRLF will be replaced by a break tag (<br>), all other allowable HTML will remain intact
  • Allowed HTML Tags: <b> <i> <a> <em> <br> <strong> <blockquote> <hr> <tt>
Follow Techdirt
A word from our sponsors...
Essential Reading
Techdirt Reading List
Techdirt Insider Chat
A word from our sponsors...
Recent Stories
A word from our sponsors...

Close

Email This