CAPTCHAs: The Good, the Bad, and the Ugly ISSE-GI SICHERHEIT 2010 - PowerPoint PPT Presentation

CAPTCHAs: The Good, the Bad, and the Ugly ISSE-GI SICHERHEIT 2010 Paul Baecher*, Marc Fischlin*, Lior Gordon, Robert Langenberg, Michael Lützow, Dominique Schröder* * TU Darmstadt, supported by DFG Emmy Noether Program October 7, 2010 | 1

Introduction October 7, 2010 | 2

What Are CAPTCHAs? ◮ C ompletely A utomated P ublic T uring test to tell C omputers and H umans A part ◮ “reverse” Turing test, term coined by [vABHL03] ◮ challenge/response protocol where ◮ response should be easy to observe for humans ◮ response should be hard to compute for machines ◮ application: protect online services from automated use image: cryptographp October 7, 2010 | 3

What Are CAPTCHAs? ◮ C ompletely A utomated P ublic T uring test to tell C omputers and H umans A part ◮ “reverse” Turing test, term coined by [vABHL03] 0.01% according to [CLSC05, vAMM + 08] ◮ challenge/response protocol where ◮ response should be easy to observe for humans ◮ response should be hard to compute for machines ◮ application: protect online services from automated use image: cryptographp October 7, 2010 | 3

A Third Dimension ◮ easy for humans, hard for machines ◮ what about practicability? ◮ small display dimensions ◮ varying input devices/methods ◮ different media formats and support thereof ◮ acceptance by users ◮ environmental aspects (audio CAPTCHAs in a shared office. . . ) October 7, 2010 | 4

Bad CAPTCHAs October 7, 2010 | 5

Breaking Bad CAPTCHAs October 7, 2010 | 5

Three Bad CAPTCHAs ◮ Bundesamt für Wirtschaft und Ausfuhrkontrolle (BAFA) ◮ “Umweltprämie”, economic stimulus program ◮ Bundesrepublik Deutschland - Finanzagentur GmbH – Bundeswertpapiere ◮ online banking interface to governmental bonds ◮ Sparda-Banken ◮ online banking interface October 7, 2010 | 6

One Approach to Break Them All 1. preprocess the images ◮ the grid is static: rather trivial to remove ◮ the line always starts in the same location, follow and remove 2. segment characters ◮ easy, since they do not touch each other 3. detect individual characters ◮ use a k -means clustering algorithm to learn mean characters ◮ see next slide. . . October 7, 2010 | 7

From Characters to Vectors ◮ k -means clustering operates on d -dimensional vectors ◮ obtain a 1024-dimensional vector for each character ◮ scale character to a 32 × 32 pixels bounding box ◮ normalize brightness of each pixel to [0, 1] ◮ traverse pixels in a unique sequence October 7, 2010 | 8

Breaking a CAPTCHA ◮ offline (training) phase ◮ obtain a set of training data CAPTCHA challenges ◮ preprocess and run k -means algorithm (Lloyd’s algorithm) ◮ use labels to correct a few errors ◮ save mean characters ◮ online (query) phase ◮ preprocess and find nearest cluster October 7, 2010 | 9

Results ◮ experimental results of our implementation: “Umweltprämie” 68% Bundeswertpapiere 70% Sparda-Banken 87 % ◮ 5% is considered broken according to [vAMM + 08] using tesseract OCR October 7, 2010 | 10

Better CAPTCHAs October 7, 2010 | 11

Designing Good CAPTCHAs ◮ use random challenge strings ◮ dictionary words help the attacker ◮ interpolate partially detected word fragments ◮ make an offline-decision ◮ use monochromatic images ◮ require segmentation ◮ mere recognition is not enough [CLSC05] ◮ apply distortions with many degrees of freedom October 7, 2010 | 12

Implementation Pitfall ◮ one version per challenge ◮ ❞✐❣❣✳❝♦♠ ◮ q✉♦❦❛✳❞❡ ◮ consider an attacker that is able to recognize one randomly chosen character October 7, 2010 | 13

reCAPTCHA October 7, 2010 | 14

reCAPTCHA ◮ unique concept ◮ human OCR system ◮ verification words, scan words ◮ proprietary but free centralized service ◮ very popular (facebook, . . . ) ◮ secure? major revisions of reCAPTCHA October 7, 2010 | 15

reCAPTCHA Considered Broken ◮ first generation, early 2008 ◮ broken by Wilkins using erode/dilate and OCR [Wil09], 5%* ◮ second generation, until December 2009 ◮ broken by Wilkins, 5%*; our results: 6–10% ◮ third generation, until August 2010 ◮ broken by Houck [Hou10], 10%; our results: ca. 6% ◮ fourth (current) generation ◮ broken by Houck, 30% broken! October 7, 2010 | 16

Conclusions ◮ the majority of all CAPTCHAs can be broken easily ◮ not hard to avoid most common errors ◮ rely on segmentation task ◮ reCAPTCHA is (was?) a good choice ◮ designing a robust CAPTCHA seems extremely difficult October 7, 2010 | 17

The End Thank you! ? October 7, 2010 | 18

References Kumar Chellapilla, Kevin Larson, Patrice Y. Simard, and Mary Czerwinski. Building Segmentation Based Human-Friendly Human Interaction Proofs (HIPs). In HIP , volume 3517 of Lecture Notes in Computer Science , pages 1–26. Springer-Verlag, 2005. Chad W. Houck. Decoding reCAPTCHA. ❤tt♣✿✴✴✇✇✇✳♥✸♦♥✳♦r❣✴♣r♦❥❡❝ts✴r❡❈❆P❚❈❍❆✴❞♦❝s✴r❡❈❆P❚❈❍❆✳❞♦❝① , 2010. Luis von Ahn, Manuel Blum, Nicholas J. Hopper, and John Langford. CAPTCHA: Using Hard AI Problems for Security. In Eli Biham, editor, Advances in Cryptology – EUROCRYPT 2003 , volume 2656 of Lecture Notes in Computer Science , pages 294–311, Warsaw, Poland, May 4–8, 2003. Springer, Berlin, Germany. Luis von Ahn, Benjamin Maurer, Colin McMillen, David Abraham, and Manuel Blum. reCAPTCHA: Human-Based Character Recognition via Web Security Measures. Science , 321(5895):1465–1468, 2008. Jonathan Wilkins. Strong CAPTCHA Guidelines v1.2. 2009. October 7, 2010 | 19

CAPTCHAs: The Good, the Bad, and the Ugly ISSE-GI SICHERHEIT 2010 - PowerPoint PPT Presentation

CAPTCHAs: The Good, the Bad, and the Ugly ISSE-GI SICHERHEIT 2010 Paul Baecher, Marc Fischlin, Lior Gordon, Robert Langenberg, Michael Ltzow, Dominique Schrder* * TU Darmstadt, supported by DFG Emmy Noether Program October 7, 2010 | 1

he Good, Good, the the Bad Bad & & the the Ugly Ugly Rev Revisited isited for for

The Goo ood, The Bad Bad, The Ug Ugly Ug Ugly Dr Gray, Dr Trevathan, Mrs Madisa This

The good, the bad and the ugly of online community engagement The good! The Really Good The

Bar Induction: The Good, the Bad and the Ugly Vincent Rahli, Mark Bickford, and Robert L.

The Good, The Bad and The Ug Ugly Ellis Jacobs, Ph.D., DABCC, FAACC Principal, EJ Clinical

November 21, 2012 Hydraulic Fracturing: The Good, The Bad, and the Ugly Stephan MacLellan, P.Eng

Being a PTL: The Good, the Bad, and the Ugly

MC Tools and NLO Monte Carlos or The Good, the Bad & the Ugly Frank Krauss Institute for

Animated Captchas And Games For Advertising Suhas Aggarwal IIT Guwahati May 2013 1 Content

EPL682 - PAPERS ---------- Re: CAPTCHAs Understanding CAPTCHA-Solving Services in an Economic

Something about audio CAPTCHAs Elie Bursztein, Romain Bauxis, Daniele Perito, Hristo Paskov,

Reverse Engineering CAPTCHAs Abram Hindle, Micheal W. Godfrey, Richard C. Holt Software

Limitations of next generation 0 germanium experiments: The Good, the Bad and the Ugly

Charter Schools The Good, the Bad, and the Ugly What it takes to start one. Dr. Erin Craig

PROTEIN & FATS THE GOOD, THE BAD & THE UGLY Conner Middelmann, B.Sc., DipION Nutrition

Good Data Gone Bad, Bad Data Gone Worse Renee Phillips pgconf.eu 2019 1 This is me. 2 Sakeeb

Web Authentication using Third-parties in Untrusted Environments Anna Vapen PhD Thesis

Yubikey Discovery and fjrst use of Yubico's Yubikey Presented by : Maxime de Roucy

Wireless Network Security Vedavyas Duggirala CS 6204, Spring 2005 1 Wireless Devices - Benefits

Attacking RO-PUFs with Enhanced Challenge-Response Pairs Nils Wisiol and Marian Margraf

Datagram Transport Layer Security in Constrained Environments draft-hartke-core-codtls-01 Klaus

The Design and Implementation of Protocol- Based Hidden Key Recovery Eu-Jin Goh, S tanford Dan

Yasser F. O. Mohammad REMIDER 1: IPSec Uses REMINDER 2: IPSec Services Access control

Lecture 14 Transport Layer Security/ Secure Socket Layer (TLS/SSL) (Chapter 9 in KPS) SSL:

CAPTCHAs: The Good, the Bad, and the Ugly ISSE-GI SICHERHEIT 2010 - PowerPoint PPT Presentation

CAPTCHAs: The Good, the Bad, and the Ugly ISSE-GI SICHERHEIT 2010 Paul Baecher*, Marc Fischlin*, Lior Gordon, Robert Langenberg, Michael Ltzow, Dominique Schrder* * TU Darmstadt, supported by DFG Emmy Noether Program October 7, 2010 | 1

he Good, Good, the the Bad Bad &amp; &amp; the the Ugly Ugly Rev Revisited isited for for

The Goo ood, The Bad Bad, The Ug Ugly Ug Ugly Dr Gray, Dr Trevathan, Mrs Madisa This

The good, the bad and the ugly of online community engagement The good! The Really Good The

Bar Induction: The Good, the Bad and the Ugly Vincent Rahli, Mark Bickford, and Robert L.

The Good, The Bad and The Ug Ugly Ellis Jacobs, Ph.D., DABCC, FAACC Principal, EJ Clinical

November 21, 2012 Hydraulic Fracturing: The Good, The Bad, and the Ugly Stephan MacLellan, P.Eng

Being a PTL: The Good, the Bad, and the Ugly

MC Tools and NLO Monte Carlos or The Good, the Bad &amp; the Ugly Frank Krauss Institute for

Animated Captchas And Games For Advertising Suhas Aggarwal IIT Guwahati May 2013 1 Content

EPL682 - PAPERS ---------- Re: CAPTCHAs Understanding CAPTCHA-Solving Services in an Economic

Something about audio CAPTCHAs Elie Bursztein, Romain Bauxis, Daniele Perito, Hristo Paskov,

Reverse Engineering CAPTCHAs Abram Hindle, Micheal W. Godfrey, Richard C. Holt Software

Limitations of next generation 0 germanium experiments: The Good, the Bad and the Ugly

Charter Schools The Good, the Bad, and the Ugly What it takes to start one. Dr. Erin Craig

PROTEIN &amp; FATS THE GOOD, THE BAD &amp; THE UGLY Conner Middelmann, B.Sc., DipION Nutrition

Good Data Gone Bad, Bad Data Gone Worse Renee Phillips pgconf.eu 2019 1 This is me. 2 Sakeeb

Web Authentication using Third-parties in Untrusted Environments Anna Vapen PhD Thesis

Yubikey Discovery and fjrst use of Yubico's Yubikey Presented by : Maxime de Roucy

Wireless Network Security Vedavyas Duggirala CS 6204, Spring 2005 1 Wireless Devices - Benefits

Attacking RO-PUFs with Enhanced Challenge-Response Pairs Nils Wisiol and Marian Margraf

Datagram Transport Layer Security in Constrained Environments draft-hartke-core-codtls-01 Klaus

The Design and Implementation of Protocol- Based Hidden Key Recovery Eu-Jin Goh, S tanford Dan

Yasser F. O. Mohammad REMIDER 1: IPSec Uses REMINDER 2: IPSec Services Access control

Lecture 14 Transport Layer Security/ Secure Socket Layer (TLS/SSL) (Chapter 9 in KPS) SSL:

CAPTCHAs: The Good, the Bad, and the Ugly ISSE-GI SICHERHEIT 2010 Paul Baecher, Marc Fischlin, Lior Gordon, Robert Langenberg, Michael Ltzow, Dominique Schrder* * TU Darmstadt, supported by DFG Emmy Noether Program October 7, 2010 | 1

he Good, Good, the the Bad Bad & & the the Ugly Ugly Rev Revisited isited for for

MC Tools and NLO Monte Carlos or The Good, the Bad & the Ugly Frank Krauss Institute for

PROTEIN & FATS THE GOOD, THE BAD & THE UGLY Conner Middelmann, B.Sc., DipION Nutrition