Research on deep learning techniques in breaking text-based Captchas and designing image-based Captcha

Resource type
Journal Article
Authors/contributors
Tang, M.; Gao, H.; Zhang, Y.; Liu, Y.; Zhang, P.; Wang, P.
Title
Research on deep learning techniques in breaking text-based Captchas and designing image-based Captcha
Abstract
The ability of hackers to infiltrate computer systems using computer attack programs and bots led to the development of Captchas or Completely Automated Public Turing Tests to Tell Computers and Humans Apart. The text Captcha is the most popular Captcha scheme given its ease of construction and user friendliness. However, the next generation of hackers and programmers has decreased the expected security of these mechanisms, leaving websites open to attack. Text Captchas are still widely used, because it is believed that the attack speeds are slow, typically two to five seconds per image, and this is not seen as a critical threat. In this paper, we introduce a simple, generic, and fast attack on text Captchas that effectively challenges that supposition. With deep learning techniques, our attack demonstrates a high success rate in breaking the Roman-character-based text Captchas deployed by the top 50 most popular international websites and three Chinese Captchas that use a larger character set. These targeted schemes cover almost all existing resistance mechanisms, demonstrating that our attack techniques are also applicable to other existing Captchas. Does this work then spell the beginning of the end for text-based Captcha? We believe so. A novel image-based Captcha named Style Area Captcha (SACaptcha) is proposed in this paper, which is based on semantic information understanding, pixel-level segmentation, and deep learning techniques. Having demonstrated that text Captchas are no longer secure, we hope that our proposal shows promise in the development of image-based Captchas using deep learning techniques.
Publication
IEEE Transactions on Information Forensics and Security
Volume
13
Issue
10
Pages
2522-2537
Date
2018-10
ISSN
1556-6021
Library Catalogue
IEEE Xplore
Extra
Conference Name: IEEE Transactions on Information Forensics and Security
Citation
Tang, M., Gao, H., Zhang, Y., Liu, Y., Zhang, P., & Wang, P. (2018). Research on deep learning techniques in breaking text-based Captchas and designing image-based Captcha. IEEE Transactions on Information Forensics and Security, 13(10), 2522–2537. https://doi.org/10.1109/TIFS.2018.2821096