Research on deep learning techniques in breaking text-based Captchas and designing image-based Captcha

Tang, Mengyun; Gao, Haichang; Zhang, Yang; Liu, Yi; Zhang, Ping; Wang, Ping

doi:10.1109/TIFS.2018.2821096

Research on deep learning techniques in breaking text-based Captchas and designing image-based Captcha

Open in Zotero

View on zotero.org

Resource type

Journal Article

Authors/contributors

Tang, Mengyun (Author)
Gao, Haichang (Author)
Zhang, Yang (Author)
Liu, Yi (Author)
Zhang, Ping (Author)
Wang, Ping (Author)

Title

Research on deep learning techniques in breaking text-based Captchas and designing image-based Captcha

Abstract

The ability of hackers to infiltrate computer systems using computer attack programs and bots led to the development of Captchas or Completely Automated Public Turing Tests to Tell Computers and Humans Apart. The text Captcha is the most popular Captcha scheme given its ease of construction and user friendliness. However, the next generation of hackers and programmers has decreased the expected security of these mechanisms, leaving websites open to attack. Text Captchas are still widely used, because it is believed that the attack speeds are slow, typically two to five seconds per image, and this is not seen as a critical threat. In this paper, we introduce a simple, generic, and fast attack on text Captchas that effectively challenges that supposition. With deep learning techniques, our attack demonstrates a high success rate in breaking the Roman-character-based text Captchas deployed by the top 50 most popular international websites and three Chinese Captchas that use a larger character set. These targeted schemes cover almost all existing resistance mechanisms, demonstrating that our attack techniques are also applicable to other existing Captchas. Does this work then spell the beginning of the end for text-based Captcha? We believe so. A novel image-based Captcha named Style Area Captcha (SACaptcha) is proposed in this paper, which is based on semantic information understanding, pixel-level segmentation, and deep learning techniques. Having demonstrated that text Captchas are no longer secure, we hope that our proposal shows promise in the development of image-based Captchas using deep learning techniques.

Publication

IEEE Transactions on Information Forensics and Security

Volume

13

Issue

10

Pages

2522-2537

Date

2018-10

DOI

10.1109/TIFS.2018.2821096

ISSN

1556-6021

Library Catalogue

IEEE Xplore

Extra

Conference Name: IEEE Transactions on Information Forensics and Security

Citation

Tang, M., Gao, H., Zhang, Y., Liu, Y., Zhang, P., & Wang, P. (2018). Research on deep learning techniques in breaking text-based Captchas and designing image-based Captcha. IEEE Transactions on Information Forensics and Security, 13(10), 2522–2537. https://doi.org/10.1109/TIFS.2018.2821096

Link to this record

https://docs.edtechhub.org/lib/4GMXQAMG