What is CAPTCHA? How it Works? | All You Need to Know!

64186

20 May 2020 • 10 min read

What is CAPTCHA? How it Works? | All You Need to Know!

20 May 2020 • 10 min read

What is CAPTCHA?

CAPTCHA is an acronym for “Completely Automated Public Turing test to tell Computers and Humans Apart.”, which is a module used in websites, mobile apps, and APIs to distinguish automated computer programs from genuine human users.

Sometimes, CAPTCHA is described as reverse Turing tests. As we know, the Turing test is a method of inquiry in artificial intelligence where a computer has to convince a human that it's a human. Therefore a reverse Turing test is a human convincing a computer that it is not a computer. If you write a program that automatically generates such a test on the internet, then you get yourself a CAPTCHA.

History of CAPTCHA

The first commercial deployment of a CAPTCHA-like system was done by Andrei Broder and his fellow engineers at the AltaVista search engine. They’ve developed an automated filter system to stop bots from submitting URLs (a type of black hat SEO) that skewed AltaVista’s algorithm.

However, the term CAPTCHA was coined in 2003 by Luis von Ahn and his colleagues at Carnegie Mellon University in their publication “CAPTCHA: Using Hard AI Problems for Security.”

different test-based captchas

For more information, please refer to "History of CAPTCHA - The Origin Story".

How Does CAPTCHA Work?

CAPTCHAs are deployed at operational gateways such as login, register, submit, etc., to prevent computer programs from accessing and committing fraud and abuse.

The original idea behind CAPTCHA is to present cognitive challenges in the form of distorted texts that humans can recognize and pass easily while computer programs cannot.

obsolete captcha type examples

However, with the advancement of computer technologies, traditional CAPTCHAs quickly became obsolete. Today, advanced CAPTCHA modules use risk analysis engines based on behavioral detection and artificial intelligence technologies to prevent sophisticated bot threats.

What are CAPTCHAs used for?

Digital criminals use bot programs to commit fraud and abuse through automation of tasks in a malicious way such as credential stuffing attacks, email or comment spam, online poll fraud, etc. CAPTCHA prevents digital criminals from automating their fraudulent tasks by ensuring there is a real human behind each request, thus effectively preventing scammers from scaling their illicit operations and committing online fraud and abuse.

CAPTCHA prevents threats including:

Web Scraping: Prevents valuable content theft
Ticket Scalping: Prevents fraudsters from automatically obtaining tickets for resale
Bonus Abuse: Prevents attackers from automatically acquiring bonuses and gifts
Abusive Traffic: Prevents platform resources from being maliciously wasted
Fake Comments: Prevents bots from artificially inflating or deflating public endorsements
Spam Email: Prevents the spread of spam email
Account Takeover: Prevents brute force and credential stuffing attacks and protects account security

Without CAPTCHA, spam and abuse would take over most of the platforms, and the internet ecosystem as we know it would not exist.

Types of CAPTCHAs: CAPTCHA's Evolution

Over the past two decades, CAPTCHA has evolved in three generations to defend against the increasingly sophisticated bad bots as well as meet the user's needs for a smoother experience.

First Generation: Standard CAPTCHA

This Captcha takes simple logic: Humans are better than machines at recognizing twisted and warped text letters. It includes:

Text-based Captcha
Audio Captcha

The superiority of humans over machine programs in recognizing twisted and warped text letters. By introducing noise in the form of different widths, heights, background patterns, borders, and so on, text letters would become impossible to be recognized by OCR(Optical Character Recognition) technology at the time.

Humans solved the challenges with only a 33% success rate while the computer had an accuracy of 99.8% in recognizing heavily distorted texts. This marked the end for first-generation text-based CAPTCHAs. Standard CAPTCHA (also known as text-based CAPTCHA) became obsolete in 2014 when google pitted one of its machine learning algorithms against humans on recognizing heavily distorted text .

Second Generation: Gamified CAPTCHA

This CAPTCHA had left the text-based input approach for more innovative challenges that were deemed them very difficult for machines to bypass. These challenges included logic puzzles, visual comparisons, movement-based CAPTCHAs, or math challenges.

However, even though the second generation of CAPTCHAs looked very different from the first one, the logic behind the challenges stayed very similar: the superiority of humans over machine programs in recognizing images, numbers, or various objects. It includes:

Image Recognition
Image Orientation
Math Challenge
Logic Questions
And many more

As computer technology advanced and bad bots were becoming better at solving such puzzles, the CAPTCHAs had to be increasingly difficult. As the user friction created by difficult CAPTCHAs has become too severe and the advanced AI technology has deemed the second-generation gamified CAPTCHAs merely ineffective.

Third Generation: No-knowledge CAPTCHA

This Captcha has taken the human verification process into a new dimension by introducing advanced risk analysis into the equation. With no requirement of human thinking, no-knowledge CAPTCHAs have minimum to no interruption to user operations and provide a much better user experience.

The back-end risk analysis based on behavioral factors within a confined space, as well as environmental factors such as device reputation, hardware specifications, etc., are utilized to tell apart genuine human behavior from automated human behavior. AI-powered bots can mimic human behavior, and AI-powered CAPTCHA is a necessity to stop advanced bot threats.

Comparison

The first and second generations of CAPTCHAs had a paradoxical logic that stated: humans, compared to machines, have superiority in recognizing images, numbers, or various objects. This means that as computer programs get better at recognizing characters and images, the CAPTCHA challenges have to be increasingly difficult to prevent computer programs from bypassing them.

difficult traditional captchas

As a result, the difficulty of CAPTCHAs has increased exponentially with the advancements in computer and AI technologies. Today, a better solution is the third-generation no-knowledge CAPTCHAs. Advanced CAPTCHA solutions are much easier to pass and infinitely more secure than traditional CAPTCHA solutions.

What Triggers a CAPTCHA Test?

A CAPTCHA test is triggered if the risk analysis engine cannot ensure that the user is indeed a human or a bot. Thus, it marks the user as suspicious, and a challenge-response is presented. There are many factors behind risk analysis, which are the secret sauce of the system. Some known factors are the parameters related to user behavior, such as operating speed and the IP address.

Are CAPTCHAs Enough for Stopping Malicious Bots?

The Limitation of Traditional CAPTCHA

Fraudsters have been exploiting systems with automated attacks since the early days of the Internet. Traditional CAPTCHA saved us from bot threats in the early times.

However, backed by massive financial motivation and advancing computer technologies, fraudsters keep finding ways to bypass or crack CAPTCHA measures, leaving the ecosystem vulnerable to bot attacks. Some bots can get past the text CAPTCHAs on their own. Researchers have demonstrated ways to write a program that beats image recognition CAPTCHAs as well. In addition, attackers can use click farms to beat the tests: thousands of low-paid workers solving CAPTCHAs on behalf of bots.

Most Popular CAPTCHA Alternatives

Captcha vs Honeypot: Honeypots are often pointed as the frictionless alternative to captchas. However, honeypots alone are not sufficient to stop spambots. The modern bot threats are smart enough to overcome such tricks with ease and stopping them requires more advanced solutions than a simple honeypot.
Captcha vs 2FA/MFA: Two-factor authentication is an identity verification system that validates the users’ real identity, while a captcha is used to determine whether the user is a human or a bot and cannot answer which specific human is behind the request. Captcha and 2FA/MFA solve different problems, therefore, they are not the alternative to one another.
Captcha vs Bot Manager: When it comes to fighting modern bot threats, the challenge for all advanced solutions is the same; to detect non-human behavior within the online traffic. While CAPTCHA stops this non-human traffic through the help of an interactive interaction, bot managers do this by analyzing the entire traffic to understand users’ intent. The behavioral data observed by advanced captchas are restricted within a pre-defined space. Moreover, when the risk analysis engine of a bot manager cannot ensure whether the user is an actual human or a bot, the suspicious traffic is directed to a challenge-response of a CAPTCHA system.

Advanced CAPTCHA: Future of CAPTCHA Technology

Industry experts hail machine learning as the most promising solution against automated threats. When the behavioral analysis is integrated into CAPTCHA, the challenge-response becomes a way to collect biometric data, instead of a cognitive challenge. This is a dramatic change in the logic of bot defense compared to older generations of CAPTCHAs.

Biometric data is used in the risk analysis engine to determine whether the behavior is human or machine, which allows challenges to be much harder for bots and a lot easier for all humans. This type of CAPTCHA is often referred to as “Advanced CAPTCHA”.

Advanced CAPTCHA is an anti-bot program that distinguishes itself from regular CAPTCHAs by utilizing a risk analysis engine (often based on behavioral analysis) over or within its challenge-response mechanism. While the legacy captcha systems ensured security with the difficulty of the challenge response, advanced captchas focused on behavioral characteristics of the traffic instead, providing higher security with less user friction.

Advanced CAPTCHAs can be used as stand-alone solutions to mitigate up to 98% of automated threats, and the benefit of bot managers over CAPTCHAs is minimal at best. In the field of cybersecurity, there is no 100% safety and in the endeavor to stop online fraud & abuse caused by automated programs, advanced CAPTCHAs can successfully break the business model of fraudsters.

GeeTest CAPTCHA: An Advanced CAPTCHA to Secure Cybersecurity

As CAPTCHA hacking methods are abundant, and tools of hacking are easily accessible, traditional CAPTCHAs are easily bypassed, making the sites vulnerable to malicious automated attacks.

However, with the introduction of GeeTest Adaptive CAPTCHA, the era of CAPTCHA is far from over. When integrated with a back-end engine, the possibilities for this advanced CAPTCHA are far and wide. We can observe the most prominent of those sophistication possibilities under three main categories:

Environmental Detection
Behavioral Detection
Dynamic Resource Database

Securing CAPTCHA with Environmental Detection

Environment detection refers to the information retrieved from the user’s computer environment, such as the hardware specification, various devices, screen size, browser properties, version, etc.

Using elaborate machine learning models for advanced risk analysis, the environmental information can be used to detect browser automation tools accurately. By mitigating browser automation tools from the arsenal of hackers, an advanced CAPTCHA solution can significantly limit the ability of hackers to stay under the radar and scale their fraudulent operations.

While strong front-end encryption and dynamic honeypots can mitigate the threat of API hacking, sophisticated origin detection techniques can pinpoint requests from CAPTCHA farms.

Securing CAPTCHA with Behavioral Detection

The integration of behavioral analysis into CAPTCHA allows challenges to be less about the ‘correct’ answer and more about ‘the method’ of acquiring the answer.

Biometric data generated through the user’s interaction with the CAPTCHA module is used in the risk analysis engine to determine whether the behavior belongs to a human or a machine. This is a dramatic change for the logic of bot defense compared to older generations of captchas and a crucial feature for any relevant advanced captcha solutions.

A biometric classification model within a CAPTCHA model means that merely using ML and OCR to crack the challenge is not enough. An automated program has to not only crack the challenge but also do so while perfectly mimicking human behavior. Generating biometric data that is genuinely human to pass the risk analysis engine -though possible- still introduces enough limitations to prevent “a successful bot attack” from occurring.

Securing CAPTCHA with Dynamic Resource Database

Once a CAPTCHA is presented to a user, the image used within the challenge becomes public. This means hackers can use these images to train a machine learning model or use them for reverse library types of attacks. Therefore, images used within the challenges can pose a threat to the security of the CAPTCHA.

By continuously updating the resource pool and encrypting the images used within the challenges, advanced captchas can prevent reverse library and brute force type of attacks, significantly increasing the cost of attempting an attack.

Conclusion

Advanced CAPTCHA is a business imperative, not an IT imperative.

If your online business operations are valuable for your business, you should go with an enterprise-grade advanced captcha that is secure and provides a seamless user experience and 24/7 support. Enterprise-grade advanced captchas can stay up to date with emerging threats, and mitigate the risk of your business being a target of sophisticated bad bots.

There are a lot of industries that can benefit significantly from an advanced CAPTCHA solution, including:

E-Commerce
Retail
Blockchain
Airline
Travel
Gaming

Most websites or mobile apps with critical operations (such as login, register, form submission, etc.) need an advanced CAPTCHA to prevent automated attacks from happening. With the availability and affordability of machine learning and cloud computing tools, today, the bad actors can reach further and hit harder than ever before. Without a strong deterrence such as an Advanced CAPTCHA solution, bot attacks are only a matter of when. Advanced CAPTCHAs increase the cost of attack exponentially and are a necessity for the protection of most websites, mobile apps, and APIs.

With over 12 years of enterprise-grade captcha services experience, GeeTest has served 360,000+ enterprises worldwide including Airbnb, Binance, Xiaomi, etc., and processes 1,000,000,000+ requests per day. Try GeeTest Adaptive CAPTCHA and protect your website, app, and APIs from CAPTCHA bot attacks, or register for a free 30-day trial now！

geetest adaptive captcha

Start your free trial

Over 320,000 websites and mobile apps worldwide are protected by GeeTest captcha

Get started

Learn more

GeeTest

Subscribe to our newsletter

SMS Pumping is a fraud where attackers use bots and premium-rate numbers to trigger fake SMS, causing financial losses and service disruptions in 2025.

Botpedia

What is SMS Pumping and How Does It Work in 2025

SMS Pumping is a fraud where attackers use bots and premium-rate numbers to trigger fake SMS, causing financial losses and service disruptions in 2025.

GeeTest • 2 min read