CAPTCHA Protection Rules

How we defend Forms Service from automated attacks without frustrating users

Overview

Forms Service is constantly under attack. Bots try to scrape forms, attackers test stolen credentials, and malicious actors send spam at scale. CAPTCHA is our primary defense.

But there's a cost: always-on CAPTCHA frustrates legitimate users and reduces conversion rates. This document explains how we've solved that trade-off using five targeted rules that activate CAPTCHA only when we detect suspicious patterns.

Why CAPTCHA Matters

CAPTCHA protects against four major threat vectors:

Automated attacks — Bots sending thousands of requests per second, overwhelming your infrastructure
Credential stuffing — Attackers testing stolen username/password combinations at massive scale
Spam submissions — Malicious actors bulk-submitting garbage data to pollute your database
Rate-based DoS — Coordinated volumetric attacks designed to crash your service

The fundamental trade-off: No CAPTCHA means attackers win. Always-on CAPTCHA means users lose. Our five rules split the difference: CAPTCHA activates only when we detect something genuinely suspicious.

The Five Rules Explained

Rule 1: High Request Volume from Single IP

Trigger Condition

More than 500 requests from the same IP address in less than 20 minutes.

What This Detects

A single attacker (or compromised machine) firing requests as fast as possible. The pattern suggests credential stuffing, form scraping, or direct API abuse. The 500-request threshold is high enough to miss legitimate bulk users but low enough to catch obvious attacks.

User Impact

Legitimate high-volume users behind a shared corporate IP might see unexpected CAPTCHA. The 20-minute window is tight enough that it only applies during the burst, not to the entire IP range for the day.

Best Used For

This is your first line of defense. It catches the most common attack pattern: repeated requests from a single source in quick succession.

Rule 2: Blacklisted IP Address

Trigger Condition

The request comes from an IP address in your internal blacklist.

What This Detects

Known malicious IP addresses from previous attacks, external threat feeds, or manual security team additions. This includes botnets, proxy services, and data center IPs commonly used for abuse.

User Impact

Users behind blacklisted IPs see CAPTCHA on every interaction. There are no false negatives—if an IP is blacklisted, you've decided to trust that decision completely.

Best Used For

Persistent blocking of repeat offenders. IPs stay blacklisted until explicitly removed by your security team. Fast, decisive, no nuance.

Rule 3: Request Spike Above Historical Average

Trigger Condition

The service receives more than 2x the average request volume for this hour in the current hour bucket.

Historical average for 2 PM:  10,000 requests/hour
Current 2 PM bucket (so far):  20,001+ requests
Result: CAPTCHA triggers for all traffic

What This Detects

Sudden traffic spikes that deviate sharply from your baseline. This catches DDoS attacks from many IPs simultaneously, or coordinated attacks that deliberately distribute load to bypass single-source rate limits.

User Impact

Legitimate traffic spikes also trigger this rule. A viral post, breaking news driving users to your forms, or a successful product launch will all look suspicious. When this rule fires, all users see CAPTCHA. This is your nuclear option—powerful but expensive.

Best Used For

Use when Rules 1 and 2 aren't catching sustained abuse. The 2-week historical baseline prevents false positives from gradual growth. Coordinate with product before major campaigns that might spike traffic.

Rule 4: Identical Payload Repeated Rapidly

Trigger Condition

The same request body is submitted more than 5 times in less than 30 seconds.

What Counts as "Same"

Identical form field values. Not the same form type, but the exact same text, selections, and file uploads. For example: a user submits "feedback: 'Great product'" five times in 10 seconds.

What This Detects

Automated form submission tools, spam campaigns, and copy-paste attacks. Simple, obvious automation that doesn't require external intelligence like IP blacklists or historical baselines.

User Impact

Low impact overall. Duplicate submissions are usually accidental (user hit submit twice). CAPTCHA prevents the duplicate while users understand why it's happening. The 30-second window is tight enough that legitimate workflows almost never hit it.

Best Used For

Lightweight defense against obvious automation without requiring complex infrastructure. An attacker can bypass this by varying payloads even slightly, which is intentional—it prevents legitimate bulk feedback from false-positiving.

Rule 5: Manual Admin Override

Trigger Condition

A security administrator manually enables CAPTCHA via the admin panel for specific requests.

What This Detects

Active threats identified by humans. Suspected data breaches, coordinated spam campaigns, or edge cases that automated rules aren't catching. This is where human judgment intervenes.

User Impact

Highest impact. Affects all requests matching the admin's filter criteria. Use only when other rules fail or during active security incidents. Always set a time limit to avoid accidental persistent lockouts.

Best Used For

The human override. When you have context that automated rules don't. Always disable it once the threat passes.

Real-world examples:

During a security incident, admin enables CAPTCHA for all requests from a specific geographic region
A specific endpoint is under attack; admin enables CAPTCHA only for that endpoint
A marketing campaign sends bulk requests; admin temporarily enables CAPTCHA to verify legitimacy

How the Rules Work Together

The five rules form a layered defense system:

Rule	Speed	Precision	Primary Purpose
Rule 1: Rate Limit	Instant (real-time counting)	High	Catch obvious attackers
Rule 2: Blacklist	Instant (lookup)	Very high	Block known repeat offenders
Rule 3: Spike Detection	Delayed (baseline lookup)	Medium	Defend against distributed attacks
Rule 4: Payload Dedup	Instant	High	Catch copy-paste automation
Rule 5: Admin Override	Instant	Manual (human judgment)	Handle edge cases and novel threats

Typical attack progression: Most attacks trigger Rule 1 within seconds. Repeat offenders get added to Rule 2 after analysis. Distributed attacks that bypass Rule 1 are caught by Rule 3 or Rule 4. Novel threats or edge cases are handled by Rule 5.

Configuration & Thresholds

Current CAPTCHA rule settings:

rules:
  rate_limit:
    requests: 500
    time_window_minutes: 20
  
  spike_detection:
    threshold_multiplier: 2.0
    baseline_period_days: 14
    bucket_granularity: hourly
  
  payload_dedup:
    max_occurrences: 5
    time_window_seconds: 30

Monitoring & Alerts

Metrics to track:

Number of CAPTCHA challenges per rule per hour
CAPTCHA completion rate (users who solve vs. abandon)
Conversion rate impact during CAPTCHA periods
False positive rate (legitimate users blocked)

Set alerts for:

Any single rule triggering >100x/hour (indicates misconfiguration)
Completion rate dropping below 70% (indicates UX problem)
Rule 3 (spike detection) staying active >1 hour (indicates ongoing attack)

Troubleshooting

Users Report Unexpected CAPTCHA

Check which rule triggered:

Look up the user's request in logs using IP + timestamp
Identify which rule caused the CAPTCHA
Rule 1: User's IP hit 500 requests in 20 min. Likely a corporate network or VPN. Consider raising the threshold or whitelisting the IP.
Rule 2: User's IP is blacklisted. Review the blacklist entry; remove it if the IP was compromised or is now legitimate.
Rule 3: Service experienced a traffic spike. Check if this was a legitimate campaign or surge. Adjust the threshold if needed.
Rule 4: User submitted identical payload 5+ times in 30 sec. Likely an accidental double-click. Expected behavior.

CAPTCHA Not Triggering for Known Attack

Possible causes and solutions:

Attack uses many IPs: Rules 1 & 2 won't catch it. Check if Rule 3 threshold is too high for your traffic baseline.
Attack uses varying payloads: Rule 4 won't catch it. Rules 1 & 2 must apply instead.
Attack looks like legitimate traffic: Use Rule 5 (manual override) or manually add the IP to the blacklist.

Questions or Issues?

Reach out to the appropriate team:

Implementation questions: Forms Service team #forms-service on Slack
Security concerns: Security team #security on Slack
On-call escalation: Use PagerDuty

CAPTCHA Protection Rules

Overview

Why CAPTCHA Matters

The Five Rules Explained

Rule 1: High Request Volume from Single IP

Rule 2: Blacklisted IP Address

Rule 3: Request Spike Above Historical Average

Rule 4: Identical Payload Repeated Rapidly

Rule 5: Manual Admin Override

How the Rules Work Together

Configuration & Thresholds

Monitoring & Alerts

Troubleshooting

Users Report Unexpected CAPTCHA

CAPTCHA Not Triggering for Known Attack

Related Documentation

Questions or Issues?