
429 Too Many Requests: Rate Limiting or API Under Attack?

A sudden surge of 429 Too Many Requests errors can be confusing. Is your API simply enforcing rate limits as traffic grows, or is it the first sign of an attack trying to overwhelm authentication, scrape data, or probe your application?

The challenge is that both scenarios often look identical at first. Performance dashboards may appear normal, infrastructure may stay healthy, yet users begin facing failures. Teams frequently waste time tuning limits or scaling resources before understanding what is causing the spike.

If you do not have time for deep investigation, start with a 60-second quick check. A few focused signals, such as who is sending requests, where throttling occurs, and how traffic behaves, can quickly reveal whether you are seeing legitimate demand or automated abuse.

This guide helps you interpret 429 responses correctly, recognize early attack patterns, and decide when rate limiting is working as intended versus when your API may already be under pressure.

60-Second Triage: Quick Attack Check

Treat the spike as suspicious if you see several of these patterns together:

  • Rapid IP rotation: Requests originate from many IP addresses within a short period.
  • Anonymous or invalid identities: Requests lack valid session tokens, API keys, or authenticated users.
  • Targeting high-value endpoints: 429 errors concentrate on login, token generation, search, or authentication APIs.
  • Concurrent authentication errors: 401 or 403 responses increase alongside 429 errors.
  • Mechanically timed traffic: Requests arrive at highly consistent intervals, indicating scripted behavior.
  • Threshold probing: Requests repeatedly exceed rate limits by small margins, suggesting automated tuning to bypass controls.

If several of these signs appear together, treat the spike as potential abuse and investigate before loosening rate limits. Increasing limits prematurely can weaken the protection already slowing the attack.
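The quick check above can be sketched as a small scoring helper. This is an illustrative Python sketch, not a detection product: the field names in `window` are assumptions about what your metrics pipeline exposes, and the thresholds are examples to tune against your own baselines.

```python
# Hypothetical triage helper: counts how many of the 60-second quick-check
# signals fire on a summary of the last few minutes of traffic.
def triage_429_spike(window):
    signals = {
        # Many distinct IPs relative to total requests suggests rotation.
        "rapid_ip_rotation": window["unique_ips"] / max(window["requests"], 1) > 0.5,
        # Mostly unauthenticated callers suggests anonymous automation.
        "anonymous_clients": window["unauthenticated_ratio"] > 0.8,
        # 429s concentrated on login/token/search endpoints.
        "auth_endpoint_focus": window["share_of_429s_on_auth"] > 0.7,
        # 401/403s rising alongside 429s.
        "auth_errors_rising": window["401_403_delta"] > 0,
        # Low coefficient of variation in inter-arrival times = low jitter.
        "mechanical_timing": window["interarrival_cv"] < 0.1,
    }
    fired = [name for name, hit in signals.items() if hit]
    # Several signals together is the trigger, mirroring the checklist above.
    return {"suspicious": len(fired) >= 3, "signals": fired}

window = {"unique_ips": 900, "requests": 1000, "unauthenticated_ratio": 0.95,
          "share_of_429s_on_auth": 0.9, "401_403_delta": 40, "interarrival_cv": 0.04}
verdict = triage_429_spike(window)
```

A window like this one, where nearly every request comes from a fresh, unauthenticated IP hammering auth endpoints at machine-regular intervals, fires all five signals and should be treated as potential abuse.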

Rapid Diagnostic Matrix and Mitigation for HTTP 429 Errors

Signal: High concentration on auth endpoints
Investigate: Failed vs successful login ratio, credential reuse patterns, IP/device fingerprint consistency
Likely cause: Credential stuffing / password spraying
Mitigation: Enforce MFA step-up, credential stuffing detection (velocity + combo reuse), device fingerprinting, IP reputation blocking

Signal: Low inter-arrival variance (uniform timing / low jitter)
Investigate: Request timing distribution at millisecond level, repeatable intervals across sessions
Likely cause: Scripted automation / bots
Mitigation: Deploy adaptive challenge-response (JS/CAPTCHA), behavioral scoring, bot fingerprinting (TLS/JA3/JARM)

Signal: Spike in GET requests with low parameter diversity
Investigate: Query entropy vs baseline, repeated endpoint access patterns, cache hit/miss ratio
Likely cause: Automated scraping / data harvesting
Mitigation: Rate-limit per session/token, detect rotating proxies, enforce behavioral thresholds, apply response obfuscation if needed

Signal: 429s correlated with 401/403 responses
Investigate: Error message patterns ("invalid token", "access denied"), retry logic, token lifecycle anomalies
Likely cause: Token abuse / account takeover attempts
Mitigation: Apply progressive delays (tarpitting), token binding to device/IP, stricter validation, anomaly-based session invalidation

Signal: High IP-to-session ratio (many IPs, few sessions)
Investigate: ASN distribution, geo spread, IP rotation frequency, session reuse
Likely cause: Distributed Layer 7 DDoS
Mitigation: ASN-based rate limiting, selective geo-fencing, reputation-based throttling, upstream filtering

Signal: Requests to deprecated or undocumented endpoints
Investigate: Access logs for legacy APIs, endpoint discovery patterns, sequential probing behavior
Likely cause: API enumeration / reconnaissance
Mitigation: Hard-block deprecated routes at the WAF, enforce schema validation, remove unused endpoints (do not just hide them)

Signal: 429s on authenticated / valid users
Investigate: Usage vs contract quotas, historical consumption patterns, sudden legitimate spikes
Likely cause: Legitimate quota exhaustion (not an attack)
Mitigation: Adjust rate limits dynamically, notify the user, implement burst-tolerant throttling (grace windows)

Signal: Sudden spike from a single ASN or cloud provider
Investigate: Traffic origin clustering (AWS/GCP/Azure), instance churn, request patterns per node
Likely cause: Bot farms / cloud-based attack infrastructure
Mitigation: Apply ASN-level throttling, cloud provider heuristics, stricter challenges for flagged infrastructure

Signal: High retry rate immediately after 429 responses
Investigate: Client retry behavior, backoff compliance, SDK vs bot patterns
Likely cause: Misbehaving clients or aggressive bots
Mitigation: Enforce exponential backoff, introduce server-side cooldowns, temporarily block non-compliant clients

Repeated 429 Too Many Requests? Bring Indusface SOC into the Investigation.

If your API is returning sustained 429 Too Many Requests responses, you do not have to investigate it alone. Once you reach out through the Under Attack page, Indusface security engineers get on a call with your team to:

  • Validate what is happening using live traffic and rate-limit signals
  • Determine whether the spike reflects legitimate usage, automation, scraping, or credential abuse
  • Identify which endpoints, identities, or integrations are hitting enforcement first
  • Apply targeted mitigations such as adaptive rate controls, bot verification, and edge protections
  • Stay engaged while the incident is active, so policy changes are based on real signals, not guesswork

Get live help now.

Next: Use the confirmation workflow below to separate normal rate limiting from coordinated automation and decide what to throttle, challenge, or block first.

What a 429 Response Actually Signals

A 429 Too Many Requests response means a configured rate limit has been exceeded and enforcement has occurred. The system is deliberately slowing down or rejecting requests to protect stability. It is not a crash, and it is not proof of compromise. It is a control working as designed.

Rate limiting exists to maintain availability during traffic bursts, restrict repeated authentication attempts, ensure fair usage across tenants, and protect resource-heavy endpoints such as search or reporting APIs. In short, it keeps one client’s behavior from degrading the experience for everyone else.
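The enforcement mechanism behind most 429s is some variant of a token bucket: each client gets a burst allowance that refills at a sustained rate, and a request that finds the bucket empty is rejected. A minimal Python sketch (illustrative only; real gateways add per-identity keys, distributed state, and Retry-After headers):

```python
import time

class TokenBucket:
    """Minimal token-bucket limiter. allow() returning False is the point
    where an API gateway would answer 429 Too Many Requests."""

    def __init__(self, capacity, refill_per_sec):
        self.capacity = capacity       # maximum burst size
        self.refill = refill_per_sec   # sustained request rate
        self.tokens = float(capacity)
        self.last = None               # timestamp of the previous call

    def allow(self, now=None):
        now = time.monotonic() if now is None else now
        if self.last is not None:
            # Refill tokens for the elapsed time, capped at capacity.
            self.tokens = min(self.capacity,
                              self.tokens + (now - self.last) * self.refill)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False                   # caller should emit a 429

# 5-token burst, 1 token/sec refill: six instantaneous requests,
# so the sixth exhausts the bucket and would be throttled.
bucket = TokenBucket(capacity=5, refill_per_sec=1)
results = [bucket.allow(now=100.0) for _ in range(6)]
```

Note that nothing in this logic inspects intent: a legitimate batch job and a credential-stuffing script empty the bucket the same way, which is exactly why the status code alone cannot tell you which one you are facing.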

The complexity arises because both legitimate and malicious traffic can trigger the same threshold. A high-usage integration can exceed its quota just as easily as an automated script testing login endpoints. The status code confirms enforcement, but it does not reveal intent.

To interpret a 429 spike correctly, you have to look beyond volume and evaluate traffic behavior, identity patterns, and endpoint concentration. The signal is clear. The reason behind it requires investigation.

Usage Pressure or Active Abuse: How to Diagnose a 429 Spike

Start with these five signals. Each one narrows the picture. Together they tell you whether you are looking at usage pressure or active abuse.

1. Who is sending the requests?

Pull the identity behind the traffic. Check whether requests are coming from stable, known sources or whether something is rotating.

If your logs look like this:

POST /api/sync  API-Key: svc-account-prod-01  Status: 200
POST /api/sync  API-Key: svc-account-prod-01  Status: 200
POST /api/sync  API-Key: svc-account-prod-01  Status: 429

The same identity, authenticating successfully, hitting a limit. That is a consumption problem. The integration has outgrown its quota, not evaded it.

If your logs look like this:

POST /login  IP: 185.21.10.2
POST /login  IP: 91.204.18.44
POST /login  IP: 45.83.200.91

Each source sends a handful of requests before switching. No single identity crosses the threshold. The aggregate effect is sustained pressure that enforcement never fully catches because it is distributed across identities deliberately. That is evasion.

Stable identity signals usage pressure. Rotating identity signals deliberate circumvention.
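The identity check can be reduced to two numbers: how many distinct identities appear, and what share of traffic the busiest one sends. A hedged Python sketch, assuming your log entries expose an `identity` field (IP, API key, or user):

```python
from collections import Counter

def identity_profile(log_entries):
    """Summarize requests per identity. Many identities each sending a
    handful of requests suggests rotation; one identity sending most of
    the traffic suggests quota pressure."""
    counts = Counter(entry["identity"] for entry in log_entries)
    top_share = counts.most_common(1)[0][1] / len(log_entries)
    return {"identities": len(counts), "top_identity_share": round(top_share, 2)}

# Rotating sources: 30 IPs, 3 requests each, like the /login excerpt above.
rotating = [{"identity": f"ip-{i}"} for i in range(30) for _ in range(3)]
# Stable integration: one service account sending 90 requests.
stable = [{"identity": "svc-account-prod-01"} for _ in range(90)]

rotating_profile = identity_profile(rotating)
stable_profile = identity_profile(stable)
```

Ninety requests from thirty identities at three requests apiece never trips a per-identity limit, which is precisely the evasion pattern described above; ninety requests from one key is the consumption pattern.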

2. How did the traffic arrive?

Pull the request timeline and look at the shape of the spike.

If your traffic graph looks like this:

08:00  →  120 req/min
08:30  →  240 req/min
09:00  →  480 req/min  [429s begin]
09:30  →  460 req/min  [enforcement holding]

Volume rises gradually before enforcement, following a curve similar to previous peak periods, indicating organic demand hitting a configured threshold. Morning traffic, scheduled jobs, and user activity typically follow this pattern.

If the graph looks like this:

08:00  →  15 req/min
08:01  →  950 req/min  [429s begin immediately]
08:02  →  940 req/min

Full intensity from the first second with no ramp-up. There is no human behavior driving it. Automated scripts start at maximum throughput because there is nothing organic producing the load. Gradual growth signals demand. Instant saturation signals automation.
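The shape test can be automated by looking at minute-over-minute growth. This is a rough Python heuristic, not a tuned classifier; the 10x jump threshold is an assumption you would calibrate against your own traffic:

```python
def spike_shape(req_per_min):
    """Classify a per-minute request series: gradual ramp (organic demand)
    vs instant saturation (scripted load). Thresholds are illustrative."""
    peak = max(req_per_min)
    # Ratio of each minute to the previous one.
    jumps = [b / max(a, 1) for a, b in zip(req_per_min, req_per_min[1:])]
    # A >10x jump that lands straight at the peak, with no intermediate
    # steps, matches the "full intensity from the first second" pattern.
    if max(jumps) > 10 and req_per_min.index(peak) <= 1:
        return "instant-saturation"
    return "gradual-ramp"

organic = [120, 240, 480, 460]   # doubling curve from the example above
scripted = [15, 950, 940]        # full throughput within one minute
```

The organic series doubles each half hour before enforcement holds it; the scripted series jumps 63x in a single minute, which no human-driven workload produces.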

3. Which endpoints are being hit?

Look at where the traffic is going, not just how much of it there is. If your logs show requests spreading across endpoints in sequences that reflect how your application actually works:

POST /auth/token
GET  /api/user/profile
GET  /api/orders?page=1
GET  /api/orders?page=2
POST /api/orders/checkout

That is a workflow under load. High volume distributed this way points toward legitimate usage, with users moving through your application the way it was designed to be used.

If your logs show this:

GET /api/search?q=shoes
GET /api/search?q=jackets
GET /api/search?q=laptops
GET /api/search?q=watches

One endpoint, constant variation, everything else quiet. The goal is not to use your application. It is to extract something from it.

Also cross-reference the target URLs against your API documentation. If 429s are clustering on routes that do not appear in your current spec, for example, paths like /dev/test-auth, /internal/v1/debug, or deprecated versions like /v1/ when your application runs on /v4/, you are not looking at a quota breach. You are looking at endpoints that were never meant to be reachable, with weaker controls, being deliberately targeted.

Distributed endpoint usage signals legitimate workflows. Concentration on a single path, or on undocumented routes, signals targeting.
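Concentration is easy to quantify: compute the share of traffic landing on the single busiest path. A minimal Python sketch over a list of request paths:

```python
from collections import Counter

def endpoint_concentration(paths):
    """Return the busiest path and its share of total traffic. A high share
    with an otherwise quiet surface points at targeting, not workflows."""
    counts = Counter(paths)
    path, hits = counts.most_common(1)[0]
    return path, round(hits / len(paths), 2)

# A workflow spreads across auth, profile, listing, and checkout.
workflow = ["/auth/token", "/api/user/profile", "/api/orders", "/api/orders",
            "/api/orders/checkout"]
# Scraping hammers one endpoint while everything else stays quiet.
scrape = ["/api/search"] * 48 + ["/api/user/profile", "/api/orders"]

top_w = endpoint_concentration(workflow)
top_s = endpoint_concentration(scrape)
```

In the workflow sample no path exceeds 40% of traffic; in the scraping sample a single endpoint absorbs 96%, matching the search-extraction pattern above.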

4. What does the timing look like?

Do not just look at request volume, analyze how those requests arrive over time. Timing patterns often expose intent more clearly than raw traffic counts.

Consider a typical log sequence like this:

14:02:01.243  GET /api/dashboard
14:02:03.887  GET /api/dashboard
14:02:05.124  GET /api/dashboard
14:02:09.302  GET /api/dashboard

The intervals are uneven. There are small delays, inconsistent gaps, and no fixed rhythm. This is what real traffic looks like. Variability comes from multiple factors such as network latency, user interaction speed, browser behavior, and backend response times. Even during high loads, legitimate traffic tends to retain this natural jitter. When a spike follows this pattern, it usually indicates genuine demand interacting with system limits.

If your logs show this:

12:01:00.100  GET /api/products
12:01:00.200  GET /api/products
12:01:00.300  GET /api/products
12:01:00.400  GET /api/products

Here, the requests arrive at perfectly consistent intervals with no deviation. This level of precision does not occur in real user behavior. It reflects automated execution scripts, bots, or tooling designed to generate predictable request rates. The absence of timing variability is the signal. Real systems are noisy; automation is not.

Another pattern to watch for is limit probing, where the client is not just sending traffic but actively learning your enforcement boundaries. For example:

8 req/sec   →  no throttling
9 req/sec   →  no throttling
10 req/sec  →  429 triggered
9 req/sec   →  stabilizes just below the limit

This behavior is deliberate. The client increases request rates incrementally until it triggers a rate limit, then backs off slightly and maintains a steady flow just under the threshold. This is calibration. The goal is to maximize throughput without triggering defenses, effectively mapping your rate-limiting logic in real time.

The distinction is critical: legitimate traffic reacts to application needs, while automated traffic reacts to your defenses.

In practice, timing analysis gives you a more reliable signal than volume alone. High traffic with irregular timing often reflects real usage patterns under load. In contrast, mechanically precise intervals or adaptive threshold behavior point to automation, even when the request rate appears modest.

Irregular timing indicates organic conditions. Mechanical precision, and especially adaptive precision, indicates intent.
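The jitter test above can be expressed as the coefficient of variation (standard deviation divided by mean) of inter-arrival gaps. A minimal Python illustration using the two log excerpts from this step; the interpretation cutoffs are assumptions to tune, not fixed rules:

```python
import statistics

def interarrival_cv(timestamps):
    """Coefficient of variation of gaps between consecutive requests.
    Human-driven traffic is jittery (CV well above zero); scripted traffic
    is near-uniform (CV close to zero)."""
    gaps = [b - a for a, b in zip(timestamps, timestamps[1:])]
    return statistics.stdev(gaps) / statistics.mean(gaps)

# Seconds within the minute, taken from the dashboard log excerpt above:
human = [1.243, 3.887, 5.124, 9.302]   # uneven gaps, no fixed rhythm
bot = [0.100, 0.200, 0.300, 0.400]     # metronomic 100 ms spacing

human_cv = interarrival_cv(human)      # clearly above zero
bot_cv = interarrival_cv(bot)          # effectively zero
```

The human sequence yields a CV around 0.5; the scripted sequence yields a CV indistinguishable from zero. The absence of variance, not the volume, is what gives the automation away.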

5. How did the client respond to enforcement?

This is often the clearest signal of intent. If your monitoring shows this:

09:14:00  →  429 returned
09:14:05  →  request rate drops by 60%
09:14:30  →  gradual resume with increasing spacing
09:15:00  →  traffic stabilizes within limit

The client read the 429 as feedback and adjusted. Exponential backoff, spacing between retries, self-correcting behavior. That points toward a legitimate integration that has outgrown its quota.

If your monitoring shows this:

429 returned  →  immediate retry, no delay
429 returned  →  immediate retry, no delay
429 returned  →  source rotates to new IP, requests resume at full rate

No backoff. No reduction. In some cases traffic accelerates after hitting limits. That is not a client adapting to enforcement. That is automation actively resisting it. Raising rate limits in this scenario does not fix the problem. It removes the control already containing it.

Traffic that slows after enforcement signals cooperation. Traffic that continues or accelerates signals active resistance.
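Backoff compliance can be checked mechanically by looking at the gaps between retries that follow a 429. This is a simplified Python sketch; the event tuples are an assumed representation of your monitoring data, not a real SDK:

```python
def backoff_compliance(events):
    """events: list of (seconds_waited_before_retry, was_retry_after_429).
    Clients whose retry delays grow after each 429 are cooperating with
    enforcement; zero-delay retries signal automation resisting it."""
    delays = [gap for gap, after_429 in events if after_429]
    if not delays:
        return "no-retries"
    # Non-decreasing delays starting at one second or more look like backoff.
    if all(later >= earlier for earlier, later in zip(delays, delays[1:])) \
            and delays[0] >= 1:
        return "backing-off"
    return "non-compliant"

cooperative = [(5, True), (25, True), (30, True)]   # widening spacing
abusive = [(0, True), (0, True), (0, True)]         # immediate retries
```

A cooperative client spaces its retries further apart each time, matching the first monitoring excerpt; an abusive one retries instantly at full rate, matching the second.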

Making the Call

If most signals point toward stable identity, gradual growth, distributed endpoints, and retry compliance, check whether the spike aligns with something happening in your product. A launch, a campaign, a billing cycle, or seasonal demand that matches the timing confirms legitimate load under pressure. Enforcement is working as a stability control, not surfacing a threat.

If signals point toward identity churn, endpoint concentration, uniform timing, limit probing, or retry abuse, especially in combination, you are looking at coordinated automation. The response is not to adjust thresholds. It is to investigate the source, harden the targeted endpoint, and determine whether the traffic represents credential stuffing, scraping, or reconnaissance ahead of a larger attempt.

If credential abuse is suspected, use a structured approach to confirm it: patterns like login failure ratios, password reuse attempts, and IP rotation provide clear indicators (see the credential stuffing diagnosis guide).

A Note on Shadow and Zombie APIs

Before adjusting any threshold, cross-reference the URLs triggering 429s against your current API documentation. If enforcement is clustering on routes that do not appear in your spec, the problem is exposure.

Shadow APIs are endpoints that were created for internal testing, quick deployments, or developer convenience and never went through formal security review. They exist outside standard monitoring, carry permissive rate limits, and are often completely forgotten.

If your logs show 429s on paths like:

POST /dev/test-auth
GET  /internal/v1/debug
POST /admin/bypass-check

An attacker has already discovered these routes. The 429 is proof someone is actively probing an endpoint that should not be reachable at all. The weaker controls there make it worth their time.

Compare that to 429s on documented, expected paths:

POST /api/v4/auth/token
GET  /api/v4/users/profile

Same enforcement firing, completely different meaning. One is a known surface being protected. The other is an unmanaged surface being discovered.

Zombie APIs are deprecated versions left active after the application moved on: /v1/ or /v2/ routes still responding when your application runs on /v4/. They were kept alive for legacy support and never received updated rate-limiting logic or security patches.

If your logs show:

POST /api/v1/login
GET  /api/v1/users?id=1
GET  /api/v1/users?id=2
GET  /api/v1/users?id=3

That is enumeration on a deprecated route. Attackers target old versioned paths specifically because they know controls there are likely outdated. A sequential ID pattern on a /v1/ endpoint while your application runs on /v4/ is not a quota issue. It is a signal that someone is scanning for vulnerabilities in code that stopped receiving attention a long time ago.

When 429s appear on undocumented or deprecated routes, do not adjust the threshold. Investigate why the endpoint is reachable, what controls are in place, and whether it needs to be decommissioned entirely.
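The cross-reference itself is a simple set operation. A hedged Python sketch, where the `DOCUMENTED` set is a stand-in for your real API spec (for example, the paths extracted from an OpenAPI document):

```python
# Stand-in for the routes your current spec actually documents.
DOCUMENTED = {"/api/v4/auth/token", "/api/v4/users/profile", "/api/v4/orders"}

def classify_429_paths(paths):
    """Split throttled paths into known surface vs unmanaged surface.
    429s on paths outside the spec are an exposure problem (shadow or
    zombie API), not a quota problem."""
    seen = set(paths)
    return {
        "documented": sorted(p for p in seen if p in DOCUMENTED),
        "undocumented": sorted(p for p in seen if p not in DOCUMENTED),
    }

hits = ["/api/v4/auth/token", "/api/v1/login", "/internal/v1/debug",
        "/api/v4/auth/token"]
report = classify_429_paths(hits)
```

Any path in the `undocumented` bucket should route to an exposure investigation, regardless of how few requests it received; the 429 count on a documented path is the only one a threshold discussion applies to.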

Behavioral Evaluation: The Deciding Layer

Determining whether the traffic represents legitimate demand or automated abuse requires analyzing how requests behave across identities, endpoints, timing patterns, and retry behavior.

Review the following questions to determine whether the spike reflects normal demand, system behavior, or potential abuse.

Area: Operational Activity
Ask: Did a deployment, configuration update, or release occur around the same time as the spike?
Evidence: Deployment logs, CI/CD pipelines, change management records
May indicate: A new feature or configuration increased API demand

Area: Scheduled Jobs and Automation
Ask: Are background jobs, scheduled scripts, or data synchronization tasks running at this time?
Evidence: Cron jobs, scheduler logs, batch processing systems
May indicate: Automated processes generating bursts of requests

Area: Infrastructure Telemetry
Ask: Did system resources change during the spike?
Evidence: CPU usage, database queries, cache miss rates, memory utilization
May indicate: Heavy queries or backend load triggering rate limiting

Area: Application Errors
Ask: Are other error responses increasing alongside 429 responses?
Evidence: 401, 403, validation errors, exception logs
May indicate: Authentication testing or misconfigured requests

Area: Integration Behavior
Ask: Did partner integrations or internal services change request patterns recently?
Evidence: API gateway logs, partner activity dashboards
May indicate: Integration misconfiguration or scaling changes

Area: Business Events
Ask: Did a marketing campaign, product launch, or seasonal activity begin?
Evidence: Campaign schedules, traffic analytics, customer activity reports
May indicate: A legitimate surge in user demand

Area: Geographic Distribution
Ask: Is traffic arriving from regions where your user base normally operates?
Evidence: IP geolocation data, traffic analytics
May indicate: Unexpected regions that warrant investigation

Area: User-Agent Diversity
Ask: Do requests reflect normal client diversity?
Evidence: Browser identifiers, mobile app signatures
May indicate: Unusual or repetitive clients suggesting automation

Area: Traffic Persistence
Ask: How long does the spike persist?
Evidence: Monitoring dashboards over time
May indicate: Short bursts reflect demand; prolonged spikes require deeper analysis

Seeing Suspicious 429 Spikes Right Now? Get Live Help.

If your API is repeatedly returning 429 Too Many Requests and the pattern suggests automation or abuse, you do not have to investigate it alone. Once you reach out through the Under Attack page, Indusface security engineers join a live call to analyze real-time traffic signals, confirm whether the spike reflects legitimate rate limiting or bot-driven activity, and apply targeted mitigations at the edge to contain the abuse while keeping legitimate users unaffected.

Facing sustained 429 spikes? Get live help now.

Indusface

Indusface is a leading application security SaaS company that secures critical Web, Mobile, and API applications of 5000+ global customers using its award-winning fully managed platform that integrates web application scanner, web application firewall, DDoS & BOT Mitigation, CDN, and threat intelligence engine.

Frequently Asked Questions (FAQs)

What does a 429 Too Many Requests response indicate in an API?

A 429 response indicates that a client (such as an IP address, user, or API token) has exceeded a defined request limit within a given time window. It reflects rate limit enforcement, but does not indicate whether the traffic is legitimate or malicious.

Is a spike in 429 responses always a sign of an attack?

No. A 429 spike can result from legitimate usage growth, partner integrations, or traffic bursts. It becomes suspicious only when combined with abnormal patterns such as identity churn, endpoint concentration, or consistent request timing.

How do I distinguish between normal rate limiting and API abuse?

The key is behavioral analysis. Legitimate traffic shows stable identities, distributed endpoint usage, and adaptive retries. Attack traffic shows patterns like rotating IPs, repeated targeting of specific endpoints, and ignoring retry signals.

What is limit probing in API attacks?

Limit probing is when attackers gradually adjust request rates to discover the maximum allowed throughput without triggering 429 responses. This allows them to operate continuously while avoiding detection.

Can attackers bypass rate limiting?

Yes. Attackers often bypass rate limits using techniques such as:

  • rotating IP addresses or proxy pools
  • changing user agents
  • distributing requests across multiple identities
  • probing limits to stay just below thresholds

Should I increase rate limits when I see repeated 429 responses?

Not immediately. If the spike is caused by automation, increasing limits can reduce protection and allow more abuse. Always confirm whether the traffic is legitimate before adjusting thresholds.

What is the best way to respond to sustained 429 spikes?

The most effective approach is granular mitigation, not blanket blocking. This includes:

  • behavior-based rate limiting
  • bot detection and verification
  • progressive traffic containment
  • continuous monitoring and tuning

This ensures abusive traffic is controlled without impacting legitimate users.
