AI Crawler Allowlist
AI crawler User-Agent allowlist + robots.txt + ai.txt + Cloudflare WAF / nginx policy +…
Forged from real client work, proof attached. Pick a piece or take the whole system.
Browse the full catalog → Browse ready-made kits → Build your own set →Real-time bot classification engine for AI crawler traffic.
A real-time AI bot classification engine that proves whether traffic claiming to be GPTBot, ClaudeBot or ChatGPT-User actually is. It combines four-layer fingerprinting (User-Agent, IP range, ASN cross-check, reverse-DNS) with behavioral anomaly detection and Bayesian reputation scoring, deployed both as a sub-50ms edge classifier and a batch log re-classifier. Because a wrong call means either blocking a legitimate AI bot (lost citations) or burning bandwidth on a spoofer, every threshold is deliberately allow-biased.
Prices include 20% VAT. · Forged on real agency work · one-time, no lock-in
Inside the run · no black box
A User-Agent header is a claim, not an identity. The classifier tests each one against three independent signals before any allow, throttle or block verdict lands:
ai-bot-ua-classifier · core
core active · 6 lines
Verify that ChatGPT-User referral traffic is real and not a UA spoofer
Detect aggressive crawlers burning bandwidth with zero citation value
Close the measurement loop on an existing crawler allowlist policy
Baseline AI bot traffic before a large multilingual site launch
Flag behavioral anomalies like admin-path probing and request bursts
Run monthly re-classification to catch new bots and behavior drift
Drag time forward. Watch what stays.
Forever
That's what owning means.
ai writing tool: subscription
expired · access lostanalytics suite: subscription
expired · access lostdesign platform: subscription
expired · access lost(nothing left)
Stop trusting User-Agent claims: confirm bot identity with 4 independent signals
license: perpetualProtect AEO citation revenue by never blocking a legitimate AI crawler
license: perpetualCut wasted bandwidth spend by tiering and rate-limiting low-value bots
license: perpetualGet an auditable, replayable reason for every block, rate-limit or allow decision
license: perpetualsubscriptions expire · deeds don't
Pick a piece up. Watch it work.
Cloudflare Worker edge classifier with KV-cached IP ranges and behavior buckets
6 parts · one working system · ships instantly by email
From the field · a real case
Teams running AEO/GEO programs who need to measure and trust their AI bot traffic instead of guessing from raw User-Agent strings.
then this was forged for you.Universal by design: these run in any AI. Delivered in the open Agent Skills + MCP format (native in Claude); ChatGPT, Gemini, Cursor and Copilot adapt the same files their own way.
It is built to run at the edge so classification happens in the request path, in real time. You deploy it in front of your traffic rather than parsing server logs after the fact.
A UA string alone is exactly what it does not trust, which is why it cross-checks IP range, ASN and reverse-DNS on top of the User-Agent. A real GPTBot has to pass all four layers, not just send the right name.
It classifies and scores traffic with a reputation signal; it tells you what is real. Enforcing a block or allow decision is a separate layer, the job of an allowlist system.
By email right after purchase: ready to run, downloaded instantly, no setup wait.
A one-time purchase; no subscription or hidden fees. VAT (20%) is included.
As a digital product, it can’t be refunded once downloaded. That’s why we show exactly what’s inside and who it’s for, right here.