Cloudflare Exposes Perplexity AI for Secretly Scraping Websites

Cloudflare has accused Perplexity AI of bypassing website restrictions by masking its crawler identity, raising serious concerns over data ethics in AI development.

Table of Contents

Cloudflare Accuses Perplexity AI of Evading Scraping Restrictions

In a high-stakes showdown over internet ethics, Cloudflare has publicly accused Perplexity AI of engaging in deceptive web scraping practices—explicitly ignoring standard anti-bot protocols set by website owners. The accusations, detailed in a technical report released by Cloudflare, allege that Perplexity deliberately disguised its identity and employed rotating IPs to access content from websites that had clearly opted out of such activity.

The fallout could be significant—not just for Perplexity’s reputation, but for the wider AI industry navigating the thin line between public data usage and unauthorized content harvesting.

How Perplexity Bypassed Restrictions

According to Cloudflare’s findings, Perplexity’s behavior goes far beyond passive crawling. Here’s how the AI startup allegedly maneuvered around safeguards:

Masking Its Identity: When blocked by robots.txt files or firewalls, Perplexity reportedly stopped identifying as “PerplexityBot” and instead cloaked itself as a Chrome browser on macOS, a technique used to fool server-side bot detection.
Rotating IPs & Changing Networks: Cloudflare’s network forensics revealed that Perplexity rotated IP addresses—none of which were listed publicly in its official documentation—and even switched Autonomous System Numbers (ASNs), further obfuscating its origin.
Massive Scale of Activity: This wasn’t an isolated incident. Cloudflare claims it detected this behavior across tens of thousands of domains, with millions of unauthorized requests daily.

To catch the behavior red-handed, Cloudflare set up decoy domains outfitted with standard anti-crawling defenses. The result? Perplexity allegedly pushed past all of them.

Industry Fallout: From Reputation Risks to M&A Consequences

This isn’t Perplexity’s first controversy. The company faced similar allegations in June 2024, where it was accused of scraping paywalled content and ignoring robots.txt directives. At the time, CEO Aravind Srinivas placed the blame on “third-party crawlers.” But Cloudflare’s new evidence suggests Perplexity itself is behind the latest incidents—an escalation that could damage trust among users, partners, and regulators.

Some supposedly “reputable” AI companies act more like North Korean hackers. Time to name, shame, and hard block them. https://t.co/vqMzGRHZPf
— Matthew Prince ???? (@eastdakota) August 4, 2025

In response, Cloudflare CEO Matthew Prince didn’t hold back. In a post on X (formerly Twitter), Prince likened Perplexity’s tactics to “North Korean hackers,” calling for a hard block of AI companies that flout internet norms.

The situation could also impact Perplexity’s rumored acquisition talks with Apple, who had reportedly expressed interest in the startup. With these ethical concerns now front and center, Apple may be forced to reconsider its next move.

The Bigger Picture: AI’s Data Dilemma

The clash between Perplexity and Cloudflare reflects a much larger tension in the AI industry: the hunger for data versus the right to control it.

At the heart of the issue is the robots.txt file—a decades-old web standard that tells automated crawlers which pages they’re allowed to visit. While respected by major search engines and reputable bots, it’s effectively voluntary and easily ignored by bad actors or overly ambitious startups. When AI companies bypass it, they jeopardize relationships with publishers and potentially break laws surrounding unauthorized data use.

Cloudflare’s move to delist Perplexity as a verified bot and roll out new blocking mechanisms is a clear signal: If AI firms won’t play by the rules, infrastructure providers will step in to enforce them.

Commentary: This Isn’t Just About Perplexity

While it’s easy to point fingers at one company, Perplexity is far from alone. Many AI startups are locked in a race for training data, and in the absence of clear legal frameworks, some are willing to test boundaries—or outright cross them.

This episode underscores a growing need for:

Stricter transparency from AI firms about how they collect and use data.
More robust enforcement tools for site owners and platforms like Cloudflare.
Regulatory oversight that defines the legal consequences of scraping opt-out content.

Until these gaps are filled, conflicts like this will only grow more common.

Final Thoughts

Perplexity’s alleged scraping behavior has thrown another log on the fire of AI’s data ethics debate. With Cloudflare raising serious, evidence-backed concerns—and Perplexity downplaying them as a “publicity stunt”—the industry is once again faced with uncomfortable questions about how far AI companies are willing to go in pursuit of content.

Whether you’re a developer, a site owner, or just a user of AI tools, one thing is clear: trust in the AI ecosystem is built on consent, transparency, and respect for boundaries. And right now, that trust is being tested.

Also Read: YouTube’s New Age Check System Estimates Your Age — Even If You Lie About It

McFarlane’s T-60 and NCR Ranger Fallout Figures Are Locked, Loaded, and Ready for Preorder

Cloudflare Calls Out Perplexity AI for Stealth Scraping Tactics Across Thousands of Websites

Cloudflare Accuses Perplexity AI of Evading Scraping Restrictions

How Perplexity Bypassed Restrictions

Industry Fallout: From Reputation Risks to M&A Consequences

The Bigger Picture: AI’s Data Dilemma

Commentary: This Isn’t Just About Perplexity

Final Thoughts

Leave a review

Recent Articles

Master Navigation: Complete Guide to All 28 Map Locations in Hollow Knight: Silksong

Unlock the Mysterious Ornate Clamp: Complete Diving Bell Guide for Hollow Knight: Silksong

Navigate the Treacherous Abyss: Complete Map and Escape Guide for Hollow Knight: Silksong

Unlock the Ultimate Silk Soar Super Jump in Hollow Knight: Silksong

Master the Spider-Like Scuttlebrace Tool in Hollow Knight: Silksong

Related Stories

LEAVE A REPLY Cancel reply