Discover ANY AI to make more online for less.

select between over 22,900 AI Tool and 17,900 AI News Posts.


Perplexity is allegedly scraping websites it's not supposed to, again
Perplexity is allegedly scraping websites it's not supposed to, again

Web crawlers deployed by Perplexity to scrape websites are allegedly skirting restrictions, according to a new report from Cloudflare. Specifically, the report claims that the company's bots appear to be "stealth crawling" sites by disguising their identity to get around robots.txt files and firewalls.
Robots.txt is a simple file websites host that lets web crawlers know if they can scrape a websites' content or not. Perplexity's official web crawling bots are "PerplexityBot" and "Perplexity-User." In Cloudflare's tests, Perplexity was still able to display the content of a new, unindexed website, even when those specific bots were blocked by robots.txt. The behavior extended to websites with specific Web Application Firewall (WAF) rules that restricted web crawlers, as well.
Cloudflare
Cloudflare believes that Perplexity is getting around those obstacles by using "a generic browser intended to impersonate Google Chrome on macOS" when robots.txt prohibits its normal bots. In Cloudlfare's tests, the company's undeclared crawler could also rotate through IP addresses not listed in Perplexity's official IP range to get through firewalls. Cloudflare says that Perplexity appears to be doing the same thing with autonomous system numbers (ASNs) — an identifier for IP addresses operated by the same business — writing that it spotted the crawler switching ASNs "across tens of thousands of domains and millions of requests per day."
Engadget has reached out to Perplexity for comment on Cloudflare's report. We'll update this article if we hear back.
Up-to-date information from websites is vital to companies training AI models, especially as service's like Perplexity are used as replacements for search engines. Perplexity has also been caught in the past circumventing the rules to stay up-to-date. Multiple websites reported in 2024 that Perplexity was still accessing their content despite them forbidding it in robots.txt — something the company blamed on the third-party web crawlers it was using at the time. Perplexity later partnered with multiple publishers to share revenue earned from ads displayed alongside their content, seemingly as a make-good for its past behavior.
Stopping companies from scraping content from the web will likely remain a game of whack-a-mole. In the meantime, Cloudflare has removed Perplexity's bots from its list of verified bots and implemented a way to identify and block Perplexity's stealth crawler from accessing its customers' content.This article originally appeared on Engadget at https://www.engadget.com/ai/perplexity-is-allegedly-scraping-websites-its-not-supposed-to-again-211110756.html?src=rss

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

Perplexity sued by Japanese media giants for stealing information and presenting false information
Perplexity sued by Japanese media giants for stealing information and prese

<p>Another day, another instance of AI companies purportedly engaging in copyright infringement. Two Japanese media groups, <em>Nikkei</em> and the <em>Asahi Shimbun</em> [...]

Match Score: 155.49

Trump's Truth Social launches AI search powered by Perplexity
Trump's Truth Social launches AI search powered by Perplexity

<p>Truth Social, President Trump&#39;s social media platform, is <a data-i13n="cpos:1;pos:1" href="https://ir.tmtgcorp.com/news-events/press-releases/#b2iLibScrollTo"& [...]

Match Score: 115.92

Perplexity has cooked up a new way to pay publishers for their content
Perplexity has cooked up a new way to pay publishers for their content

<p style="text-align:left;"><span style="color:rgb(0, 0, 0);font-family:Arial, sans-serif;">Perplexity is launching a new revenue-sharing plan for publishers that will [...]

Match Score: 110.77

Perplexity will now show hotel information from TripAdvisor
Perplexity will now show hotel information from TripAdvisor

<p>TripAdvisor has entered into a <a data-i13n="elm:context_link;elmt:doNotAffiliate;cpos:1;pos:1" class="no-affiliate-link" href="https://tripadvisor.mediaroom.com/p [...]

Match Score: 98.28

Perplexity's Comet AI browser is available now for $200 per month
Perplexity's Comet AI browser is available now for $200 per month

<p>Comet, Perplexity's AI-powered web browser, <a data-i13n="elm:context_link;elmt:doNotAffiliate;cpos:1;pos:1" class="no-affiliate-link" href="https://www.perplexity [...]

Match Score: 83.88

Apple executives have held internal discussions about potentially bidding for AI startup Perplexity
Apple executives have held internal discussions about potentially bidding f

<p><img width="1681" height="896" src="https://the-decoder.com/wp-content/uploads/2025/06/apple_perplexity_logo.png" class="attachment-full size-full wp-pos [...]

Match Score: 77.59

Engadget Podcast: iPhone 16e review and Amazon's AI-powered Alexa+
Engadget Podcast: iPhone 16e review and Amazon's AI-powered Alexa+

<p>The keyword for the <a data-i13n="cpos:1;pos:1" href="https://www.engadget.com/mobile/smartphones/iphone-16e-review-whats-your-acceptable-compromise-020016288.html"> [...]

Match Score: 67.31

PayPal and Venmo users get a free year of Perplexity Pro and early access to its AI browser
PayPal and Venmo users get a free year of Perplexity Pro and early access t

<p>Perplexity, the NVIDIA- and Bezos-backed AI company, <a data-i13n="cpos:1;pos:1" href="https://newsroom.paypal-corp.com/2025-09-03-Skip-the-Waitlist-PayPal-and-Venmo-Users-O [...]

Match Score: 67.24

Apple is reportedly considering the acquisition of Perplexity AI
Apple is reportedly considering the acquisition of Perplexity AI

<p>Apple&#39;s executives are thinking of acquiring Perplexity AI both to get more talent and to be able to offer an AI-based search engine in the future, according to <a data-i13n=" [...]

Match Score: 62.72