Welcome to DU! The truly grassroots left-of-center political community where regular people, not algorithms, drive the discussions and set the standards. Join the community: Create a free account Support DU (and get rid of ads!): Become a Star Member Latest Breaking News Editorials & Other Articles General Discussion The DU Lounge All Forums Issue Forums Culture Forums Alliance Forums Region Forums Support Forums Help & Search
I`m calling
out YOU:
MAGA repugs,
complicit
MSM,
+ corporate
sycophants
And how many
times can a
man turn his
head, and
pretend that
he just
doesn`t see?

STUPID
is
as
TSF
Does
An island of
Sanity
In a sea of
Insanity


Mirt!
Mirt!
Mirt!
Thank you
for taking
out the
dirt!

AMERICA
LOVE IT
OR
FIX IT

Check out
all the stickies
on Grovelbot's
Big Board!

highplainsdem

(57,856 posts)
Mon Aug 4, 2025, 12:40 PM Aug 4

Perplexity accused of scraping websites that explicitly blocked AI scraping

Source: TechCrunch

AI startup Perplexity is crawling and scraping content from websites that have explicitly indicated they don’t want to be scraped, according to internet infrastructure provider Cloudflare.

On Monday, Cloudflare published research saying it observed the AI startup ignore blocks and hide its crawling and scraping activities. The network infrastructure giant accused Perplexity of obscuring its identity when trying to scrape web pages “in an attempt to circumvent the website’s preferences,” Cloudflare’s researchers wrote.

-snip-

Perplexity appears to be willingly circumventing these blocks by changing its bots “user agent,” meaning a signal that identifies a website visitor by their device and version type; as well as changing their autonomous system networks, or ASN, essentially a number that identifies large networks on the internet, according to Cloudflare.

“This activity was observed across tens of thousands of domains and millions of requests per day. We were able to fingerprint this crawler using a combination of machine learning and network signals,” read Cloudflare’s post.

-snip-

Read more: https://techcrunch.com/2025/08/04/perplexity-accused-of-scraping-websites-that-explicitly-blocked-ai-scraping/



Cloudflare also said Perplexity has been using "a generic browser intended to impersonate Google Chrome on macOS."

Very crooked company. But then, I don't know of any generative AI company that isn't based on theft and deceit.

The AI bots are doing terrible damage to the internet. Including here at DU. As EarlG explained last week

https://www.democraticunderground.com/101316061

the downtime and update then were at least partly about the bot problem, especially AI scrapers.
5 replies = new reply since forum marked as read
Highlight: NoneDon't highlight anything 5 newestHighlight 5 most recent replies
Perplexity accused of scraping websites that explicitly blocked AI scraping (Original Post) highplainsdem Aug 4 OP
Lie and steal Quanto Magnus Aug 4 #1
It doesn't really make any difference customerserviceguy Aug 4 #2
While true, that is not an excuse to never regulate the AI industry. Until something is done, they see this as a blank Karasu Aug 4 #3
Pass laws and regulations customerserviceguy Aug 4 #4
I agree it won't solve the problem. But I also think it's better than ACTIVELY enabling them through inaction, certainly Karasu Aug 4 #5

Quanto Magnus

(1,230 posts)
1. Lie and steal
Mon Aug 4, 2025, 12:48 PM
Aug 4

this is how they all act....

Lie about the product
Steal material from others
Lie about the theft
Lie some more...

customerserviceguy

(25,359 posts)
2. It doesn't really make any difference
Mon Aug 4, 2025, 03:03 PM
Aug 4

what laws or regulations we make, some geeks are always going to try to get around them, any way possible.

Karasu

(1,851 posts)
3. While true, that is not an excuse to never regulate the AI industry. Until something is done, they see this as a blank
Mon Aug 4, 2025, 03:30 PM
Aug 4

check to do whatever the fuck they want, whenever they want, however they want.

It is beyond absurd that something this world-altering has gone completely unchecked for as long as it already has.

customerserviceguy

(25,359 posts)
4. Pass laws and regulations
Mon Aug 4, 2025, 03:32 PM
Aug 4

if it makes you feel better, but don't be under any illusion that you've solved a problem.

Karasu

(1,851 posts)
5. I agree it won't solve the problem. But I also think it's better than ACTIVELY enabling them through inaction, certainly
Mon Aug 4, 2025, 03:39 PM
Aug 4

in the case of the AI industry.

Latest Discussions»Latest Breaking News»Perplexity accused of scr...