Advice on how to deal with AI bots/scrapers?

zoey@lemmy.librebun.com · 9 days ago

Advice on how to deal with AI bots/scrapers?

CronyAkatsuki@lemmy.cronyakatsuki.xyz · edit-2 9 days ago

Try crowdsec.

You can set it up with list that are updsted frequetly and have to look at caddy proxy logs and then it can easilly block ai/bot like traffic.

I have it blocking over 100k ip’s at this moment.

https://www.crowdsec.net/

zoey@lemmy.librebun.com · 9 days ago

Not gonna lie, the $3900/mo at the top of the /pricing page is pretty wild.
Searched “crowdsec docker” and they have docs and all that. Thank you very much, I’ve heard of crowdsec before, but never paid much attention, absolutely will check this out!

Jakeroxs@sh.itjust.works · 8 days ago

You don’t have to pay to use it

K3CAN@lemmy.radio · 8 days ago

The paid plans get you the “premium” blocklists, which includes one specially made to prevent AI scrapers, but a free account will still get you the actual software, the community blocklist, plus up to three "basic"lists.

CronyAkatsuki@lemmy.cronyakatsuki.xyz · edit-2 7 days ago

And the comminity blocklists are updated when more than a couple ( I think the number is something like 10-50 ) instances of crowdsec block an ip in some fast timeframe.

The ai blocklist just adds IP when even one instance finds an AI trying to scrape right from the useragent.

So even if the community blocklist has fewer ai ip’s, it does eventually include them.

Starfarer@lemmy.today · 7 days ago

Which Crowd-Sec blocklists are you using?