Readit News logoReadit News
zaptrem commented on Why are anime catgirls blocking my access to the Linux kernel?   lock.cmpxchg8b.com/anubis... · Posted by u/taviso
msgodel · 14 days ago
I'm generally very pro-robot (every web UA is a robot really IMO) but these scrapers are exceptionally poorly written and abusive.

Plenty of organizations managed to crawl the web for decades without knocking things over. There's no reason to behave this way.

It's not clear to me why they've continued to run them like this. It seems so childish and ignorant.

zaptrem · 14 days ago
The bad scrapers would get blocked by the wall I mentioned. The ones intelligent enough to break the wall would simply take the easier way out and download the alternative data source.
zaptrem commented on Why are anime catgirls blocking my access to the Linux kernel?   lock.cmpxchg8b.com/anubis... · Posted by u/taviso
lmm · 14 days ago
The kind of crawlers/scrapers who DDoS a site like this aren't going to bother checking common crawl or tarballs. You vastly overestimate the intelligence and prosociality of what bursty crawler requests tend to look like. (Anyone who is smart or prosocial will set up their crawler to not overwhelm a site with requests in the first place - yet any site with any kind of popularity gets flooded with these requests sooner or later)
zaptrem · 14 days ago
If they don’t have the intelligence to go after the more efficient data collection method then they likely won’t have the intelligence or willpower to work around the second part I mentioned (keeping something like Anubis). The only problem is when you put Anubis in the way of determined, intelligent crawlers without giving them a choice that doesn’t involve breaking Anubis.
zaptrem commented on Why are anime catgirls blocking my access to the Linux kernel?   lock.cmpxchg8b.com/anubis... · Posted by u/taviso
elsjaako · 14 days ago
There's a lot of people that really don't like AI, and simply don't want their data used for it.
zaptrem · 14 days ago
While that’s a reasonable opinion to have, it’s a fight they can’t really win. It’s like putting up a poster in a public square then running up to random people and shouting “no, this poster isn’t for you because I don’t like you, no looking!” Except the person they’re blocking is an unstoppable mega corporation that’s not even morally in the wrong imo (except for when they overburden people’s sites, that’s bad ofc)
zaptrem commented on Why are anime catgirls blocking my access to the Linux kernel?   lock.cmpxchg8b.com/anubis... · Posted by u/taviso
zaptrem · 14 days ago
If people are truly concerned about the crawlers hammering their 128mb raspberry pi website then a better solution would be to provide an alternative way for scrapers to access the data (e.g., voluntarily contribute a copy of their public site to something like common crawl).

If Anubis blocked crawler requests but helpfully redirected to a giant tar ball of every site using their service (with deltas or something to reduce bandwidth) I bet nobody would bother actually spending the time to automate cracking it since it’s basically negative value. You could even make it a torrent so most of the be costs are paid by random large labs/universities.

I think the real reason most are so obsessed with blocking crawlers is they want “their cut”… an imagined huge check from OpenAI for their fan fiction/technical reports/whatever.

zaptrem commented on Counter-Strike: A billion-dollar game built in a dorm room   nytimes.com/2025/08/18/ar... · Posted by u/asnyder
piperswe · 16 days ago
You can still access CS:GO, by selecting the "csgo_legacy" beta on CS2
zaptrem · 16 days ago
Are there still servers running games? Not that it's really necessary since CS2 is basically CSGO with better smoke effects/lighting.
zaptrem commented on Claude Sonnet 4 now supports 1M tokens of context   anthropic.com/news/1m-con... · Posted by u/adocomplete
MagicMoonlight · 22 days ago
It’s a stupid metric because nothing in the real world has half a million words of context. So all they’re doing is feeding it imagined slop, or sticking together random files.
zaptrem · 22 days ago
It's useful for hours-long long-context debugging sessions in Claude Code, etc.
zaptrem commented on The surprise deprecation of GPT-4o for ChatGPT consumers   simonwillison.net/2025/Au... · Posted by u/tosh
perlgeek · a month ago
GPT-5 simply sucks at some things. The very first thing I asked it to do was to give me an image of knife with spiral damascus pattern, it gave me an image of such a knife, but with two handles at a right angle: https://chatgpt.com/share/689506a7-ada0-8012-a88f-fa5aa03474...

Then I asked it to give me the same image but with only one handle; as a result, it removed one of the pins from a handle, but the knife had still had two handles.

It's not surprising that a new version of such a versatile tool has edge cases where it's worse than a previous version (though if it failed at the very first task I gave it, I wonder how edge that case really was). Which is why you shouldn't just switch over everybody without grace period nor any choice.

The old chatgpt didn't have a problem with that prompt.

For something so complicated it doesn't surprise that a major new version has some worse behaviors, which is why I wouldn't deprecate all the old models so quickly.

zaptrem · a month ago
The image model (GPT-Image-1) hasn’t changed
zaptrem commented on Death by AI   davebarry.substack.com/p/... · Posted by u/ano-ther
zaptrem · 2 months ago
A few versions of that overview were not incorrect, there actually was another Dave Barry who did die at the time mentioned. Why does this Dave Barry believe he has more of a right to be the one pointed to for the query "What happened to him" when nothing has happened to him but something most certainly did happen to the other Dave Barry (death)?
zaptrem commented on GLP-1s are breaking life insurance   glp1digest.com/p/how-glp-... · Posted by u/alexslobodnik
atleastoptimal · 2 months ago
The solution is for life insurance companies to pay people to stay on ozempic
zaptrem · 2 months ago
The “payment” can come in the form of lower rates, with penalties if you stop.
zaptrem commented on Bayeux Tapestry Will Return to the U.K. for the First Time in 950 Years   news.artnet.com/art-world... · Posted by u/andsoitis
7734128 · 2 months ago
The automatic headline rewriting is an absolute joke and it has not improved a title a single time ever.
zaptrem · 2 months ago
Does HN auto re-write headlines using an LLM or something?

u/zaptrem

KarmaCake day1812March 2, 2020View Original