Readit News
perardi commented on Retiring GPT-4o, GPT-4.1, GPT-4.1 mini, and OpenAI o4-mini in ChatGPT   openai.com/index/retiring... · Posted by u/rd
perardi · 16 days ago
OK, everyone is (rightly) bringing up that relatively small but really glaringly prominent AI boyfriend subreddit.

But I think a lot more people are using LLMs as relationship surrogates than that (pretty bonkers) subreddit would suggest. Character AI (https://en.wikipedia.org/wiki/Character.ai) seems quite popular, as do the weird fake friend things in Meta products, and Grok’s various personality modes and very creepy AI girlfriends.

I find this utterly bizarre. LLMs are peer coders in a box for me. I care about Claude Code, and that’s about it. But I realize I am probably in the vast minority.

perardi commented on Retiring GPT-4o, GPT-4.1, GPT-4.1 mini, and OpenAI o4-mini in ChatGPT   openai.com/index/retiring... · Posted by u/rd
fpgaminer · 16 days ago
I wish they would keep 4.1 around for a bit longer. One of the downsides of the current reasoning-based training regimens is a significant decrease in creativity. And chat-trained AIs were already quite "meh" at creative writing to begin with. 4.1 was the last of its breed.

So we'll have to wait until "creativity" is solved.

Side note: I've been wondering lately about a way to bring creativity back to these thinking models. For creative writing tasks you could add the original, pretrained model as a tool call. So the thinking model could ask for its completions and/or query it and get back N variations. The pretrained model's completions will be much more creative and wild, though often incoherent (think back to the GPT-3 days). The thinking model can then review these and use them to synthesize a coherent, useful result. Essentially giving us the best of both worlds. All the benefits of a thinking model, while still giving it access to "contained" creativity.
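
A rough sketch of that loop, with toy stand-ins rather than real models. Everything named here (`toy_base_model`, `toy_is_coherent`, `synthesize`) is hypothetical and made up for illustration; a real version would swap in an actual pretrained-model completion call and let the thinking model do the review and synthesis itself.

```python
from typing import Callable, List

# Canned "wild" continuations standing in for raw pretrained-model output.
# One entry is deliberately incoherent, as base-model samples often are.
FRAGMENTS = [
    "the moon hums in a minor key",
    "qx zz blorp",
    "rain forgets its own name",
    "a clock melts sideways",
]


def toy_base_model(prompt: str, n: int) -> List[str]:
    """Hypothetical tool: return n variations continuing the prompt."""
    return [f"{prompt} {FRAGMENTS[i % len(FRAGMENTS)]}" for i in range(n)]


def toy_is_coherent(draft: str) -> bool:
    """Hypothetical review step: reject drafts containing obvious noise."""
    return "blorp" not in draft


def synthesize(
    prompt: str,
    n: int,
    base_model: Callable[[str, int], List[str]],
    is_coherent: Callable[[str], bool],
) -> str:
    """Ask the base model for n drafts, drop incoherent ones, pick a result.

    Choosing the longest surviving draft is only a placeholder for the
    thinking model's real synthesis. Falls back to the bare prompt when
    nothing survives review.
    """
    drafts = base_model(prompt, n)
    kept = [d for d in drafts if is_coherent(d)]
    if not kept:
        return prompt
    return max(kept, key=len)


if __name__ == "__main__":
    print(synthesize("At dusk,", 4, toy_base_model, toy_is_coherent))
```

The point of the shape is that the wild sampling stays behind a tool boundary: the thinking model only ever sees drafts it asked for, and decides what to keep.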

perardi · 16 days ago
Have you tried the relatively recent Personalities feature? I wonder if that makes a difference.

(I have no idea. LLMs are infinite code monkeys on infinite typewriters for me, with occasional “how do I evolve this Pokémon” utility. But worth a shot.)

perardi commented on Tesla is committing automotive suicide   electrek.co/2026/01/29/te... · Posted by u/jethronethro
nailer · 16 days ago
Technology sometimes takes longer than estimates.
perardi · 16 days ago
And in Musk’s case, “longer” means “abandoned”. Like the cheap Model 3. Or the Hyperloop. Or swappable batteries. Or X as an everything app that includes banking.
perardi commented on Tesla is committing automotive suicide   electrek.co/2026/01/29/te... · Posted by u/jethronethro
nailer · 16 days ago
Because it's hard and Tesla think they can do it.

See 'reusable rockets' and 'having paralysed people control things with their minds' for other examples.

HN often seems to think there are Elon fans downmodding things, but it seems more like a case of irrational hatred.

perardi · 16 days ago
Oh, well let me get in my sub-$30,000 Model S, with a swappable battery and full-self-driving capabilities, and take a fully automated trip to the Hyperloop downtown so I can catch a quick ride out to O’Hare so I can fly out to watch a successful Starship launch…

…oh wait. I can’t. Because for all his successes, Musk has also sowed quite a lot of bullshit that has gone precisely nowhere.

perardi commented on Claude Cowork runs Linux VM via Apple virtualization framework   gist.github.com/simonw/35... · Posted by u/jumploops
dijit · a month ago
Is that even a sandbox?

I thought it was just a wrapper around an (old) existing tool that has been infinitely rebranded. Their old "remote desktop" program and some web listing capabilities to launch it in "rootless" mode.

perardi · a month ago
Yes, there is a sandbox.

https://simonwillison.net/2026/Jan/12/claude-cowork/

That’s the point of this gist, and the related blog post.

Also, it’s a bit of a stretch to call Claude Code, which isn’t even a year old…old.

perardi commented on The insecure evangelism of LLM maximalists   lewiscampbell.tech/blog/2... · Posted by u/todsacerdoti
perardi · a month ago
5 anti-AI posts on the home page of Hacker News…yeah, plenty of insecure evangelism amongst the skeptics, too.

Dead Comment

perardi commented on AI coding assistants are getting worse?   spectrum.ieee.org/ai-codi... · Posted by u/voxadam
PaulHoule · a month ago
You are better off talking to Google's AI mode about that sort of thing because it runs searches. Does great talking about how the Bills are doing because that's a good example where timely results are essential.

I haven't found any LLM where I totally trust what it tells me about Arknights, like there is no LLM that seems to understand how Scavenger recovers DP. Allegedly there is a good Chinese Wiki for that game which I could crawl and store in a Jetbrains project and ask Junie questions about but I can't resolve the URL.

perardi · a month ago
Even with search mode, I’ve had some hilarious hallucinations.

This was during the Gemini 2.5 era, but I got some just bonkers results looking for Tears of the Kingdom recipes. Hallucinated ingredients, out-of-nowhere recipes, and transposing Breath of the Wild recipes and effects into Tears of the Kingdom.

perardi commented on How Google got its groove back and edged ahead of OpenAI   wsj.com/tech/ai/google-ai... · Posted by u/jbredeche
sxp · a month ago
> Naina Raisinghani needed a name for the new tool to complete the upload. It was 2:30 a.m., though, and nobody was around. So she just made one up, a mashup of two nicknames friends had given her: Nano Banana.

Ah, that explains the silly name for such an impressive tool. I guess it's a more Googley name than what would have otherwise been chosen: Google Gemini Image Pro Red for Workspace.

perardi · a month ago
Strongly disagree.

Google, OpenAI, and Microsoft all have a very confusing product naming strategy where it’s all lumped under Gemini/ChatGPT/Copilot, and the individual product names are not memorable and really quite obscure. (What does Codex do again?)

Nano Banana doesn’t tell you what the product does, but you sure remember the name. It really rolls off the tongue, and it looks really catchy on social media.

perardi commented on How Google got its groove back and edged ahead of OpenAI   wsj.com/tech/ai/google-ai... · Posted by u/jbredeche
Supermancho · a month ago
Claude has been measurably worse than other models, in my experience. This alone makes me doubt the number. That, and Anthropic has not released official public financial statements, so I'll just assume it's the same kind of hand waving heavily leveraged companies tend to do.

I actually pay for ChatGPT and my company pays for Copilot (which is meh).

Edit: Given other community opinions, I don't feel I'm saying anything controversial. I have noted HN readers tend to be overly bullish on it for some reason.

perardi · a month ago
That doesn’t reflect my (I would say extensive) experience at this point, nor does it reflect the benchmarks. (I realize benchmarks have issues.)

Are you using Claude as an agent in VSCode or via Claude Code, or are you asking questions in the web interface? I find Claude is the best model when it’s working with a strongly typed language with a verbose linter and compiler. It excels with Go and TypeScript in Cursor.

u/perardi

Karma: 5795 · Cake day: May 24, 2013