Readit News logoReadit News
gallerdude commented on Three Years from GPT-3 to Gemini 3   oneusefulthing.org/p/thre... · Posted by u/JumpCrisscross
gallerdude · a month ago
It is interesting that most of our modes of interaction with AI is still just textboxes. The only big UX change in that the last three years has been the introduction of the Claude Code / OpenAI Codex tools. They feel amazing to use, like you're working with another independent mind.

I am curious what the user interfaces of AI in the future will be, I think whoever can crack that will create immense value.

gallerdude commented on GPT-5o-mini hallucinates medical residency applicant grades   thalamusgme.com/blogs/cor... · Posted by u/medicalthrow
gallerdude · 2 months ago
I wonder if they're using reasoning? It usually eliminates these types of errors
gallerdude commented on Yet Another LLM Rant   overengineer.dev/txt/2025... · Posted by u/sohkamyung
efilife · 5 months ago
> it cannot "logically reason" like a human does

Reason? Maybe. But there's one limitation that we currently have no idea how to overcome; LLMs don't know how much they know. If they tell you they don't something it may be a lie. If they tell you they do, this may be a lie too. I, a human, certainly know what I know and what I don't and can recall from where I know the information

gallerdude · 5 months ago
> OpenAI researcher Noam Brown on hallucination with the new IMO reasoning model:

> Mathematicians used to comb through model solutions because earlier systems would quietly flip an inequality or tuck in a wrong step, creating hallucinated answers.

> Brown says the updated IMO reasoning model now tends to say “I’m not sure” whenever it lacks a valid proof, which sharply cuts down on those hidden errors.

> TLDR, the model shows a clear shift away from hallucinations and toward reliable, self‑aware reasoning.

Source: https://x.com/chatgpt21/status/1950606890758476264

gallerdude commented on OpenAI claims gold-medal performance at IMO 2025   twitter.com/alexwei_/stat... · Posted by u/Davidzheng
beering · 5 months ago
What do you mean by “pure language model”? The reasoning step is still just the LLM spitting out tokens and this was confirmed by Deepseek replicating the o models. There’s not also a proof verifier or something similar running alongside it according to the openai researchers.

If you mean pure as in there’s not additional training beyond the pretraining, I don’t think any model has been pure since gpt-3.5.

gallerdude · 5 months ago
Local models you can get just the pretrained versions of, no RLHF. IIRC both Llama and Gemma make them available.
gallerdude commented on Rolling the ladder up behind us   xeiaso.net/blog/2025/roll... · Posted by u/techknowlogick
burlesona · 6 months ago
> The issue with an industry awash with cheap dross, is that it becomes prohibitively expensive to produce high Quality stuff.

This seems to be one of the brutal truths of the modern world, and as far as I can tell it applies to everything. There's always a race to the bottom to make everything as cheaply as possible, and the further the industry goes down that "cheapness" scale, the more "quality" loses market share, the more expensive "quality" must be in order to operate at all, and finally things that used to be just "normal" and not too expensive are now luxury goods.

Consider textiles, carpentry, masonry, machine tooling, appliances, etc. etc.

This doesn't feel like a good outcome, but I'm not sure there's anything that can be done about it.

gallerdude · 6 months ago
I can see both sides of it. There’s a fancy bread bakery by where I live. I go infrequently, the bread is great. But it’s expensive, most of the I just want a cheap loaf from Target, as do most people.

Instead of broad employment of artisan breadsmiths, we have people doing email work, because it’s more economically valuable. If the government mandated a higher quality of bread, we’d be slightly richer and bread and slightly poorer in everything else.

gallerdude commented on VVVVVV Source Code   github.com/TerryCavanagh/... · Posted by u/radeeyate
unwind · 8 months ago
Wow, that is cool! Did it help/affect your later choices with your career, did you end up a game developer, or at least try it or so? Always fun with closure! :)
gallerdude · 8 months ago
I made a very mediocre platformer in my senior year of high school, published on itch.io. I ended up becoming a software developer, which I enjoy 80% as much, but without any burnout or worrying about the superstar economics of being a game dev. Once the singularity hits, maybe I'll make more games.

https://gallerdude.itch.io/the-journey-east-full

gallerdude commented on VVVVVV Source Code   github.com/TerryCavanagh/... · Posted by u/radeeyate
gallerdude · 8 months ago
When I was near the end of high school, my family visited London, and I was thinking about being a game dev. So I sent Terry Cavanagh an email, and to my surprise he completely agreed to get lunch.

He was extremely kind, gave me a lot of interesting life advice. I remember him saying that he got most of his ideas just from playing around with mechanics and experimenting a lot, he was never really one to get grand visions.

Anyways, great fellow, glad he opened source V (as he called it).

Dead Comment

gallerdude commented on DeepSeek-Prover-V2   github.com/deepseek-ai/De... · Posted by u/meetpateltech
smusamashah · 8 months ago
Sorry, forgot multiply by 100
gallerdude · 8 months ago
classic human hallucination
gallerdude commented on AGI Is Still 30 Years Away – Ege Erdil and Tamay Besiroglu   dwarkesh.com/p/ege-tamay... · Posted by u/Philpax
fusionadvocate · 8 months ago
Can someone throw some light on this Dwarkesh character? He landed a Zucc podcast pretty early on... how connected is he? Is he an industry plant?
gallerdude · 8 months ago
He's awesome.

I listened to Lex Friedman for a long time, and there was a lot of critiques of him (Lex) as an interviewer, but since the guests were amazing, I never really cared.

But after listening to Dwarkesh, my eyes are opened (or maybe my soul). It doesn't matter I've heard of not-many of his guests, because he knows exactly the right questions to ask. He seems to have genuine curiosity for what the guest is saying, and will push back if something doesn't make sense to him. Very much recommend.

u/gallerdude

KarmaCake day1564August 5, 2016View Original