Readit News logoReadit News
JanSt commented on GPT-5.2   openai.com/index/introduc... · Posted by u/atgctg
exe34 · 2 months ago
I'm quite sad about the S-curve hitting us hard in the transformers. For a short period, we had the excitement of "ooh if GPT-3.5 is so good, GPT-4 is going to be amazing! ooh GPT-4 has sparks of AGI!" But now we're back to version inflation for inconsequential gains.
JanSt · 2 months ago
I don't feel the S-curve at all yet. Still an exponential for me
JanSt commented on GPT-5.2   openai.com/index/introduc... · Posted by u/atgctg
JanSt · 2 months ago
The benchmarks are very impressive. Codex and Opus 4.5 are really good coders already and they keep getting better.

No wall yet and I think we might have crossed the threshold of models being as good or better than most engineers already.

GDPval will be an interesting benchmark and I'll happily use the new model to test spreadsheet (and other office work) capabilities. If they can going like this just a little bit further, much of the office workers will stop being useful.... I don't know yet how to feel about this.

Great for humanity probably but but for the individuals?

JanSt commented on Developers are choosing older AI models   augmentcode.com/blog/deve... · Posted by u/knes
virtualritz · 3 months ago
> [...] I'm using Opus 4.1 which is much better but seems to have much lower usage limits than before Sonnet 4.5 was released [...]

Yes, it's down from 40h/week to 3-5h/week on Max plan, effectively. A real bummer. See my comment here [1] regarding [2].

[1] https://news.ycombinator.com/item?id=45604301

[2] https://github.com/anthropics/claude-code/issues/8449

JanSt · 3 months ago
Thanks, didn't know that but aligns with my experience
JanSt commented on Developers are choosing older AI models   augmentcode.com/blog/deve... · Posted by u/knes
KronisLV · 3 months ago
For development use cases, I switched to Sonnet 4.5 and haven't looked back. I mean, sure, sometimes I also use GPT-5 (and mini) and Gemini 2.5 Pro (and Flash), and also Cerebras Code just switched to providing GLM 4.6 instead of the previous Qwen3 Coder so those as well, but in general the frontier models are pretty good for development and I wouldn't have much reason to use something like Sonnet 4 or 3.7 or whatever.
JanSt · 3 months ago
I have canceled my Claude Max subscription because Sonnet 4.5 is just too unreliable. For the rest of the month I'm using Opus 4.1 which is much better but seems to have much lower usage limits than before Sonnet 4.5 was released. When I hit 4.1 Opus limits I'm using Codex. I will probably go through with the Codex pro subscription.
JanSt commented on Composer: Building a fast frontier model with RL   cursor.com/blog/composer... · Posted by u/leerob
jasonjmcghee · 4 months ago
Yup - just like sibling comment said - my "low bar" is going to be whatever the best model is that isn't unreasonably costly/expensive.

Speed of model just isn't the bottleneck for me.

Before it I used Opus 4.1, and before that Opus 4.0 and before that Sonnet 4.0 - which each have been getting slightly better. It's not like Sonnet 4.5 is some crazy step function improvement (but the speed over Opus is definitely nice)

JanSt · 4 months ago
I think Opus 4.1 is still much better than Sonnet 4.5
JanSt commented on Tell HN: Supabase database restore from backup corrupting projects    · Posted by u/JanSt
999900000999 · 4 months ago
Looks like making a better Firebase is a bit hard. Superbase is neat, but hosting it yourself isn’t fun.

Hopefully they’ll be able to fix this data corruption, although a backup on the same host isn’t really a backup. The whole system can have issues

JanSt · 4 months ago
I'm only hosting a newly started side project there, but it does have paying users so I'm really unhappy at the moment.
JanSt commented on Tell HN: Supabase database restore from backup corrupting projects    · Posted by u/JanSt
cranberryturkey · 4 months ago
that's not good. i have 8 dbs with them.
JanSt · 4 months ago
This is a worst case scenario. Even worse is the no-communication and not turning off the restore function. This is having serious economic impact
JanSt commented on What happened to Apple's legendary attention to detail?   blog.johnozbay.com/what-h... · Posted by u/Bogdanp
JanSt · 4 months ago
My current top 3 apple software flaws:

1) battery warning above tabs in browser with no x to close it

2) WebKit bugs that make inputs and visual diverge so you have to click under the input to hit it

3) flickering email app when it’s opened

JanSt commented on Liquid Glass Is Cracked, and Usability Suffers in iOS 26   nngroup.com/articles/liqu... · Posted by u/uxjw
JanSt · 4 months ago
They also managed to introduce regressions into WebKit so that the visual and touch positions of fixed input elements diverge. Really makes you question what’s going on at Apple.

u/JanSt

KarmaCake day1602February 26, 2014View Original