Edit: As mentioned by @tedsanders below, the post was edited to include clarifying language such as: “Both models make clear mistakes, but GPT‑5.2 shows better comprehension of the image.”
0: https://images.ctfassets.net/kftzwdyauwt9/6lyujQxhZDnOMruN3f...
- there's no good reason it's called "chat" instead of "Instant"
- gpt-5.1 and gpt-5.1-chat are different models, even though they both reason now. gpt-5.1 is more factual and can think for much longer. most people want gpt-5.1, unless the use case is ChatGPT-like or they prefer its personality.
> I hope AGI can be used to automate work
You people need a PR guy; I'm serious. OpenAI is the first company I've ever seen that comes across as actively trying to be misanthropic in its messaging. I'm probably too old-fashioned, but this honestly sounds like Marlboro launching the slogan "lung cancer for the weak of mind".
I've temporarily switched back to o3, thankfully that model is still in the switcher.
edit: s/month/week
You can find it right next to the image you are talking about.
LLMs have always been very subhuman at vision, and GPT-5.2 continues in this tradition, but it's still a big step up over GPT-5.1.
One way to get a sense of how bad LLMs are at vision is to watch them play Pokemon, e.g.: https://www.lesswrong.com/posts/u6Lacc7wx4yYkBQ3r/insights-i...
They still very much struggle with basic vision tasks that adults, kids, and even animals can ace with little trouble.