Readit News
grantpitt commented on Claude Opus 4.5   anthropic.com/news/claude... · Posted by u/adocomplete
GodelNumbering · 24 days ago
Makes it sound like a one trick pony
grantpitt · 24 days ago
well, it's a big trick
grantpitt commented on Claude Opus 4.5   anthropic.com/news/claude... · Posted by u/adocomplete
GodelNumbering · 24 days ago
The fact that the post singled out SWE-bench at the top makes the opposite impression that they probably intended.
grantpitt · 24 days ago
do say more
grantpitt commented on Show HN: Train a language model in the browser with WebGPU   sequence.toys... · Posted by u/vvin
grantpitt · a month ago
Any application that can run in the browser will eventually run in the browser.
grantpitt commented on Nano Banana Pro   blog.google/technology/ai... · Posted by u/meetpateltech
evrenesat · a month ago
I've tried to repaint the exterior of my house. More than 20 times with very detailed prompts. I even tried to optimize it with Claude. No matter what, every time it added one, two or three extra windows to the same wall.
grantpitt · a month ago
Huh, can you share a link? I tried here: https://gemini.google.com/share/e753745dfc5d
grantpitt commented on Gemini 3   blog.google/products/gemi... · Posted by u/preek
tylervigen · a month ago
I am personally impressed by the continued improvement in ARC-AGI-2, where Gemini 3 got 31.1% (vs ChatGPT 5.1's 17.6%). To me this is the kind of problem that does not lend itself well to LLMs - many of the puzzles test the kind of thing that humans intuit because of millions of years of evolution, but these concepts do not necessarily appear in written form (or when they do, it's not clear how they connect to specific ARC puzzles).

The fact that these models can keep getting better at this task given the setup of training is mind-boggling to me.

The ARC puzzles in question: https://arcprize.org/arc-agi/2/

grantpitt · a month ago
Agreed, it also leads performance on arc-agi-1. Here's the leaderboard where you can toggle between arc-agi-1 and 2: https://arcprize.org/leaderboard
grantpitt commented on How to build tools that shape civilizations (Alan Kay and Ivan Zhao) [video]   youtube.com/watch?v=3M6_a... · Posted by u/justin66
grantpitt · 2 months ago
Very interesting to hear two technologists at a tech business conference say things along the lines of: "our tools do not merely extend us, they transform us", followed up with "we've become numb to the devastating consequences of technology".

(I know I'm somewhat selectively reading but still)

grantpitt commented on ARC-AGI-3 Preview   three.arcprize.org/... · Posted by u/blixt
grantpitt · 4 months ago
Interesting because games are exactly the kinds of RL environments that models can effectively learn - but the catch is that they must do this learning on the fly at test time. Very exciting to see this.

u/grantpitt

Karma: 139 · Cake day: August 14, 2021