underdeserver commented on The Waymo World Model   waymo.com/blog/2026/02/th... · Posted by u/xnx
andoando · 3 days ago
Ski lifts man, ski lifts all over the city
underdeserver · 3 days ago
What a glorious utopia we could have
underdeserver commented on We tasked Opus 4.6 using agent teams to build a C Compiler   anthropic.com/engineering... · Posted by u/modeless
underdeserver · 4 days ago
> when agents started to compile the Linux kernel, they got stuck. [...] Every agent would hit the same bug, fix that bug, and then overwrite each other's changes.

> [...] The fix was to use GCC as an online known-good compiler oracle to compare against. I wrote a new test harness that randomly compiled most of the kernel using GCC, and only the remaining files with Claude's C Compiler. If the kernel worked, then the problem wasn’t in Claude’s subset of the files. If it broke, then it could further refine by re-compiling some of these files with GCC. This let each agent work in parallel

This is a remarkably creative solution! Nicely done.
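A rough sketch of the oracle idea quoted above, in Python. It is not the harness from the post: `find_suspect_files`, `build_works`, and every parameter are hypothetical names, and the halving step is an assumption about how "further refine" might be done. `build_works(subset)` is assumed to build the files in `subset` with the compiler under test and everything else with the known-good compiler, returning True if the result works.

```python
import random
from typing import Callable, FrozenSet, List, Optional, Sequence


def find_suspect_files(
    files: Sequence[str],
    build_works: Callable[[FrozenSet[str]], bool],
    sample_fraction: float = 0.1,
    rng: Optional[random.Random] = None,
    max_rounds: int = 1000,
) -> List[str]:
    """Return a small group of files that still breaks the build when only
    that group is compiled by the compiler under test (hypothetical helper)."""
    rng = rng or random.Random()
    k = max(1, int(len(files) * sample_fraction))
    for _ in range(max_rounds):
        # Most files go to the oracle; only a random sample goes to the
        # compiler under test.
        subset = rng.sample(list(files), k)
        if build_works(frozenset(subset)):
            continue  # the problem is not in this sample, draw another
        # The build broke: hand half of the sample back to the oracle
        # and keep whichever half still reproduces the failure.
        while len(subset) > 1:
            mid = len(subset) // 2
            left, right = subset[:mid], subset[mid:]
            if not build_works(frozenset(left)):
                subset = left
            elif not build_works(frozenset(right)):
                subset = right
            else:
                break  # the failure needs files from both halves
        return subset
    return []  # no failing sample found within max_rounds
```

Because each call draws its own random sample, several workers could run this at once and tend to land on different failing files, which seems to be the property the quoted passage is after.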

underdeserver commented on We tasked Opus 4.6 using agent teams to build a C Compiler   anthropic.com/engineering... · Posted by u/modeless
GaggiX · 4 days ago
Clang is not written in Rust tho
underdeserver · 4 days ago
jinx
underdeserver commented on We tasked Opus 4.6 using agent teams to build a C Compiler   anthropic.com/engineering... · Posted by u/modeless
phillmv · 4 days ago
i mean… your work also went into the training set, so it's not entirely surprising that it spat a version back out!
underdeserver · 4 days ago
Anthropic's version is in Rust though, so at least a little different.
underdeserver commented on My AI Adoption Journey   mitchellh.com/writing/my-... · Posted by u/anurag
underdeserver · 4 days ago
> At a bare minimum, the agent must have the ability to: read files, execute programs, and make HTTP requests.

That's one very short step removed from Simon Willison's lethal trifecta.

underdeserver commented on Best Gas Masks   theverge.com/policy/86857... · Posted by u/cdrnsf
fanatic2pope · 7 days ago
Note that if you have a beard you should be aware that these types of masks don't work well.

https://pekesafety.com/blogs/news/a-respirator-that-works-wi...

underdeserver · 7 days ago
I still remember old bioweapons threat levels. Level 1 was no threat. At level 3 everyone was required to shave.
underdeserver commented on AGENTS.md outperforms skills in our agent evals   vercel.com/blog/agents-md... · Posted by u/maximedupre
underdeserver · 10 days ago
I don't think you can really learn from this experiment unless you specify which models you used, if you tried it against at least 3 frontier models, if you ran each eval multiple times, and what prompts you tried.

These things are non-deterministic across multiple axes.
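As a minimal sketch of what "ran each eval multiple times" could look like in practice: the Python below repeats a scoring function per model and reports the mean and spread. `summarize_runs` and `run_eval` are hypothetical names, not anything from the Vercel post; `run_eval(model, seed)` is assumed to return one numeric score for one run.

```python
import statistics
from typing import Callable, Dict, Iterable, Tuple


def summarize_runs(
    models: Iterable[str],
    run_eval: Callable[[str, int], float],
    runs: int = 5,
) -> Dict[str, Tuple[float, float]]:
    """Map each model to (mean score, sample standard deviation)."""
    summary = {}
    for model in models:
        # One score per repetition; the seed stands in for whatever
        # varies between runs (sampling, ordering, retries).
        scores = [run_eval(model, seed) for seed in range(runs)]
        spread = statistics.stdev(scores) if len(scores) > 1 else 0.0
        summary[model] = (statistics.mean(scores), spread)
    return summary
```

A difference between two configurations is only worth reading into if it is large relative to the spread reported for each model.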

underdeserver commented on Airfoil (2024)   ciechanow.ski/airfoil/... · Posted by u/brk
underdeserver · 12 days ago
Should be (2024).
underdeserver commented on First, make me care   gwern.net/blog/2026/make-... · Posted by u/andsoitis
underdeserver · 15 days ago
Probably should be marked (2025).
underdeserver commented on The coming industrialisation of exploit generation with LLMs   sean.heelan.io/2026/01/18... · Posted by u/long
tptacek · 21 days ago
They're prioritizing memory corruption vulnerabilities; that's the point of going to extremes to ensure there's no compiled C in their binaries.
underdeserver · 20 days ago
You can have memory corruption in pure Go code, too.

u/underdeserver

Karma: 3663 · Cake day: May 20, 2019