minimaxir (u/minimaxir)

minimaxir commented on AI makes the easy part easier and the hard part harder blundergoat.com/articles/... · Posted by u/weaksauce

hyperadvanced · 11 hours ago

Am I stupid or do these agents regularly not read what’s in the agents.md file?

minimaxir · 11 hours ago

More recent models are better at reading and obeying constraints in AGENTS.md/CLAUDE.md.

GPT-5.2-Codex did a bad job of obeying my more detailed AGENTS.md files but GPT-5.3-Codex very evidently follows it well.

minimaxir commented on Speed up responses with fast mode code.claude.com/docs/en/f... · Posted by u/surprisetalk

1123581321 · 2 days ago

Could be a use for the $50 extra usage credit. It requires extra usage to be enabled.

> Fast mode usage is billed directly to extra usage, even if you have remaining usage on your plan. This means fast mode tokens do not count against your plan’s included usage and are charged at the fast mode rate from the first token.

minimaxir · 2 days ago

After exceeding the increasingly shrinking session limit with Opus 4.6, I continued with the extra usage only for a few minutes and it consumed about $10 of the credit.

I can't imagine how quickly this Fast Mode goes through credit.

minimaxir commented on We tasked Opus 4.6 using agent teams to build a C Compiler anthropic.com/engineering... · Posted by u/modeless

stonogo · 4 days ago

AI companies set that expectation when their CEOs ran around telling anyone who would listen that their product is a generational paradigm shift that will completely restructure both labor markets and human cognition itself. There is no nuance in their own PR, so why should they benefit from any when their product can't meet those expectations?

minimaxir · 4 days ago

Because it leads to poor and nonconstructive discourse that doesn't educate anyone about the implications of the tech, which is expected on social media but has annoyingly leaked to Hacker News.

There have been more than enough drive-by comments from new accounts/green names even in this HN submission alone.

minimaxir commented on We tasked Opus 4.6 using agent teams to build a C Compiler anthropic.com/engineering... · Posted by u/modeless

whinvik · 4 days ago

It's weird to see the expectation that the result should be perfect.

All said and done, that it's even possible is remarkable. Maybe these all go into training the next Opus or Sonnet and we start getting models that can create efficient compilers from scratch. That would be something!

minimaxir · 4 days ago

A symptom of the increasing backlash against generative AI (both in creative industries and in coding) is that any flaw in the resulting product is treated as grounds to call it AI slop, even if the project is very explicit upfront that it's an experimental demo/proof of concept and not the NEXT BIG THING being hyped by influencers. That nuance is dead even outside of social media.

minimaxir commented on We tasked Opus 4.6 using agent teams to build a C Compiler anthropic.com/engineering... · Posted by u/modeless

gignico · 4 days ago

> To stress test it, I tasked 16 agents with writing a Rust-based C compiler, from scratch, capable of compiling the Linux kernel. Over nearly 2,000 Claude Code sessions and $20,000 in API costs, the agent team produced a 100,000-line compiler that can build Linux 6.9 on x86, ARM, and RISC-V.

If you don't care about code quality, maintainability, readability, conformance to the specification, and performance of the compiler and of the compiled code, please, give me your $20,000, I'll give you your C compiler written from scratch :)

minimaxir · 4 days ago

There is an entire Evaluation section that addresses that criticism (both in agreement and disagreement).

minimaxir commented on Claude Opus 4.6 anthropic.com/news/claude... · Posted by u/HellsMaddy

Aeroi · 4 days ago

($10/$37.50 per million input/output tokens) oof

minimaxir · 4 days ago

Only if you go above 200k, which is a) standard with other model providers and b) intuitive as compute scales with context length.

minimaxir commented on GPT-5.3-Codex openai.com/index/introduc... · Posted by u/meetpateltech

minimaxir · 4 days ago

I remember when AI labs coordinated so they didn't push major announcements on the same day to avoid cannibalizing each other. Now we have AI labs pushing major announcements within 30 minutes.

minimaxir commented on Claude Opus 4.6 anthropic.com/news/claude... · Posted by u/HellsMaddy

minimaxir · 4 days ago

Will Opus 4.6 via Claude Code be able to access the 1M context limit? The cost increase by going above 200k tokens is 2x input, 1.5x output, which is likely worth it especially for people with the $100/$200 plans.
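
For a rough sense of the arithmetic behind those multipliers, here is a minimal sketch in Python. The base rates of $5/$25 per million input/output tokens are an assumption, back-derived by dividing the $10/$37.50 long-context figures quoted in this feed by the 2x/1.5x multipliers. It also assumes the premium applies to the whole request once input exceeds 200k tokens. `request_cost` is a hypothetical helper, not part of any SDK.

```python
# Assumed base rates (USD per million tokens), back-derived from the quoted
# long-context prices: $10 / 2 = $5 input, $37.50 / 1.5 = $25 output.
BASE_INPUT_PER_M = 5.00
BASE_OUTPUT_PER_M = 25.00

LONG_CONTEXT_THRESHOLD = 200_000  # input tokens; from the "above 200k" comment
INPUT_MULTIPLIER = 2.0            # "2x input"
OUTPUT_MULTIPLIER = 1.5           # "1.5x output"


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request.

    Assumption: when the input exceeds the threshold, every token in the
    request is billed at the premium rate, not just the overflow.
    """
    long_context = input_tokens > LONG_CONTEXT_THRESHOLD
    in_rate = BASE_INPUT_PER_M * (INPUT_MULTIPLIER if long_context else 1.0)
    out_rate = BASE_OUTPUT_PER_M * (OUTPUT_MULTIPLIER if long_context else 1.0)
    return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000


if __name__ == "__main__":
    # A request under the threshold versus one well above it.
    print(f"150k in / 4k out: ${request_cost(150_000, 4_000):.2f}")
    print(f"500k in / 4k out: ${request_cost(500_000, 4_000):.2f}")
```

Under these assumptions the input side dominates the bill for long-context requests, so the 2x input multiplier matters far more than the 1.5x output one.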

minimaxir commented on Tell HN: We Are in Recession Now · Posted by u/ewuhic

minimaxir · 5 days ago

You’re not Michael Scott, you can’t just declare recession.

minimaxir commented on Xcode 26.3 – Developers can leverage coding agents directly in Xcode apple.com/newsroom/2026/0... · Posted by u/davidbarker

minimaxir · 6 days ago

[deleted]

u/minimaxir

Karma: 73576 · Cake day: March 24, 2012

About

Max Woolf — Senior Data Scientist at BuzzFeed in San Francisco, creator of AI text generation tools such as aitextgen and gpt-2-simple, plotter of pretty charts

https://minimaxir.com

https://github.com/minimaxir

https://bsky.app/profile/minimaxir.bsky.social

max [at] minimaxir.com

Sponsorship for my open-source projects: https://www.patreon.com/minimaxir
