https://www.geekwire.com/2025/im-good-for-my-80-billion-what...
https://www.geekwire.com/2025/im-good-for-my-80-billion-what...
Well - I would have been interested in GPT-5 vs. Opus. Claude Code Max is affordable with Opus.
Because Anthropic is presumably massively subsidizing the usage.
I think this is the whole reason not to compare it to Opus...
It’s more likely that this sum is higher than they want. So really it’s not about predictability.
But that doesn’t mean you are only using SRAM, that would be impractical. Just like using a CPU just by storing stuff in the L3 cache and never going to the RAM. Unless I am missing something from the original link, I don’t know how you got to the conclusion that they only used SRAM.
Because they are doing 1,500 tokens per second.
I believe previous UBI experiments have shown the same results: most people keep working, some people stop, but they usually have decent reasons. Education, extending parental leave, or being a caregiver aren't necessarily things we want to discourage if they result in a greater return.
Greater return than what and to whom?
We already have existing labor markets that are very capable of determining returns.
> The main objective is to learn writing attention in CUDA C++, since many features are not available in Triton, such as MXFP8 / NVFP4 MMA for sm120.