mzl (u/mzl) - Readit News

mzl commented on GPT‑5.3‑Codex‑Spark openai.com/index/introduc... · Posted by u/meetpateltech

pjs_ · 2 days ago

Continue to believe that Cerebras is one of the most underrated companies of our time. It's a dinner-plate sized chip. It actually works. It's actually much faster than anything else for real workloads. Amazing

mzl · a day ago

Technically, Cerebras solution is really cool. However, I am skeptical that it will be economically useful for models that are larger in size, as the requirements on the number of racks scales with the the size of the model to fit the weights in SRAM.

mzl commented on GPT‑5.3‑Codex‑Spark openai.com/index/introduc... · Posted by u/meetpateltech

simonw · a day ago

My stupid pelican benchmark proves to be genuinely quite useful here, you get a visual representation of the quality difference between GPT-5.3-Codex-Spark and full GPT-5.3-Codex: https://simonwillison.net/2026/Feb/12/codex-spark/

mzl · a day ago

I find it interesting that the spark version seems worse than the gpt-oss version (https://simonwillison.net/2025/Aug/5/gpt-oss/)

mzl commented on Kimi Released Kimi K2.5, Open-Source Visual SOTA-Agentic Model kimi.com/blog/kimi-k2-5.h... · Posted by u/nekofneko

XCSme · 18 days ago

> Kimi K2.5 can self-direct an agent swarm

Is this within the model? Or within the IDE/service that runs the model?

Because tool calling is mostly just the agent outputting "call tool X", and the IDE does it and returns the data back to AI's context

mzl · 18 days ago

An LLM model only outputs tokens, so this could be seen as an extension of tool calling where it has trained on the knowledge and use-cases for "tool-calling" itself as a sub-agent.

mzl commented on Scott Adams has died youtube.com/watch?v=Rs_Jr... · Posted by u/ekianjo

listenallyall · a month ago

And you're essentially demonstrating my point. Your long, complicated, meaningless comment here - which boils down to sex being impossible to define - is now widely accepted (and is the basis of a Supreme Court case), while someone like Scott Adams who would claim that chromosomes or sex organs (at birth) are indeed sufficient in defining one's sex, is perceived to be "off the rails". It's absurd.

mzl · 25 days ago

I said exactly what I wanted to say, in as simple terms as I am capable. The fact that some people insist on reality being simpler than it is does not make it true.

mzl commented on Ask HN: Share your personal website · Posted by u/susam

mzl · a month ago

https://zayenz.se - personal site with blog and research publications

mzl commented on vLLM large scale serving: DeepSeek 2.2k tok/s/h200 with wide-ep blog.vllm.ai/2025/12/17/l... · Posted by u/robertnishihara

rbanffy · a month ago

I'm a huge fan of their hardware - we've been promimsed wafer-scale integration since the 1980s and they delivered it. It'd be a shame if their tech ended up a dead-end.

On the bright side, they haven't started exploring stacking chips on top of their wafers to increase local memory, and every process change will bring increased bandwidth in and out of their "pizza". I really wish they succeed.

mzl · a month ago

Oh, I'm also a fan. It is really cool to see what they've done. However, in the current systems they have available, they would (as far as I've understood it) just need way to many racks to be able to serve the full Deepseek model for it to have any kind of economics. The main limiting factor is the amount of sram available per wafer.

mzl commented on vLLM large scale serving: DeepSeek 2.2k tok/s/h200 with wide-ep blog.vllm.ai/2025/12/17/l... · Posted by u/robertnishihara

rbanffy · a month ago

Very impressive numbers - I'd expect 2K tok/s on Cerebras hardware, not H200's.

mzl · a month ago

I don't think it would be economically viable to serve the full DeepSeek models on Cerebras hardware.

mzl commented on Scott Adams has died youtube.com/watch?v=Rs_Jr... · Posted by u/ekianjo

mzl · a month ago

Asking someone to give a sharp dividing line in a multi-dimensional bimodal but not discontinuous distribution is just nonsense.

In particular, being unable to give that strict difference (that does not exist) is not proof of not believing that the general bimodal groups exist, nor acknowledging that existence, nor saying that there is not general differences between the groups. It is not the gotcha that elementary school biology suggests it would be.

mzl commented on Mistral releases Devstral2 and Mistral Vibe CLI mistral.ai/news/devstral-... · Posted by u/pember

Fnoord · 2 months ago

> the wine glass scenario is a _realistic_ scenario

It is unrealistic because if you go to a restaurant, you don't get served a glass like that. It is frowned upon (alcohol is a drug, after all) and impractical (wine stains are annoying) to fill a glass of wine as such.

A pelican riding a bike, on the other hand, is realistic in a scenario because of TV for children. Example from 1950's animation/comic involving a pelican [1].

[1] https://en.wikipedia.org/wiki/The_Adventures_of_Paddy_the_Pe...

mzl · 2 months ago

A better reason why wine glasses are not filled like that is that wine glasses are designed to capture the aroma of the wine.

Since people look at a glass of wine and judge how much "value" they got based partly on how much wine it looks like, many bars and restaurants choose bad wine-glasses (for the purpose of enjoying wine) that are smalle and thus can be fulled more.

mzl commented on Jepsen: NATS 2.12.1 jepsen.io/analyses/nats-2... · Posted by u/aphyr

staticassertion · 2 months ago

I don't have a "school education" and I know plenty of theory, I certainly have read the papers cited in this test.

mzl · 2 months ago

You might not have a school education, but you have educated yourself. It is unfortunately common to hear people complain that the theory one learns in school (or by determined self-study) is useless, which I think is what the geybeard comment you replied to intends to say.

u/mzl

KarmaCake day1890April 27, 2010View Original