One thing I’ve noticed with local models is that people tolerate a lot more trial-and-error behavior. When a hosted model wastes tokens it feels expensive, but when a local model loops a bit it just feels like it’s “thinking.”
If models like Qwen can get good enough for coding tasks locally, the real shift might be economic rather than purely about capability.
If high-quality training data becomes the real bottleneck, then the interesting question is how much signal you can extract from the same dataset when compute is cheap.