deadbolt (u/deadbolt)

> All of the atomic clocks continued ticking through the power outage last week thanks to their battery backup systems, according to NIST supervisory research physicist Jeff Sherman. What failed was the connection between some of the clocks and NIST's measurement and distribution systems, he said.

deadbolt commented on History LLMs: Models trained exclusively on pre-1913 texts github.com/DGoettlich/his... · Posted by u/iamwil

libraryofbabel · 2 months ago

This is the 2023 take on LLMs. It still gets repeated a lot. But it doesn’t really hold up anymore; it’s more complicated than that. Don’t let some factoid about how they are pretrained on autocomplete-like next-token prediction fool you into thinking you understand what is going on in that trillion-parameter neural network.

Sure, LLMs do not think like humans and they may not have human-level creativity. Sometimes they hallucinate. But they can absolutely solve new problems that aren’t in their training set, e.g. some rather difficult problems on the last Mathematical Olympiad. They don’t just regurgitate remixes of their training data. If you don’t believe this, you really need to spend more time with the latest SotA models like Opus 4.5 or Gemini 3.

Nontrivial emergent behavior is a thing. It will only get more impressive. That doesn’t make LLMs like humans (and we shouldn’t anthropomorphize them) but they are not “autocomplete on steroids” anymore either.

deadbolt · 2 months ago

As someone who still might have a '2023 take on LLMs', even though I use them often at work, where would you recommend I look to learn more about what a '2025 LLM' is and how it operates differently?