Readit News logoReadit News
buildbot commented on From GPT-4 to GPT-5: Measuring progress through MedHELM [pdf]   fertrevino.com/docs/gpt5_... · Posted by u/fertrevino
lossolo · 4 days ago
Without a provable hold out, claim that "large models do fine on unseen patterns" is unfalsifiable. In controlled from scratch training, CoT performance collapses under modest distribution shift, even with plausible chains. If you have results where the transformation family is provably excluded from training and a large model still shows robust CoT, please share them. Otherwise this paper’s claim stands for the regime it tests.
buildbot · 4 days ago
This paper's claim holds - for 4 layer models. Models improve on out of context examples dramatically at larger scales.
buildbot commented on The AI Job Title Decoder Ring   dbreunig.com/2025/08/21/a... · Posted by u/dbreunig
professoretc · 4 days ago
I saw "Hugginface" listed alongside C++, React, and SQL as skills on a resume recently. Wasn't quite sure what to make of that.
buildbot · 4 days ago
Honestly it's a large enough library with enough weirdness and untested areas, footguns, and bugs that I'd deem it just as valid as React for example.

Why did tensor_parallel have output += mod instead of output = output + mod? (The += breaks backprop). Nobody tested it! A user had to notice it was broken and make a PR!

buildbot commented on Blurry rendering of games on Mac   colincornaby.me/2025/08/y... · Posted by u/bangonkeyboard
bee_rider · 11 days ago
Are the screens OLED? The phones are...

IMO the notch is pointless, but they need space for the front camera. With OLED they can just turn the pixels off when it suits the application and it becomes like a big bevel, which was the alternative anyway.

buildbot · 10 days ago
My M1 MacBook Pro turns off the display & backlight that would show the notch as needed, for example right now in full screen Safari you could not tell there is a notch/menu bar area at all. It's actually just as good as it can be already. Free extra space!

Bezel not bevel FYI.

buildbot commented on Encryption made for police and military radios may be easily cracked   wired.com/story/encryptio... · Posted by u/mikece
colmmacc · 18 days ago
I listened to your great podcast and the remark along the lines of "unencrypted police comms let the robbers know when the police are getting close" made me wonder if anyone has built a simple signal intensity detector for the encrypted radios. You don't need to hear the contents to know that the radios are closing in on you. I can't imagine police forces practice RF silence like special forces do.

It really would be better to hide in the noise of 5G.

buildbot · 18 days ago
I’ve long wanted to do this with an SDR and maybe some simple ML, build a dataset by driving by cars/things with frequencies of interest.

Now I wonder if you can fingerprint antennas…

buildbot commented on Open models by OpenAI   openai.com/open-models/... · Posted by u/lackoftactics
rushingcreek · 20 days ago
The native FP4 is one of the most interesting architectural aspects here IMO, as going below FP8 is known to come with accuracy tradeoffs. I'm curious how they navigated this and how the FP8 weights (if they exist) were to perform.
buildbot · 20 days ago
One thing to note is that MXFP4 is a block scaled format, with 4.25 bits per weight. This lets it represent a lot more numbers than just raw FP4 would with say 1 mantissa and 2 exponent bits.
buildbot commented on A Real PowerBook: The Macintosh Application Environment on a Pa-RISC Laptop   oldvcr.blogspot.com/2025/... · Posted by u/todsacerdoti
whaleofatw2022 · 22 days ago
Isn't VILW how a number of GPUs worked internally? That said GPU isn't the same as GPC
buildbot · 22 days ago
Yes, as other noted AMD used VLIW for terscale in the 2000-6000 series. https://en.wikipedia.org/wiki/TeraScale_(microarchitecture)

They are used in a lot of DSP chips too, where you (hopefully) have very simple branching if any and nice data access patterns.

buildbot commented on Anthropic cut up millions of used books, and downloaded 7M pirated ones – judge   businessinsider.com/anthr... · Posted by u/pyman
kjkjadksj · 2 months ago
You may not think it is but the law does.
buildbot · 2 months ago
The law says it’s copyright infringement, not theft.
buildbot commented on Writing a basic Linux device driver when you know nothing about Linux drivers   crescentro.se/posts/writi... · Posted by u/sbt567
0xbadcafebee · 2 months ago
I want to run FreeBSD on my laptop, but they don't have a [complete] driver for my wifi card. I've thought about diving into AI coding-assistant agents just to see if I could use one to finish throwing together a working driver... but figuring out the AI agents is frictiony enough that I'm leaving it be. (I'm not a VSCode user)
buildbot · 2 months ago
Claude code, being a CLI interface, might be more your style? Expensive though
buildbot commented on The FPGA turns 40   adiuvoengineering.com/pos... · Posted by u/voxadam
CamperBob2 · 2 months ago
I'd expect a diffusion model to outperform autoregressive LLMs dramatically.
buildbot · 2 months ago
Certainly possible! Or perhaps a block diffusion+autoregressive model or something like GPT 4o's image gen.
buildbot commented on Early US Intel assessment suggests strikes on Iran did not destroy nuclear sites   cnn.com/2025/06/24/politi... · Posted by u/jbegley
cjbgkagh · 2 months ago
The math isn’t that hard and the ideal case is a linear extrapolation so people can sit down with a calculator and figure it out.
buildbot · 2 months ago
The math is really that hard? I have no idea what the soil or rock is, what happens when the first bomb hits it, the second, and then the third? Does the timing matter? Does the timing matter if it's 5 minutes between? 1 hour between? Seconds between? Does the type of soil or rock compact or loosen when bombed? What's the variation in explosive yield? Does the ground transfer force from a shockwave well or poorly? Does that change after the first one?

I really doubt this is very linear.

u/buildbot

KarmaCake day5374January 30, 2016
About
Senior Deep Learning Engineer @ Nvidia

https://www.maxg.io

https://blog.maxg.io

[ my public key: https://keybase.io/mtg; my proof: https://keybase.io/mtg/sigs/4PwbkRLWa26ARSIXs2TCRAFZaUTDQ2lsT8JGzJD72QM ]

View Original