Readit News logoReadit News
Lerc · 8 months ago
If it runs pytorch at speed without hand holding I would probably get one.

If it runs tinygrad at speed(a lower bar developmentally) I might get one.

Is there a model benchmarking site where you can select varying degrees of models by source code and see how they perform on different hardware. It would assist people to evaluate whether or not a specific piece of hardware is good for the jobs that they want it to do.

diggan · 8 months ago
> Is there a model benchmarking site where you can select varying degrees of models by source code and see how they perform on different hardware. It would assist people to evaluate whether or not a specific piece of hardware is good for the jobs that they want it to do.

Not that I'm aware of (at least based on real benchmarks), but it's something I've been noodling about building, together with with some other associated data that can be helpful when wanting to select a model. Glad to hear I'm not the only one wanting it :)

Deleted Comment

kcb · 8 months ago
bearjaws · 8 months ago
Not shown: Wattage

100% chance its chewing through at least 50% more power to achieve the result.

Infact based on their TDP guidance, it goes up to 120w, which is more than double M4. But we don't know what the configuration was for this benchmark. We also don't have great numbers for M4's power consumption either.

Then you throw in the fact 120w TDP from AMD is not actually a power consumption figure... and it's all made up.

ac29 · 8 months ago
M4 Max is the most comparable to Strix Halo and while Apple does not appear to give an official power consumption, there are plenty of anecdotal reports of it using over 100W under load. For example:

https://www.reddit.com/r/macbookpro/comments/1hj3m0p/m4_max_...

https://forums.macrumors.com/threads/m4-max-eats-battery.244...

wizzard0 · 8 months ago
M2 Max discharges in 2-3h when running ML models, and plugged into 140W brick.

M4 is likely more power efficient, but not 2x.

danudey · 8 months ago
Not to mention what's the performance like on battery vs. plugged in. If I have to stay tethered to the wall in order to achieve the rated performance then it's not really an apples-to-apples comparison unless you only ever use your laptop at a desk (which is probably most people, honestly).
zamadatix · 8 months ago
Source article of that (though it's actually linked in TFA as well) https://www.digitaltrends.com/computing/amd-mobile-processor...

Non-thumbnail version of the chart: https://www.digitaltrends.com/wp-content/uploads/2025/01/3d-...

diggan · 8 months ago
What is the graph supposed to measure, actually? Renders are usually measured in seconds, so high=worse, but then clearly they highlight it as they're better, so it's the second-difference as a percentage or something?

Why can't companies just include absolute numbers in their comparisons...

zamadatix · 8 months ago
It's first party marketing so always orienting the scale towards "higher=better=ours" and measuring via "whatever measurement gave the best numbers to present". They could give all the information in the world and I'd still wait and see what 3rd party reviews say the performance actually is rather than look into the 1st party number.
danudey · 8 months ago
Betterness.
tedunangst · 8 months ago
They're benchmark scores.
Archit3ch · 8 months ago
Shouldn't they compare Max+ to M4 Max?
dagmx · 8 months ago
They should but it’s not favourable. In their presentation they specifically said it outperforms the binned M4 Pro and is on par with the unbinned M4 Pro.

It would be behind the M4 Max. It’s also over double the wattage of the M4 Pro to achieve these numbers.

sliken · 8 months ago
Even M4 Pro is a big step up. M4 max is pretty expensive and I suspect AMD is targetting a lower price point, not that any prices were mentioned today.

256 bits * DDR5-8533 is a pretty big step up from any other x86-64 laptop or SFF and should be a pretty huge help for anything graphics or bandwidth intensive, like LLMs.

ndriscoll · 8 months ago
I'd expect laptops with this thing will be available at closer to the Pro (~$2k) than the Max (~$3k). I see a laptop with the 375 for ~$1700 right now, which is more comparable to the 10-core M4. Or in the minipc space, the 370 is ~$1k, which would again be comparable to a 10-core M4 mac mini.
ncann · 8 months ago
> AMD 'Strix Halo' Ryzen AI Max+ 395 PRO

Gosh that name is a mouthful

hnuser123456 · 8 months ago
Strix Halo is the codename, not marketing name, so you can remove that part.

"AI" seems to have replaced the segment number.

The + is because it's the top-end model of the lineup.

Not sure what's Max or Pro about it though.

enragedcacti · 8 months ago
I think the AI appears on any SKU with enough TOPS for copilot+ (they released some 200 SKUs that are just Ryzen 5/7, no AI)

Max is the segment number i.e. it's "Ryzen 11" (the other 300 series SKUs they announced are Ryzen AI [5|7] 3xx). Weirdly though there are no Ryzen 9s so maybe it's really just a rebrand of 9.

The Pro just means it has management and security features for enterprise customers.

ncann · 8 months ago
They managed to put Max, plus, and Pro into the name, which is kinda impressive in a way. Now we just need Ultra to complete the set.
timewizard · 8 months ago
> Not sure what's Max or Pro about it though.

Secure processor, shadow stacks, secure boot, hardware asset trackability. Enterprise stuff.

solarkraft · 8 months ago
> Strix Halo is the codename, not marketing name, so you can remove that part.

That was the good part, lol.

Uehreka · 8 months ago
Isn’t Strix a brand of RAM or something?
gazchop · 8 months ago
Sounds like ChatGPT had a breakdown when asked to name it.
leptons · 8 months ago
I hate it. Whatever does "strix" even mean?? The rest of it is stupid too.
Havoc · 8 months ago
nerdix · 8 months ago
Isn't Strix Halo just a code name?
jsheard · 8 months ago
This is the first x86 SoC to follow the lead set by Apple Silicons huge unified memory bus.

Bandwidth is more or less on par with the M4 Pro, and it supports up to 128GB.

unusualmonkey · 8 months ago
Remember that AMD has been making x86 SoC's with unified memory for quite some time.
jsheard · 8 months ago
Yeah but they had pretty meagre memory bandwidth until now, aside from the parts they made exclusively for Xbox and Playstation. AMD didn't seem to be interested in bringing fast unified memory to real computers until Apple did it. Now they need to double up the bus again to make an M4 Max-alike...
sliken · 8 months ago
Right, but none wider than 128 bits, unless you count PS5 and XboxX.
timschmidt · 8 months ago
Intel's Knights series of chips (a.k.a. Xeon Phi, a.k.a. Larabee) for servers shipped 8gb of 320gb/s on package memory in 2012: https://en.wikipedia.org/wiki/Xeon_Phi
crest · 8 months ago
It "just" doubled the memory bus width from 128 to 256 bit and cranked up the interface clock speed. I wonder what it means for the infinity fabric. Is it going to run at ~4GHz to keep up?
ksec · 8 months ago
I cant find any information on memory it uses for its 256GB/s. It said New Memory interface. Seems high even for 256Bit LPDDR5X.

Zen 5 CPU, RDNA 3.5 GPU, and XDNA 2 NPU. No word on process nodes.

adrian_b · 8 months ago
It uses LPDDR5X-8000, with a 256-bit memory interface (double in comparison with standard desktops).

8 GHz x 32 bytes = 256 GB/s

This has been known for a long time.

What annoys me is that AMD does not say whether the Zen 5 cores of Strix Halo have full vector processing pipelines, like Granite Ridge and Fire Range, or they have the narrow pipelines of Strix Point and Krackan Point.

wtallis · 8 months ago
These should be re-using the same CPU chiplets from the desktop and server products, so it'll be the full-sized Zen 5 cores.
sliken · 8 months ago
Various leaks have claimed that some products would ship with DDR5-8533 which is 266GB/sec. I wouldn't be surprised if a range of frequencies ship with the 1st gen devices.

Maybe even a SFF sized motherboard that allows CUDIMMs, which is a nice fit since each CUDIMM is 128 bits wide.

davrosthedalek · 8 months ago
LTT showed a slide which said 4nm IIRC or was that the 9950X3D?
UncleOxidant · 8 months ago
> If AMD keeps with tradition, which we fully expect, we will see these monstrous APU chips come to desktop PCs in the future.

How far in the future? I don't need another laptop, but would be nice to have a box to run local llms on. If these things can run LLMs at a decent clip then this would be sort of a "shut up and take my money" situation.

sliken · 8 months ago
I hope the LLM benchmarks announced were reasonable and not something gross like using a model that doesn't swap on the Strix Halo, but does on a 4090.

HP announced a HP Z2 mini g1a, which is bigger than a NUC, but I believe still considered a SFF:

https://www.pcworld.com/article/2567865/hp-z2-mini-g1a-packs...

wmf · 8 months ago
It's gotta be swapping or something. The 4090 is faster (and far more expensive) than Strix Halo in every way.
JudasGoat · 8 months ago
The problem with using these on a AM5 motherboard would be that these chips have quad channel ddr5, while AM5 has 2 memory channels.
wmf · 8 months ago
HP already announced a mini workstation with this chip.
UncleOxidant · 8 months ago
I'm seeing a laptop [1] do you have a link to the workstation?

EDIT: Oh, I see they're calling the laptop a workstation.

[1] https://liliputing.com/hp-zbook-ultra-14-g1a-mobile-workstat...

jmward01 · 8 months ago
If they can deliver the drives and support for pytorch out of the box then I know what my next laptop will have in it.
UncleOxidant · 8 months ago
Yep. Though I'd prefer just to have a small form factor box on my desk. Something the size of a Mac Mini.
sliken · 8 months ago
The strix halo announce was pretty much exactly what was leaked.

However one big surprise was that the Halo 395 chip runs Llama 3.1 70B-Q4 2.2x times faster than a RTX 4090 24GB. Anyone have any details? The slide mentions seeing AMD endnote SHO-14 for details.

Maybe 70B-Q4 doesn't fit in 24GB?

plasticchris · 8 months ago
Last time I played with it, 70b models are much larger than 24gb without a lot of quantization.
sliken · 8 months ago
It mentioned Q4, but after searching around a bit looks like 70B-Q4 need 35GB or so. So strix halo is 2.2x faster than a 4090 when it's paging to system ram.

Not so impressive 8-(.