The second one is your problem. Haswell is over a decade old now, and almost nobody owns a CPU that old. -O3 makes a lot of architecture-dependent decisions, and tying yourself to an antique architecture gives you very bad results.
https://godbolt.org/z/oof145zjb
So no, Haswell is not the problem. LLVM just doesn't know about the dependency thing.
Still, it's all LLVM, so perhaps unsafe Rust for Fil-space could be a thing, and a useful one for catching what would otherwise be UB (Fil-C defines everything, so there is no UB, but I'm assuming you want to eventually run the code outside of Fil-space).
Now I actually wonder if Fil-C has an escape hatch somewhere for syscalls it does not understand, etc. Well, it doesn't do inline assembly, so I shouldn't expect much... I wonder how far one would need to extend the asm clobber syntax for it to come remotely close to working.
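For reference, here is roughly what the existing clobber syntax can express today on x86-64 Linux (a plain GCC/Clang sketch, nothing Fil-C specific):

    // Minimal raw write(2) via inline asm on x86-64 Linux (GCC/Clang syntax).
    // The clobber list is as precise as the syntax gets: rcx/r11 are trashed
    // by the syscall instruction itself, and "memory" is a blunt "anything
    // may have changed" marker.
    #include <cstddef>

    long raw_write(int fd, const void *buf, std::size_t len) {
        long ret;
        __asm__ volatile("syscall"
                         : "=a"(ret)                            /* rax: return value   */
                         : "a"(1), "D"(fd), "S"(buf), "d"(len)  /* nr=1, rdi, rsi, rdx */
                         : "rcx", "r11", "memory");             /* clobbered by syscall */
        return ret;
    }

Note that even this maximally explicit form only says "some memory may have changed"; there's no way to express which objects the kernel actually touched, which I'd guess is what Fil-C would really need to know.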
https://medicalxpress.com/news/2025-06-acetaminophen-discove...
Perhaps it's the first approved by the FDA, I don't know. In many countries, metamizole is the first-line drug for postoperative pain.
(It should be noted that metamizole may very rarely cause agranulocytosis. It is suspected that the risk varies depending on the genetic makeup of the population, which would explain why it is banned in some countries but available OTC in others.)
Tangential: China technically banned metamizole due to the agranulocytosis scare, but somehow small clinics always have fresh stocks of the stuff. And their stocks don't look like my metamizole for horses! It's pressed with the usual magnesium stearate instead of whatever rock-hard binder they use for animal drugs in China.
There was a time when BLAST-ing a DNA or protein sequence you had was like doing a Google search on it: it simply told you where the sequence might come from. This is especially useful when your research is about figuring out what that specific sequence is doing. It won't give you the answer immediately (otherwise, why bother doing the research at all), but it certainly gives context: sequence similarity often hints at similar or related functions.
As an analogy: imagine StackOverflow suddenly going down, and you don't know *if* it's ever coming back up.
Even non-cryptographic functions can benefit from the same kinds of threat modeling and careful attention to selection criteria that are common in crypto. The risk of skipping that is ending up with something as catastrophically flawed as boost's hash_combine [0] (since fixed in [1]).
[0] https://www.boost.org/doc/libs/1_70_0/doc/html/hash/referenc...
[1] https://github.com/boostorg/container_hash/commit/40ec854466...
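To make the flaw concrete, here's a small sketch of my own (not from the linked pages) using the pre-fix mixing step documented at [0], plus a quick avalanche check: flipping one input bit should flip about half of the 64 output bits in a well-mixed hash, but here it flips only a couple on average, which is roughly the kind of structural weakness the rewrite in [1] targets.

    // The old boost::hash_combine mixing step, as documented at [0].
    #include <bitset>
    #include <cstddef>
    #include <cstdint>
    #include <cstdio>

    static std::size_t combine(std::size_t seed, std::size_t h) {
        seed ^= h + 0x9e3779b9 + (seed << 6) + (seed >> 2);
        return seed;
    }

    int main() {
        // Avalanche check: flip bit 0 of the second value and count how many
        // output bits change. A well-mixed 64-bit hash would average ~32.
        double flipped = 0;
        for (std::size_t v = 0; v < 1000; ++v) {
            std::size_t h1 = combine(combine(0, 42), v);
            std::size_t h2 = combine(combine(0, 42), v ^ 1);
            flipped += std::bitset<64>(h1 ^ h2).count();
        }
        std::printf("avg bits flipped: %.2f (want ~32)\n", flipped / 1000);
    }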
The idea of bitwise reproducibility for floating-point computations is completely laughable in any part of the DL landscape. Meanwhile, in just about every other area that uses FP computation, it's been the de facto standard for decades.
From NVIDIA not guaranteeing bitwise reproducibility even on the same GPU: https://docs.nvidia.com/deeplearning/cudnn/backend/v9.17.0/d...
To frameworks somehow being even worse, where the best you can do is rank the frameworks by how bad they are (with TensorFlow far down at the bottom and JAX currently at the top) and use the least bad one.
This is a huge issue for anyone serious about developing novel models, and I see no one talking about it, let alone trying to solve it.
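The root cause is mundane: floating-point addition is not associative, so any kernel that lets the reduction order vary (atomics, different work splits, different SM counts) can legitimately return different bits on every run. A tiny illustration:

    // Floating-point addition is not associative, so summation order changes
    // the result; parallel reductions that don't fix their order inherit this.
    #include <cstdio>

    int main() {
        float a = 1e8f, b = -1e8f, c = 1.0f;
        std::printf("(a + b) + c = %g\n", (a + b) + c);  // 1: cancellation happens first
        std::printf("a + (b + c) = %g\n", a + (b + c));  // 0: c is absorbed into b
    }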
Not that strongly for the more parallel things; it's quite similar to the situation with atomics in cuDNN. cuBLAS, for example, has a similar issue with multi-stream handling, though that one can be overcome with proper workspace allocation: https://docs.nvidia.com/cuda/cublas/index.html?highlight=Rep....
Still better than cuDNN, where some operations just don't have a reproducible version. The other fields are at least trying; DL doesn't seem to be.
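For reference, a sketch of the workspace approach as I read the cuBLAS reproducibility notes (error handling omitted, and the workspace size is an arbitrary assumption on my part): give each stream its own handle and a dedicated workspace via cublasSetWorkspace, so the library doesn't fall back to shared internal scratch space.

    // Hedged sketch: one handle plus one user-provided workspace per stream.
    // Error handling omitted; the 32 MiB size is an arbitrary assumption.
    #include <cublas_v2.h>
    #include <cuda_runtime.h>

    cublasHandle_t make_reproducible_handle(cudaStream_t stream) {
        cublasHandle_t handle;
        cublasCreate(&handle);
        cublasSetStream(handle, stream);
        void *ws = nullptr;
        const size_t ws_bytes = 32u << 20;  // fixed scratch size across runs
        cudaMalloc(&ws, ws_bytes);
        cublasSetWorkspace(handle, ws, ws_bytes);  // keeps cuBLAS off its shared pool
        return handle;
    }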
On that note, Intel added reproducible BLAS to oneMKL on CPU and GPU last year: https://www.intel.com/content/www/us/en/developer/archive/tr...
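I haven't used the newer GPU path from that article, so this sketch only shows the long-standing CPU-side CNR (Conditional Numerical Reproducibility) interface, which may differ from the oneMKL GPU one:

    // Classic MKL CNR setup on CPU. MKL_CBWR_COMPATIBLE pins the code path
    // to a fixed ISA so results don't change across machines; strict
    // run-to-run reproducibility still assumes a fixed thread count.
    #include <mkl.h>

    int main() {
        if (mkl_cbwr_set(MKL_CBWR_COMPATIBLE) != MKL_CBWR_SUCCESS)
            return 1;
        // ... BLAS calls from here on use the pinned, reproducible path ...
        return 0;
    }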