Readit News logoReadit News
WiSaGaN commented on Prism   openai.com/index/introduc... · Posted by u/meetpateltech
Perseids · 12 days ago
I'm dumbfounded they chose the name of the infamous NSA mass surveillance program revealed by Snowden in 2013. And even more so that there is just one other comment among 320 pointing this out [1]. Has the technical and scientific community in the US already forgotten this huge breach of trust? This is especially jarring at a time where the US is burning its political good-will at unprecedented rate (at least unprecedented during the life-times of most of us) and talking about digital sovereignty has become mainstream in Europe. As a company trying to promote a product, I would stay as far away from that memory as possible, at least if you care about international markets.

[1] https://news.ycombinator.com/item?id=46787165

WiSaGaN · 12 days ago
OpenAI has a former NSA director on its board. [1] This connection makes the dilution of the term "PRISM" in search results a potential benefit to NSA interests.

[1]: https://openai.com/index/openai-appoints-retired-us-army-gen...

WiSaGaN commented on Floating-Point Printing and Parsing Can Be Simple and Fast   research.swtch.com/fp... · Posted by u/chmaynard
WiSaGaN · 17 days ago
Rust's `serde_json` recently switched to use a new library for floating string conversion: https://github.com/dtolnay/zmij.
WiSaGaN commented on The microstructure of wealth transfer in prediction markets   jbecker.dev/research/pred... · Posted by u/jonbecker
WiSaGaN · 20 days ago
A market maker needs a premium to provide liquidity. If all else is equal, why would they take on execution time risk? This is a universal feature of continuous-trading Central Limit Order Books (CLOBs), not something unique to prediction markets.
WiSaGaN commented on Reproducing DeepSeek's MHC: When Residual Connections Explode   taylorkolasinski.com/note... · Posted by u/taykolasinski
taykolasinski · a month ago
I’m referring specifically to the fundamental residual connection backbone that defines the transformer architecture (x_{l+1} = x_l + F(x_l)).

While the sub-modules differ (MHA vs GQA, SwiGLU vs GeLU, Mixture-of-Depths, etc.), the core signal propagation in Llama, Gemini, and Claude relies on that additive residual stream.

My point here is that DeepSeek's mHC challenges that fundamental additive assumption by introducing learnable weighted scaling factors to the residual path itself.

WiSaGaN · a month ago
I guess I am asking how we know Gemini and Claude relies on the additive residual stream. We don't know the architecture details for these closed models?
WiSaGaN commented on Reproducing DeepSeek's MHC: When Residual Connections Explode   taylorkolasinski.com/note... · Posted by u/taykolasinski
taykolasinski · a month ago
OP here. I spent the last few days reproducing the mHC architecture from the recent DeepSeek paper (2512.24880).

Two key takeaways from the reproduction:

Unconstrained Hyper-Connections really do explode (7x amplification even at 10M scale).

I hit a nasty "stream persistence" bug where my tensors were the right shape, but the architecture was functionally broken.

This is Part 1 (10M scale). Part 2 (scaling to 1B on A100s) is coming later this week. Happy to answer questions about the implementation.

WiSaGaN · a month ago
How do you know "GPT-5, Claude, Llama, Gemini. Under the hood, they all do the same thing: x+F(x)."?
WiSaGaN commented on CLI agents make self-hosting on a home server easier and fun   fulghum.io/self-hosting... · Posted by u/websku
WiSaGaN · a month ago
I have a similar experience when I found out that claude code can use ssh to conect to remote server and diagnose any sysadmin issue there. It just feels really empowered.
WiSaGaN commented on New information extracted from Snowden PDFs through metadata version analysis   libroot.org/posts/going-t... · Posted by u/libroot
bawolff · a month ago
Its crazy this is just being discovered now.
WiSaGaN · a month ago
I think it's likely someone already discovered this. It's just that info is not broadcasted to people who want to comment on this thread.
WiSaGaN commented on Ed25519-CLI – command-line interface for the Ed25519 signature system (2024)   lib25519.cr.yp.to/ed25519... · Posted by u/INGELRII
WiSaGaN · a month ago
I can't find the source. Anyone can point to it?
WiSaGaN commented on How uv got so fast   nesbitt.io/2025/12/26/how... · Posted by u/zdw
Nextgrid · a month ago
It's not just greenfield-ness but the fact it's a commercial endeavor (even if the code is open-source).

Building a commercial product means you pay money (or something they equally value) to people to do your bidding. You don't have to worry about politics, licensing, and all the usual FOSS-related drama. You pay them to set their opinions aside and build what you want, not what they want (and if that doesn't work, it just means you need to offer more money).

In this case it's a company that believes they can make a "good" package manager they can sell/monetize somehow and so built that "good" package manager. Turns out it's at least good enough that other people now like it too.

This would never work in a FOSS world because the project will be stuck in endless planning as everyone will have an opinion on how it should be done and nothing will actually get done.

Similar story with systemd - all the bitching you hear about it (to this day!) is the stuff that would've happened during its development phase had it been developed as a typical FOSS project and ultimately made it go nowhere - but instead it's one guy that just did what he wanted and shared it with the world, and enough other people liked it and started building upon it.

WiSaGaN · a month ago
This argument falls apart when you look at Rust and Cargo. uv is literally trying to be "Python's Cargo." The entire blueprint came from a flagship FOSS project.

Rust's development used a structured, community RFC process—endless planning by your definition. The result was a famously well-designed toolchain that the entire community praises. FOSS didn't hold it back; it made it good.

So no, commercial backing isn't the only way to ship something good. FOSS is more than capable to ship great software when done right.

WiSaGaN commented on MiniMax M2.1: Built for Real-World Complex Tasks, Multi-Language Programming   minimaxi.com/news/minimax... · Posted by u/110
andai · a month ago
Opus is 3x cheaper now.

I think it's still not on the $20 plan tho which is sad.

WiSaGaN · a month ago
It is now. But the limit on $20 plan is quite low and easy to use up.

u/WiSaGaN

KarmaCake day963September 18, 2015View Original