Readit News logoReadit News
Ameo commented on Philosophical Thoughts on Kolmogorov-Arnold Networks (2024)   kindxiaoming.github.io/bl... · Posted by u/jxmorris12
Ameo · 2 days ago
I did a little mini research project into trying out KANs in practice for some toy problems, also back in 2024 when they were the hot new thing: https://cprimozic.net/blog/trying-out-kans/

TL;DR KANs are tricker to train than traditional neural networks, but they largely have similar loss values given equivalent parameter counts.

Part of this may be due to the fact that most of the optimizers and other components of the training stack have been tuned over decades for MLPs, and there may well be ways out there to get training to work even better for KANs.

I don't personally find a lot of appeal in KANs for big, deep models like LLMs or anything close to that scale. KANs and their B-Splines are much less hardware-friendly than matrix multiplication. However, they are interesting to me from an interpretability perspective, and there may be some unique possibilities there for smaller cases.

Ameo commented on GPT-5   openai.com/gpt-5/... · Posted by u/rd
Ameo · 17 days ago
$10 per million output tokens, wow
Ameo commented on SQLx – Rust SQL Toolkit   github.com/launchbadge/sq... · Posted by u/stmw
Ameo · a month ago
sqlx is my favorite way of working with databases in Rust hands down.

I've tried alternatives like Diesel and sea-orm. To be honest, I feel like full-blown ORMs really aren't a very good experience in Rust. They work great for dynamic languages in a lot of cases, but trying to tie in a DB schema into Rust's type system often creates a ton of issues once you try to do anything more than a basic query.

It's got a nice little migration system too with sqlx-cli which is solid.

Ameo commented on Ask HN: What are you working on? (July 2025)    · Posted by u/david927
Ameo · a month ago
Shadertoy for geometry - Geotoy

https://3d.ameo.design/geotoy

Most core functionality is finished, and it's ready to go. Still some work to go on docs, tutorials, and polish.

Ameo commented on Show HN: WTFfmpeg – Natural Language to FFmpeg Translator   github.com/scottvr/wtffmp... · Posted by u/ycombiredd
Ameo · a month ago
This has to be at least the fifth LLMpeg I've seen posted to hacker news in the past few months.

This whole repo is a single 300 LoC Python file over half of which is the system prompt and comments. It's not even a fine-tuned model or something, it's literally just a wrapper around llama-cpp with a very basic prompt tacked on.

I'm sure it's potentially useful and maybe even works, but I'm really sick of seeing these extremely low-effort projects posted and upvoted over and over.

Ameo commented on Libpostal: C library for parsing/normalizing street addresses around the world   github.com/openvenues/lib... · Posted by u/nateb2022
Ameo · a month ago
I used this at a previous company with quite good success.

With relatively minimal effort, I was able to spin up a little standalone container that wrapped around the service and exposed a basic API to parse a raw address string and return it as structured data.

Address parsing is definitely an extremely complex problem space with practically infinite edge cases, but libpostal does just about as well as I could expect it to.

Ameo commented on AlphaGenome: AI for better understanding the genome   deepmind.google/discover/... · Posted by u/i_love_limes
LarsDu88 · 2 months ago
You know the corporate screws are coming down hard, when the model (which can be run off a single A100) doesn't get a code release or a weight release, but instead sits behind an API, and the authors say fuck it and copy-paste the entirety of the model code in pseudocode on page 31 of the white paper.

Please Google/Demis/Sergei, just release the darn weights. This thing ain't gonna be curing cancer sitting behind an API and it's not gonna generate that much GCloud revenue when the model is this tiny.

Ameo · 2 months ago
> The model source code and weights will also be provided upon final publication.

Page 59 from the preprint[1]

Seems like they do intend to publish the weights actually

[1]: https://storage.googleapis.com/deepmind-media/papers/alphage...

Ameo commented on I like Svelte more than React (it's store management)   river.berlin/blog/why-i-l... · Posted by u/adityashankar
mhh__ · 3 months ago
I also like svelte quite a lot although I was/am genuinely a bit confused as to how to join svelte to some external [state machine / business logic]. I ended up with runes basically infecting the entire codebase, but presumably a proper boundary must be possible?
Ameo · 3 months ago
> runes basically infecting the entire codebase

Yeah sadly the stores the author talks about here aren't the right way to do things anymore in modern svelte and they're all-in on Runes.

Stores were a big part of the reason I liked svelte; they were so conceptually simple, extensible with plain JS, and made interop outside of Svelte trivial without the Svelte magic leaking out.

They're still in Svelte, but they mix poorly with runes and are basically unsupported in runes mode. I opened up a bug about store behavior not working like it used to when mixing with runes, and the response was pretty much "yeah we don't expect it to work like that anymore when runes mode is enabled".

u/Ameo

KarmaCake day1652June 6, 2015
About
https://cprimozic.net/

[ my public key: https://keybase.io/ameo; my proof: https://keybase.io/ameo/sigs/C7EVkxMlxFSzSnGNmmt2tIHFXKNsYyeQH8xVid35a2g ]

View Original