Ameo (u/Ameo) - Readit News

Ameo commented on Philosophical Thoughts on Kolmogorov-Arnold Networks (2024) kindxiaoming.github.io/bl... · Posted by u/jxmorris12

Ameo · 2 days ago

I did a little mini research project into trying out KANs in practice for some toy problems, also back in 2024 when they were the hot new thing: https://cprimozic.net/blog/trying-out-kans/

TL;DR KANs are tricker to train than traditional neural networks, but they largely have similar loss values given equivalent parameter counts.

Part of this may be due to the fact that most of the optimizers and other components of the training stack have been tuned over decades for MLPs, and there may well be ways out there to get training to work even better for KANs.

I don't personally find a lot of appeal in KANs for big, deep models like LLMs or anything close to that scale. KANs and their B-Splines are much less hardware-friendly than matrix multiplication. However, they are interesting to me from an interpretability perspective, and there may be some unique possibilities there for smaller cases.

Posted by u/Ameo 6 days ago

Geotoy – Shadertoy for 3D Geometry 3d.ameo.design/geotoy...

Ameo commented on GPT-5 openai.com/gpt-5/... · Posted by u/rd

Ameo · 17 days ago

$10 per million output tokens, wow

Ameo commented on SQLx – Rust SQL Toolkit github.com/launchbadge/sq... · Posted by u/stmw

Ameo · a month ago

sqlx is my favorite way of working with databases in Rust hands down.

I've tried alternatives like Diesel and sea-orm. To be honest, I feel like full-blown ORMs really aren't a very good experience in Rust. They work great for dynamic languages in a lot of cases, but trying to tie in a DB schema into Rust's type system often creates a ton of issues once you try to do anything more than a basic query.

It's got a nice little migration system too with sqlx-cli which is solid.

Ameo commented on Ask HN: What are you working on? (July 2025) · Posted by u/david927

Ameo · a month ago

Shadertoy for geometry - Geotoy

https://3d.ameo.design/geotoy

Most core functionality is finished, and it's ready to go. Still some work to go on docs, tutorials, and polish.

Ameo commented on Show HN: WTFfmpeg – Natural Language to FFmpeg Translator github.com/scottvr/wtffmp... · Posted by u/ycombiredd

Ameo · a month ago

This has to be at least the fifth LLMpeg I've seen posted to hacker news in the past few months.

This whole repo is a single 300 LoC Python file over half of which is the system prompt and comments. It's not even a fine-tuned model or something, it's literally just a wrapper around llama-cpp with a very basic prompt tacked on.

I'm sure it's potentially useful and maybe even works, but I'm really sick of seeing these extremely low-effort projects posted and upvoted over and over.

Ameo commented on Libpostal: C library for parsing/normalizing street addresses around the world github.com/openvenues/lib... · Posted by u/nateb2022

Ameo · a month ago

I used this at a previous company with quite good success.

With relatively minimal effort, I was able to spin up a little standalone container that wrapped around the service and exposed a basic API to parse a raw address string and return it as structured data.

Address parsing is definitely an extremely complex problem space with practically infinite edge cases, but libpostal does just about as well as I could expect it to.

Ameo commented on AlphaGenome: AI for better understanding the genome deepmind.google/discover/... · Posted by u/i_love_limes

LarsDu88 · 2 months ago

You know the corporate screws are coming down hard, when the model (which can be run off a single A100) doesn't get a code release or a weight release, but instead sits behind an API, and the authors say fuck it and copy-paste the entirety of the model code in pseudocode on page 31 of the white paper.

Please Google/Demis/Sergei, just release the darn weights. This thing ain't gonna be curing cancer sitting behind an API and it's not gonna generate that much GCloud revenue when the model is this tiny.

Ameo · 2 months ago

> The model source code and weights will also be provided upon final publication.

Page 59 from the preprint[1]

Seems like they do intend to publish the weights actually

[1]: https://storage.googleapis.com/deepmind-media/papers/alphage...

Ameo commented on I like Svelte more than React (it's store management) river.berlin/blog/why-i-l... · Posted by u/adityashankar

mhh__ · 3 months ago

I also like svelte quite a lot although I was/am genuinely a bit confused as to how to join svelte to some external [state machine / business logic]. I ended up with runes basically infecting the entire codebase, but presumably a proper boundary must be possible?

Ameo · 3 months ago

> runes basically infecting the entire codebase

Yeah sadly the stores the author talks about here aren't the right way to do things anymore in modern svelte and they're all-in on Runes.

Stores were a big part of the reason I liked svelte; they were so conceptually simple, extensible with plain JS, and made interop outside of Svelte trivial without the Svelte magic leaking out.

They're still in Svelte, but they mix poorly with runes and are basically unsupported in runes mode. I opened up a bug about store behavior not working like it used to when mixing with runes, and the response was pretty much "yeah we don't expect it to work like that anymore when runes mode is enabled".

Posted by u/Ameo 3 months ago

Non-Pointless Software Projects for New Devs in the LLM Age cprimozic.net/blog/non-po...

u/Ameo

KarmaCake day1652June 6, 2015

About

https://cprimozic.net/

[ my public key: https://keybase.io/ameo; my proof: https://keybase.io/ameo/sigs/C7EVkxMlxFSzSnGNmmt2tIHFXKNsYyeQH8xVid35a2g ]

View Original