aconz2 (u/aconz2) - Readit News

aconz2 commented on How Compiler Explorer Works in 2025 xania.org/202506/how-comp... · Posted by u/pjmlp

aconz2 · 3 months ago

Ahh good to know about those -builder repos. It would be awesome to publish the compilers as container images, could also publish the squashfs as an artifact to a registry. And it shouldn't be too hard to back-publish the existing tarballs as containers without rebuilding. Might poke around a bit later

aconz2 commented on Show HN: Program Explorer, a container playground programexplorer.org/... · Posted by u/aconz2

TkTech · 6 months ago

Neat! One small bug - In dark mode, the text on the banner at the top is unreadable.

aconz2 · 6 months ago

Thanks, will get that fixed

aconz2 commented on Show HN: Program Explorer, a container playground programexplorer.org/... · Posted by u/aconz2

simpaticoder · 6 months ago

Congrats on a launch, sounds like your implementation is quite thoughtful. In fact limiting to only shell helps focus on what you're giving - 1 second of compute with 1GB of RAM, for free. Who knows what scripts people will try? I think sharing those scripts would be interesting. In that vein, sharing from you to your users, consider providing some example scripts that demonstrate the fiddly differences between base images.

aconz2 · 6 months ago

Thank you! I'm also excited at the possibilities and today you can share "long links" (with all the data embedded in the URL's fragment) but in the future short links with storage on the server or loading from a gist for example would be another cool feature. And having a variety of interesting examples ready to go like you mention would be nice too. There's almost too much there that I was worried I would never ship and would still be playing with examples. For fun, here is the most basic version of compiler explorer https://programexplorer.org/#s=eyJjbWQiOiJzaCAvcnVuL3BlL2luc...

aconz2 commented on Show HN: Program Explorer, a container playground programexplorer.org/... · Posted by u/aconz2

vivzkestrel · 6 months ago

only supports running shell scripts?

aconz2 · 6 months ago

You can run any program that is in the container you select. Right now it ignores the entrypoint from the container config (though you could manually copy that into the Cmd field). Shell is a convenient program to run though so you can pass it a script to then run multiple commands. But you can run `date` directly for instance: https://programexplorer.org/#s=eyJjbWQiOiJkYXRlIiwic3RkaW4iO...

aconz2 commented on Show HN: Program Explorer, a container playground programexplorer.org/... · Posted by u/aconz2

sys13 · 6 months ago

Feedback on the name: I didn't think that it would be about containers. The other platforms you mentioned are more descriptive (jsfiddle, db-fiddle, Go playground, and Rust playground)

aconz2 · 6 months ago

Yeah names are hard and it is just a working name. I'm simultaneously using containers as the distribution and runtime mechanism but also wanted them to be an implementation detail and not the spotlight

aconz2 commented on You could have designed state of the art positional encoding fleetwood.dev/posts/you-c... · Posted by u/Philpax

imjonse · 9 months ago

I don't think the first code example should work (it indeed says false here).

When given a permuted sequence, the attention output will also be permuted, not identical. The need for positional encodings is due to two tokens resulting in the same value in the final attention matrix regardless of the tokens' absolute and relative position; that is enough to miss a lot of meaning.

aconz2 · 9 months ago

To add on since this took me a while to understand: for a single token, self attention is permutation invariant because we take the qK (one query dot all the other keys) weighted sum of all the values; that sum is what gives the invariance because + is commutative. But for all the tokens, the mha output matrix will not be invariant, but rather equivariant, where you apply the same permutation to the output matrix as you did to the input tokens. What might be a more useful example is to take one position, like the last one, and compute its mha for every permutation of the previous tokens; those will/should all be the same.

aconz2 commented on The geometry of data: the missing metric tensor and the Stein score [Part II] blog.christianperone.com/... · Posted by u/perone

aconz2 · 10 months ago

Great read and visuals. I think they typo'd the pun on basically/basisally. It got me thinking about program synthesis in the following scheme: data is embedded as vectors and program operations are metric tensors (or maybe just fields in general?) which tell the data how to move. Then, if you have an input/output pair we seek some program to move the data from input to output along some low energy path. Model a whole program as a time varying (t 0-1) metric tensor (is that a thing?) and optimize to find such an object. Maybe you choose ahead of time the number of operations you're searching over and these are like spline basis points and then you lerp between the metric tensors of each op; or you do it continuously and then somehow recover the operations. Then you want to find one program which satisfies multiple input/output pairs, ie one time varying metric tensor (or generally field) such that if you integrate from the input points they all end up at (or close to, which makes me think that you want some learned metric tensor for closeness) the output points. Right now I'm only thinking of unary ops with no constants, maybe the constants could be appended to the input data symbolically and you also get to optimize that portion of the input vectors, with the constraint that it is a shared parameter across all inputs.

aconz2 commented on Our container platform is in production. It has GPUs. Here's an early look blog.cloudflare.com/conta... · Posted by u/jgrahamc

aconz2 · a year ago

surprised they didn't go straight into cloud-hypervisor, though I haven't actually tested with gpu yet but it is on my todo list. OCI layers can use zstd compression. I wonder if they are defeating layer sharing by splitting in 500 mb chunks. Lambda splits your image into chunks and shares at the block layer (I believe even same chunk different (user's?) container on a single host). Esp for 15 GB images I'd think using lazy pulling with nydus/stargz or whatever would be beneficial. I'd like to test out snapshotting, though my testing already boots a guest and runs a container in ~170ms; and I'm not actually sure how you write the guest init to signal it is ready for snapshotting and then wait properly (maybe you just sleep 1000?) so it resumes from the snapshot in a good state. I know fly has written about their use of snapshotting but I don't think it went into that detail. Cool stuff overall though, not worrying about locations and the yucky networking to do so seems nice