RyanCavanaugh (u/RyanCavanaugh)

RyanCavanaugh commented on We tasked Opus 4.6 using agent teams to build a C Compiler anthropic.com/engineering... · Posted by u/modeless

halxc · 10 days ago

We all saw verbatim copies in the early LLMs. They "fixed" it by implementing filters that trigger rewrites on blatant copyright infringement.

It is a research topic for heaven's sake:

https://arxiv.org/abs/2504.16046

RyanCavanaugh · 10 days ago

The internet is hundreds of billions of terabytes; a frontier model is maybe half a terabyte. While they are certainly capable of doing some verbatim recitations, this isn't just a matter of teasing out the compressed C compiler written in Rust that's already on the internet (where?) and stored inside the model.

RyanCavanaugh commented on NASA announces unprecedented return of sick ISS astronaut and crew livescience.com/space/spa... · Posted by u/bookofjoe

okdood64 · a month ago

A single soldier having a medical issue generally doesn't cancel a multi-month mission costing some X large sum of money, requiring another Y large sum of money to even finish cancelling it (returning their unit home).

Therefore it's not relevant and not needed for the public to know.

RyanCavanaugh · a month ago

What are you going to do with this information? What policy would you plausibly advocate for on the basis of it?

RyanCavanaugh commented on I Think I Found Something Weird About Physical Constants quantummarmelade.substack... · Posted by u/obius_prime

RyanCavanaugh · 2 months ago

If you overlay 30 prime number frequency waves plus 30 more even frequency waves, you're going to have an enormous number of local peaks.

Look at a chart of sin(x) + sin(x/2) + sin(x/3) + sin(x/5) + sin(x/7) + sin(x/11) + sin(x/13) + sin(x/17) + sin(x/19) + sin(x/23) + sin(x/29) + sin(x/31) + sin(x/37) + sin(x/43), you can find a local peak close to practically any number; the chart is effectively entirely composed of peaks.

It's extremely unsurprising that you would find peaks near mathematically relevant numbers, since there are peaks near any number whatsoever. You could pick ten random numbers out of a hat and fine tune those to 99.999%+ accuracy as well using the same scaling procedure.

RyanCavanaugh · 2 months ago

I modified hamilton_perfect_finder.py to have new values:

# Target constants CONSTANTS = { 'fine_structure': 131.11, 'phi': 1.9, 'pi': 3.6, 'e': 2.4, 'sqrt_2': 1.1, 'sqrt_3': 1.2, 'sqrt_5': 2.5,

Best results: L = 3017.391610 s = 0.042000 Average: 94.848564% Minimum: 82.509479%

  Constants:
    sqrt_2                   : 100.00000000%
    sqrt_3                   : 99.99236291%
    phi                      : 99.93928922%
    pi                       : 99.89806320%
    e                        : 99.88623436%
    sqrt_5                   : 99.85340314%

And this is before any fine-tuning of the parameter set!

RyanCavanaugh commented on I Think I Found Something Weird About Physical Constants quantummarmelade.substack... · Posted by u/obius_prime

obius_prime · 2 months ago

If anyone wants to collaborate or has questions or just wants to tell me off, my email is SylvanGaskin@gmail.com All are welcome. I'd tell myself off but ive done that already and it didn't help, i'm still doing this shit and cant seem to stop... maybe i need an intervention. XD

RyanCavanaugh · 2 months ago

If you overlay 30 prime number frequency waves plus 30 more even frequency waves, you're going to have an enormous number of local peaks.

Look at a chart of sin(x) + sin(x/2) + sin(x/3) + sin(x/5) + sin(x/7) + sin(x/11) + sin(x/13) + sin(x/17) + sin(x/19) + sin(x/23) + sin(x/29) + sin(x/31) + sin(x/37) + sin(x/43), you can find a local peak close to practically any number; the chart is effectively entirely composed of peaks.

It's extremely unsurprising that you would find peaks near mathematically relevant numbers, since there are peaks near any number whatsoever. You could pick ten random numbers out of a hat and fine tune those to 99.999%+ accuracy as well using the same scaling procedure.

RyanCavanaugh commented on Microsoft has urged its employees on H-1B and H-4 visas to return immediately timesofindia.indiatimes.c... · Posted by u/irthomasthomas

arn7av · 5 months ago

I agree with the first part, but for the second, this is not an unnamed WH source, it is the Press Secretary Karoline Leavitt

RyanCavanaugh · 5 months ago

The Press Secretary isn't the Supreme Court. Her say-so doesn't change the plain text of the order, and you're rolling the dice as to which any given border agent is going to choose to believe.

RyanCavanaugh commented on Hyrum's Law hyrumslaw.com... · Posted by u/andsoitis

gwd · 6 months ago

> At the very least, relying on quirks of someone else's implementation is a risk that should be understood and accounted for, and no one has any reasonable grounds for complaint if those quirks suddenly change in a new version.

It's almost always unintentional. Someone wrote some code, it works, they ship it, not realizing it only works if the list comes back in a specific order, or with a specific timing. Then a year or two later they do some updates, the list comes back in a different order, or something is faster or slower, and suddenly what worked before doesn't work.

This is why in Golang, for instance, when you iterate over map keys, it purposely does it in a random order -- to make sure that your program doesn't accidentally begin to rely on the internal implementation of the hash function.

ETA: But of course, that's not truly random, just pseudorandom. It's not impossible that someone's code only works because of the particular pseudorandom order they're generating, and that if Golang even changes the pseudorandom number generator they're using to evade Hyrum's Law that someone's code will break.

RyanCavanaugh · 6 months ago

There's probably at least one game out there somewhere that uses Go's map iteration order to shuffle a deck of cards, and would thus be broken by Go removing the thing that's supposed to prevent you from depending on implementation details.

RyanCavanaugh commented on Hyrum's Law hyrumslaw.com... · Posted by u/andsoitis

AdamH12113 · 6 months ago

From an API designer's standpoint (especially if that API has paying customers), Hyrum's Law is something that has to be taken into account. But from a user's standpoint, it is engineering malpractice, plain and simple. At the very least, relying on quirks of someone else's implementation is a risk that should be understood and accounted for, and no one has any reasonable grounds for complaint if those quirks suddenly change in a new version.

RyanCavanaugh · 6 months ago

The problem is that people commonly don't even realize they're depending on implementation quirks.

For example, they write code that unintentionally depends on some distantly-invoked async tasks resolving in a certain order, and then the library implementation changes performance characteristics and the other order happens instead, and it creates a new bug in the application.

RyanCavanaugh commented on The belay test and the modern American climbing gym climbing.com/people/peter... · Posted by u/vasco

mordechai9000 · a year ago

“I had some really good, famous, climbers come in and fail the belay test,”

Climbers still complain about the belay test, especially older climbers who cut their teeth outdoors and same late to the gym scene. But most gym accidents involving top roping or lead climbing are going to come down to a failed safety check or a mistake on the part of the belayer. And a failed safety check is at least partially a belayer failure.

Experience level doesn't necessarily correlate with safe technique. Beginners can be highly conscious of the consequences of a fall, where more experienced climbers can get complacent and sloppy when the negative consequences fail to materialize.

For example: the coach of an internationally competitive athlete dropped his climber on a grigri because he was casually chatting with someone on the ground and failed to control the brake strand.

https://youtu.be/WBGkKqLhM8Y?si=p58XDsgOG5O2dbJP

RyanCavanaugh · a year ago

The problem with the belay test as it exists today is that it tests whether you know all the peculiarities of each gym's beliefs around things like the exact order your hands should move when taking slack, whether tails on figure 8s are important (if so, how long, and what kind of knot may or must terminate them), whether the length of the belay loop matters, and so on. These things change seemingly on a whim and aren't always motivated by good evidence.

I learned to belay at Vertical World in 2005 and would fail Vertical World's belay test today, for multiple reasons, if I used the same method they themselves taught me!

Meanwhile, as you point out, no test can determine whether or not a person will be paying attention during an actual climb.

RyanCavanaugh commented on Why Go? github.com/microsoft/type... · Posted by u/mepian

jasonthorsness · a year ago

Would you say C# AOT could have been competitive in startup time and overall performance, and the decision came down to the other factors you've noted? I think everyone would have expected C# AOT to be the default choice so it might be nice to see some more concrete examples of where Go proved itself more suitable for your use-case.

RyanCavanaugh · a year ago

C# AOT is quite strong and would be a great choice in a lot of contexts.

Go was just very, very strong on port-friendliness when coming from the TS codebase in particular. If you pull up both codebases side-by-side, it's hard to even tell which is which. C# is more class-oriented and the core JS checker uses no classes at all, so the porting story gets quite a bit more difficult there.

RyanCavanaugh commented on Why Go? github.com/microsoft/type... · Posted by u/mepian

RyanCavanaugh · a year ago

(OP author here) Lots of people reading too much into the tea leaves here; this is just a matter of picking the best tool for this particular task, and our task (porting a JS codebase to the fastest available native target that still works in terms of not altering program structure as part of the port) is pretty unusual as far as things go

I would also recommend reading kdy1's observations when faced with the same task: https://kdy1.dev/2022-1-26-porting-tsc-to-go . The only caveat I'd add is that I can't tell where the 62x measurement in that post came from. Our own experiments doing Rust and Go implementations showed them within the margin of error, with certain phases being faster in one vs the other.