Yes, and the regulation should NOT limit passing or require the slower truck to brake.
It should allow a "Push To Pass" button that grants a 10mph boost for enough seconds to complete a pass within a reasonable distance, so as not to create problems for other traffic.
Current technology could easily limit these boosts to X uses per hour or day, and even geofence the usage to safe zones (use could even be restricted to passing lanes, so the truck being passed cannot start a drag race to stay ahead). Regulators could even require connectivity and disable the feature in poor road conditions.
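To be concrete, the rate limit plus geofence plus conditions check is a few lines of logic. This is a hypothetical sketch: the zone names, the three-uses-per-hour limit, and the 10mph boost value are illustration parameters I made up, not anything from an actual regulation or truck ECU.

```python
from datetime import datetime, timedelta

MAX_USES_PER_HOUR = 3            # assumed rate limit
BOOST_MPH = 10                   # boost amount from the comment above
SAFE_ZONES = {"passing_lane_A", "passing_lane_B"}  # hypothetical geofenced zones

class PushToPass:
    def __init__(self):
        self.uses = []  # timestamps of recent activations

    def request_boost(self, now, current_zone, road_conditions_ok):
        """Return the granted boost in mph (0 if the request is denied)."""
        # Geofence check: only allowed inside designated passing zones.
        if current_zone not in SAFE_ZONES:
            return 0
        # Connectivity/conditions check: disabled in poor road conditions.
        if not road_conditions_ok:
            return 0
        # Rate limit: keep only activations from the last hour, then count.
        self.uses = [t for t in self.uses if now - t < timedelta(hours=1)]
        if len(self.uses) >= MAX_USES_PER_HOUR:
            return 0
        self.uses.append(now)
        return BOOST_MPH
```

The point isn't this exact design, just that every restriction mentioned (per-hour cap, geofence, conditions kill-switch) is a cheap software check once the button exists.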
The people really being inconsiderate are not so much the truckers (though the slower trucker failing to yield and let the other one pass within a reasonable distance shares some blame) as the regulators who created this mess.
The problem is that most LLMs answer it correctly (see the many other comments in this thread reporting this). OP cherry-picked the few that answered incorrectly, without mentioning any that got it right, implying that 100% of them got it wrong.
Absolutely! But there is some nuance here. The failure mode involves an ambiguous question, which is an open research topic. There is no objectively correct answer to "Should I walk or drive?" given the stated constraints.
Because handling ambiguity is a problem that researchers are actively working on, I'm confident that models will improve in these situations. The improvements may asymptotically approach zero, leading to increasingly absurd examples of the failure mode. But that's okay, too: it means the models will increase in accuracy without becoming perfect. (I think I agree with Stephen Wolfram's take on computational irreducibility [1]: that handling ambiguity is a computationally irreducible problem.)
EWD was right, of course, and you are right to point out rigorous languages. But interacting with an LLM is different. A programming language cannot ask clarifying questions; it can only produce broken code or throw a compiler error. We prefer the compiler errors because broken code does not work, by definition. (Ignoring the "feature not a bug" gag.)
Most current models are fine-tuned to "produce broken code" rather than "throw a compiler error" in these situations. They are capable of asking clarifying questions; they just tend not to, because the RL schedule doesn't reward it.
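A toy way to see the incentive (this is my own illustration, not any lab's actual training setup): if the reward only scores the final answer on the current turn, a clarifying question earns nothing, while a guess on an ambiguous prompt pays off whenever it happens to match the user's intent.

```python
# Hypothetical reward shape: score direct answers, give nothing for a
# clarifying question (no scored answer exists yet on that turn).
def reward(response, answered_correctly):
    if response == "clarifying_question":
        return 0.0
    return 1.0 if answered_correctly else -0.5  # assumed penalty for a wrong answer

# Assumed: on an ambiguous prompt, a guess matches user intent 60% of the time.
p_correct_guess = 0.6
expected_guess = (p_correct_guess * reward("answer", True)
                  + (1 - p_correct_guess) * reward("answer", False))
expected_ask = reward("clarifying_question", False)
# Guessing has higher expected reward than asking, so the policy learns to guess.
```

Under these made-up numbers, guessing nets an expected 0.4 versus 0.0 for asking, so unless the reward explicitly credits well-placed clarifying questions, the tuned model will prefer to just answer.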
[1]: https://writings.stephenwolfram.com/2017/05/a-new-kind-of-sc...