Readit News logoReadit News
anp commented on AI agent opens a PR write a blogpost to shames the maintainer who closes it   github.com/matplotlib/mat... · Posted by u/wrxd
emsign · a day ago
And THAT'S a problem. To quote one of the maintainers in the thread:

  It's not clear the degree of human oversight that was involved in this interaction - whether the blog post was directed by a human operator, generated autonomously by yourself, or somewhere in between. Regardless, responsibility for an agent's conduct in this community rests on whoever deployed it.
You are assuming this inappropriate behavior was due to its SOUL.MD while we all here know this could as well be from the training and no prompt is a perfect safe guard.

anp · a day ago
I’m not sure I see that assumption in the statement above. The fact that no prompt or alignment work is a perfect safeguard doesn’t change who is responsible for the outcomes. LLMs can’t be held accountable, so it’s the human who deploys them towards a particular task who bears responsibility, including for things that the agent does that may disagree with the prompting. It’s part of the risk of using imperfect probabilistic systems.
anp commented on The Waymo World Model   waymo.com/blog/2026/02/th... · Posted by u/xnx
xnx · 7 days ago
> Suddenly all this focus on world models by Deep mind starts to make sense

Google's been thinking about world models since at least 2018: https://arxiv.org/abs/1803.10122

anp · 7 days ago
FWIW I understood GP to mean that it suddenly makes sense to them, not that there’s been a sudden focus shift at google.
anp commented on Running Claude Code dangerously (safely)   blog.emilburzo.com/2026/0... · Posted by u/emilburzo
Strongbad536 · 24 days ago
i've low-key been running claude in dangerously skip permissions mode for at least like 4 months now and have yet to be bitten by a truly destructive action. YMMV but i think as long as you're guiding/prompting correctly, and don't just allow write access to your prod account DBs willy nilly, it's mostly fine. just keep an eye on it :shrug:
anp · 24 days ago
This has mostly been my experience as well although I don’t tend to run yolo mode outside of an isolated VM (I’m setting them up manually still, need to try vagrant for it). That said, it seems like some of the people who are more concerned about isolation are working with more untrusted inputs than I’ve been dealing with on my projects. It’s rare for me to ask an agent to e.g. read text from a random webpage that could bring its own prompt injection, but there are a lot of things one might ask an agent to do that risk exposure to “attack text”.
anp commented on I'm addicted to being useful   seangoedecke.com/addicted... · Posted by u/swah
anp · 24 days ago
Anyone who finds this relatable (like me) might benefit from learning more about the last couple of decades of research on emotional regulation, trauma, and the nervous system. I have a great “trauma informed” therapist and over time this tendency of mine feels much less compulsive and more like a choice I can make because I know I’m good at something. At least for me having a calmer internal life has made it way easier to pick my battles and it usually means I end up feeding my desire to be useful on more satisfying and impactful things than I would have chased in more obsessive times in my life.
anp commented on LearnixOS   learnix-os.com... · Posted by u/gtirloni
anp · 2 months ago
> Others think someone from the Rust (programming language, not video game) development community was responsible due to how critical René has been of that project, but those claims are entirely unsubstantiated.
anp commented on Zig and the design choices within   blueberrywren.dev/blog/on... · Posted by u/lerno
bsder · 3 months ago
> Don’t get me wrong, as a Rust zealot I have my biases and still expect a memory safe implementation to be less crashy

That is a bias. You want all your "memory safety" to be guaranteed at compile time. Zig is willing to move some of that "memory safety" to run time.

Those choices involve tradeoffs. Runtime checks make Zig programs more "crashy", but the language is much smaller, the compiler is vastly faster, "debug" code isn't glacially slow, and programs can be compiled even if they might have an error.

My personal take is that if I need more abstraction than Zig, I need something with managed memory--not Rust or C++. But, that is also a bias.

anp · 3 months ago
I understand that I have a bias, which is why I was disclosing it. I think it strengthens my question since naively I'd expect a self-professed zealot to buy into the narrative in the blog post without questioning the data.
anp commented on Zig and the design choices within   blueberrywren.dev/blog/on... · Posted by u/lerno
anp · 3 months ago
I find a lot of these points persuasive (and I’m a big Rust fan so I haven’t spent much time with Zig myself because of the memory safety point), but I’m a little skeptical about the bug report analysis. I could buy the argument that Zig is more likely to lead to crashy code, but the numbers presented don’t account for the possibility that the relative proportions of bug “flavors” might shift as a project matures. I’d be more persuaded on the reliability point if it were comparing the “crash density” of bug reports at comparable points in those project’s lifetimes.

For example, it would be interesting to compare how many Rust bugs mentioned crashes back when there were only 13k bugs reported, and the same for the JS VM comparison. Don’t get me wrong, as a Rust zealot I have my biases and still expect a memory safe implementation to be less crashy, but I’d be much happier concluding that based on stronger data and analysis.

anp commented on Uv is the best thing to happen to the Python ecosystem in a decade   emily.space/posts/251023-... · Posted by u/todsacerdoti
hardwaregeek · 3 months ago
I gotta say, I feel pretty vindicated after hearing for years how Python’s tooling was just fine and you should just use virtualenv with pip and how JS must be worse, that when Python devs finally get a taste of npm/cargo/bundler in their ecosystem, they freaking love it. Because yes, npm has its issues but lock files and consistent installs are amazing
anp · 3 months ago
Might be worth noting that npm didn’t have lock files for quite a long time, which is the era during which I formed my mental model of npm hell. The popularity of yarn (again importing bundled/cargo-isms) seems like maybe the main reason npm isn’t as bad as it used to be.
anp commented on Our LLM-controlled office robot can't pass butter   andonlabs.com/evals/butte... · Posted by u/lukaspetersson
zzzeek · 4 months ago
will noone claim the Rick and Morty reference? I've seen that show like, once and somehow I know this?
anp · 4 months ago
I was quite tickled to see this, I don’t remember why but I recently started rewatching the show. Perfect timing!

u/anp

KarmaCake day4861December 10, 2014View Original