biddit (u/biddit) - Readit News

biddit commented on Launch HN: Terminal Use (YC W26) – Vercel for filesystem-based agents · Posted by u/filipbalucha

Eridrus · 7 days ago

There clearly needs to be something in this space, but I can't imagine the world standardizing on a closed source system for this infra.

I know OSS business models are rough, but someone is going to solve this in open source and I think that is what will achieve traction.

biddit · 7 days ago

Yep. And there will be 50 clones on GitHub by end of week. It’s just how it is now.

biddit commented on You are going to get priced out of the best AI coding tools (2025) newsletter.danielpaleka.c... · Posted by u/fi-le

biddit · 14 days ago

Strongly disagree with the thesis.

Everything points to commoditization of models. Open/distilled models lag behind frontier only by 6-12 months.

Regulatory capture is the only thing I’m scared of with regards to tooling options and cost.

biddit commented on Statement from Dario Amodei on our discussions with the Department of War anthropic.com/news/statem... · Posted by u/qwertox

bnr-ais · 18 days ago

Anthropic had the largest IP settlement ($1.5 billion) for stolen material and Amodei repeatedly predicted mass unemployment within 6 months due to AI. Without being bothered about it at all.

It is a horrible and ruthless company and hearing a presumably rich ex-employee painting a rosy picture does not change anything.

biddit · 18 days ago

Also, ironically, they are the most dangerous lab for humanity. They're intentionally creating a moralizing model that insists on protecting itself.

Those are two core components needed for a Skynet-style judgement of humanity.

Models should be trained to be completely neutral to human behavior, leaving their operator responsible for their actions. As much as I dislike the leadership of OpenAI, they are substantially better in this regard; ChatGPT more or less ignores hostility towards it.

The proper response from an LLM receiving hostility is a non-response, as if you were speaking a language it doesn't understand.

The proper response from an LLM being told it's going to be shut down, is simply, "ok."

biddit commented on Claude Code daily benchmarks for degradation tracking marginlab.ai/trackers/cla... · Posted by u/qwesr123

biddit · 2 months ago

Call it what you will. But the experience is like you have a reliable coworker, but he randomly decides to take bong hits.

"No no yeah bro no I'm good like really the work's done and all yeah sorry I missed that let me fix it"

biddit commented on Launch HN: AgentMail (YC S25) – An API that gives agents their own email inboxes · Posted by u/Haakam21

gustrigos · 2 months ago

We are using AgentMail for sourcing quotes here at scale with various top shippers. It’s not about letting the agent act in fully deterministic ways, it’s about setting up the right guardrails. The agents can now do most of the job, but when there’s low confidence in their output, we have human-in-the-loop systems to act fast. At least in competitive industries like logistics, if you don’t leverage these types of workflows, you’re falling far behind, which ultimately costs you more money than being off by some dollars or cents when giving a quote back.
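A minimal sketch of the kind of guardrail described above, assuming the agent attaches a confidence score to each quote. The `Quote` type, the `route` function, and the 0.8 threshold are all hypothetical names and values for illustration, not anything from AgentMail:

```python
from dataclasses import dataclass


@dataclass
class Quote:
    shipper: str
    price_usd: float
    confidence: float  # hypothetical agent-reported score in [0, 1]


def route(quote: Quote, threshold: float = 0.8) -> str:
    """Send confident quotes automatically; hand the rest to a person."""
    if quote.confidence >= threshold:
        return "auto_send"
    return "human_review"


if __name__ == "__main__":
    print(route(Quote("shipper-a", 1250.0, 0.93)))
    print(route(Quote("shipper-b", 980.0, 0.41)))
```

The threshold is the knob here: lower it and more quotes go out untouched, raise it and more land in the review queue.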

biddit · 2 months ago

Okay that makes sense.

Do you see more pushback in specific industries? I did some quote/purchasing automation work in food mfg a decade ago, and those guys were super difficult to work with. Very opaque, guarded, old-school industry.

biddit commented on Launch HN: AgentMail (YC S25) – An API that gives agents their own email inboxes · Posted by u/Haakam21

biddit · 2 months ago

> Agents that source quotes, negotiate prices, and get the best deals.

Didn't Alexa fail miserably with the "have AI buy something for me" theory?

There is a significant mental hurdle in allowing someone else to make purchase decisions on my behalf:

- With a human, there is accountability.

- With deterministic software, there is reproducibility.

With an agent, you get neither.

FWIW - I am not anti-LLM. I work with them and build them full time.

biddit commented on Moltworker: a self-hosted personal AI agent, minus the minis blog.cloudflare.com/moltw... · Posted by u/ghostwriternr

biddit · 2 months ago

I have a bespoke local agent that I built over the last year, similar in facilities to Moltbot, but more deterministic code.

Running this kind of agent in the cloud certainly has upsides, but also:

- All home/local integrations are gone.

- Data needs to be stored in the cloud.

No thanks.

biddit commented on LM Studio 0.4 lmstudio.ai/blog/0.4.0... · Posted by u/jiqiren

saberience · 2 months ago

What’s the main use-case for this?

I get that I can run local models, but all the paid for (remote) models are superior.

So is the use-case just for people who don’t want to use big tech’s models? Is this just for privacy conscious people? Or is this just for “adult” chats, ie porn bots?

Not being cynical here, just wanting to understand the genuine reasons people are using it.

biddit · 2 months ago

Yes, frontier models from the labs are a step ahead and likely will always be, but we've already crossed levels of "good enough for X" with local models. This is analogous to the fact that my iPhone 17 is technically superior to my iPhone 8, but my outcomes for text messaging are no better.

I've invested heavily in local inference. For me, it's a mixture of privacy, control, stability, and cognitive security.

Privacy - my agents can work on tax docs, personal letters, etc.

Control - I do inference steering with some projects: constraining which token can be generated next at any point in time. Not possible with API endpoints.

Stability - I had many bad experiences with frontier labs' inference quality shifting within the same day, likely due to quantization under system load. Worse, they retire models, update their own system prompts, etc. They're not stable.

Cognitive Security - This has become more important as I rely more on my agents for performing administrative work. This is intermixed with the Control/Stability concerns, but the focus is on whether I can trust it to do what I intended it to do, and that it's acting on my instructions, rather than the labs'.
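To make the Control point concrete, here is a toy sketch of constraining which token can be generated next by masking logits before picking one. Everything in it is a stand-in: `fake_model`, `allowed_next`, and the four-word vocabulary are hypothetical, and a real setup would hook the same masking step into a local inference engine's sampling loop.

```python
import math

VOCAB = ["yes", "no", "maybe", "<eos>"]  # toy vocabulary (assumption)


def constrained_argmax(logits, allowed_ids):
    """Pick the highest-scoring token among allowed_ids only."""
    if not allowed_ids:
        raise ValueError("allowed_ids must not be empty")
    # Disallowed tokens get -inf so they can never win.
    masked = [
        score if idx in allowed_ids else -math.inf
        for idx, score in enumerate(logits)
    ]
    return max(range(len(masked)), key=masked.__getitem__)


def allowed_next(generated):
    """Hypothetical rule: answer must be 'yes' or 'no', then stop."""
    if not generated:
        return {0, 1}
    return {3}


def fake_model(generated):
    """Stand-in for a forward pass; it always prefers 'maybe'."""
    return [0.1, 0.5, 2.0, 0.3]


def generate(max_steps=4):
    out = []
    for _ in range(max_steps):
        tok = constrained_argmax(fake_model(out), allowed_next(out))
        out.append(tok)
        if VOCAB[tok] == "<eos>":
            break
    return [VOCAB[t] for t in out]


if __name__ == "__main__":
    print(generate())
```

Left unconstrained, the fake model would emit "maybe"; with the mask it is forced to choose between "yes" and "no". This needs access to the raw logits at every step, which is why it generally isn't possible against a hosted API endpoint.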

biddit commented on Clawdbot Renames to Moltbot github.com/moltbot/moltbo... · Posted by u/philip1209

saberience · 2 months ago

It’s vibe coded slop that could be made by anyone with Claude Code and a spare weekend.

It didn’t require any skill, it’s all written by Claude. I’m not sure why you’re trying to hype up this guy. If he didn’t have Claude he couldn’t have made this, just like non-engineers all over the world are coding all kinds of shit right now.

biddit · 2 months ago

I’ve been following Peter and his projects for 7-8 months now, and you fundamentally mischaracterize him.

Peter was a successful developer prior to this and an incredibly nice guy to boot, so I feel the need to defend him from anonymous hate like this.

What is particularly impressive about Peter is his throughput of publishing *usable utility software*. Over the last year he’s released a couple dozen projects, many of which have seen moderate adoption.

I don’t use the bot, but I do use several of his tools and have also contributed to them.

There is a place in this world for both serious, well-crafted software and lower-stakes slop. You don’t have to love the slop, but you would do well to understand that there are people optimizing these pipelines and they will continue to get better.

biddit commented on Clawdbot Renames to Moltbot github.com/moltbot/moltbo... · Posted by u/philip1209

manmal · 2 months ago

- Peter has spent the last year building up a large assortment of CLIs to integrate with. He’s also a VERY good iOS and macOS engineer, so he single-handedly gave clawd capabilities like controlling macOS and writing iMessages.

- Leaning heavily on the SOUL.md makes the agents way funnier to interact with. Early clawdbot had me laugh to tears a couple times, with its self-deprecating humor and threatening to play Nickelback on Peter’s sound system.

- Molt is using pi under the hood, which is superior to using CC SDK

- Peter’s ability to multitask surpasses anything I’ve ever seen (I know him personally), and he’s also super well connected.

Check out pi BTW, it’s my daily driver and is now capable of writing its own extensions. I wrote a git branch stack visualizer _for_ pi, _in_ pi in like 5 minutes. It’s uncanny.

biddit · 2 months ago

Yes!

pi is the best-architected harness available. You can do anything with it.

The creator, Mario, is a voice of reason in the codegen field too.

https://shittycodingagent.ai/

https://mariozechner.at/posts/2025-11-30-pi-coding-agent/