Readit News logoReadit News
boratanrikulu commented on Launch HN: Codebuff (YC F24) – CLI tool that writes code for you    · Posted by u/jahooma
jahooma · 10 months ago
It's mainly from experience. From when I set it up I didn't have the feature to ask whether to run commands. It has been rawdogging commands this whole time and has never been a problem for me.

I think we have many other users who are similar. To be fair, sometimes after watching it install packages with npm, people are surprised and say that they would have preferred that it asked. But usually this is just the initial reaction. I'm pretty confident this is the way forward.

boratanrikulu · 10 months ago
Do you have any sandbox-like restrictions in place to ensure that commands are limited to only touching the project folder not any other places in the system?
boratanrikulu commented on Launch HN: Codebuff (YC F24) – CLI tool that writes code for you    · Posted by u/jahooma
jahooma · 10 months ago
Yes, this is a good point. I think not asking to run commands is maybe the most controversial choice we've made so far.

The reason we don't ask for human review is simply: we've found that it works fine to not ask.

We've had a few hundred users so far and usually people are skeptical of this at first, but as they use it they find that they don't want it to ask for every command. It enables cool use cases where Codebuff and iterate by running tests, seeing the error, attempting a fix, and running them again.

If you use source control like git, I also think that it's very hard for things to go wrong. Even if it ran rm -rf from your project directory, you should be able to undo that.

But here's the other thing: it won't do that. Claude is trained to be careful about this stuff and we've further prompted it to be careful.

I think not asking to run commands is the future of coding agents, so I hope you will at least entertain this idea. It's ok if you don't want to trust it, we're not asking you to do anything you are uncomfortable with.

boratanrikulu · 10 months ago
> it won't do that. Claude is trained to be careful about this stuff and we've further prompted it to be careful.

Could you please explain a bit how you are sure about it?

boratanrikulu commented on Launch HN: Codebuff (YC F24) – CLI tool that writes code for you    · Posted by u/jahooma
codenamev · 10 months ago
You are really missing out: https://github.com/e2b-dev/e2b
boratanrikulu · 10 months ago
I don't see any sandbox usage in the demo video.
boratanrikulu commented on Launch HN: Codebuff (YC F24) – CLI tool that writes code for you    · Posted by u/jahooma
boratanrikulu · 10 months ago
Allowing LLMs to execute unrestricted commands without human review is risky and insecure.

u/boratanrikulu

KarmaCake day120February 4, 2019View Original