Readit News logoReadit News
jahooma commented on What I learned building an AI coding agent for a year   jamesgrugett.com/p/what-i... · Posted by u/vinhnx
iFire · 8 months ago
Oh no achivement acquired!

jahooma failed the turing test as a human :'( The human text is seen as ai text.

I don't know what to think about this.

jahooma · 8 months ago
Current AI's could never have such deep thoughts haha
jahooma commented on What I learned building an AI coding agent for a year   jamesgrugett.com/p/what-i... · Posted by u/vinhnx
nunez · 8 months ago
Why do blog post authors decide to use AI art with nonsense text in it? It's a dead giveaway and looks sloppy, IMO. Unless the new normal is that images with gibberish text are fine now.

I immediately stop reading whatever I'm reading when I see this. I'm left to assume that the post I'm reading was also heavily AI-assisted and isn't a true representation of the author's writing ability or their actual thoughts.

I'd much rather see a post full of grammatical errors and stock art from an author with a story to tell that actually put the work in than a grammatically/syntactically milquetoast piece in the style of a million other milquetoast pieces.

TL;DR: "If they can't be bothered to actually write this, then I can't be bothered to read it," is what I think when I see AI slop art.

jahooma · 8 months ago
I wrote every word! Though I did have some suggestions from Opus haha
jahooma commented on Taking on the giants – an AI coding startup on the brink of success   jamesgrugett.com/p/taking... · Posted by u/jahooma
jahooma · a year ago
Hi, just wrote up some reflections & strategy on this moment as a codegen startup founder.

Let me know if you have any questions!

jahooma commented on Claude 3.7 Sonnet and Claude Code   anthropic.com/news/claude... · Posted by u/bakugo
Daniel_Van_Zant · a year ago
Being able to control how many tokens are spent on thinking is a game-changer. I've been building fairly complex, efficient, systems with many LLMs. Despite the advantages, reasoning models have been a no-go due to how variable the cost is, and how hard that makes it to calculate a final per-query cost for the customer. Being able to say "I know this model can always solve this problem in this many thinking tokens" and thus limiting the cost for that component is huge.
jahooma · a year ago
Yup, it's just what we wanted for our coding agent. Codebuff can enter a "Deep thinking" mode and we can tell it to burn a lot of tokens hahaha.
jahooma commented on Show HN: AI that can use Gmail, SMS, Slack, Calendar    · Posted by u/darweenist
jahooma · a year ago
I would try this if it had Android support! Is that planned?

Also curious which integrations people use the most. If it's email, why not just build an email app?

jahooma commented on Ask HN: Code Generation – Infinite Monkey Theorem or SWE of the Future?    · Posted by u/ruiters
jahooma · a year ago
Nice! It's a totally different world with AI tools. Seems like a good strategy to get previously non-technical folks in on it, especially for front end apps.
jahooma commented on Launch HN: Codebuff (YC F24) – CLI tool that writes code for you    · Posted by u/jahooma
marvin-hansen · a year ago
How is this different from Qodo? Why isn’t it mentioned as a competitor?

I’ve hard time figuring out what codebuff brings to the table that hasn’t been done before other than being YC backed. I think to win in this massively competitive and fast moving market, you really have to put forward something significantly better than an expensive cobbled together script replicating OSS solutions…

I know this sounds harsh, but believe me, differentiation makes or breaks you sooner than later. Proper differentiation doesn’t have to be hard, it just needs to answer the question what you offer that I can’t get anywhere else at a similar price point. Right now, your offer is more expensive for basically something I get elsewhere better for 1/5 the price… I’m seriously worried whether your venture will be around in one or two years from now without a more convincing value prop.

From my experience of leaning more into full end to end Ai workflows building Rust, it seems that

1) context has clearly won over RAG. There is no way back.

2) workflow is the next obvious evolution and gets you an extra mile

3) adversial GAN training seems a path forward to get from just okay generated code to something close to a home run on the first try

4) generating a style guide based on the entire code base and feeding that style guide together with the task and context into the LLM is your ticket to enterprise customers because no matter how good your stuff might be , if the generated code doesn’t fit the mold you are not part of the conversation. Conversely, if you deliver code in the same style and formatting and it actually works, well, price doesn’t matter much.

5) in terms of marketing to developers, I suggest starting listening to their pain points working with existing Ai tools. I don’t have one single of the problems you try to solve. Im sitting over a massive Rust monorepo and I’ve seen virtually every existing Ai coding assistant failing one way or another. The one I have now works miracles half the time and only fails the other half. That is already a massive improvement compared to everything else I tried over the past four years.

Point is, there is a massive need for coding assistance on complex systems and for CodeBuff to make a dime of a difference, you have to differentiate from what’s out there by starting with the challenges engineers face today.

jahooma · a year ago
Yes, but did you try it? I think Codebuff is by far the easiest to use and may also be more effective in your large codebase than any other comparable tool (i.e. like Cursor composer, Aider, Cline. Not sure about Qodo) because it is better at finding the appropriate files.

Re: style guide. We encourage you to write up `knowledge.md` files which are included in every prompt. You can specify styles or other guidelines to follow in your codebase. One motivating example is we wrote in instructions of how to add an endpoint (edit these three files), and that made it do the right thing when asked to create an endpoint.

jahooma commented on Launch HN: Codebuff (YC F24) – CLI tool that writes code for you    · Posted by u/jahooma
ErikBjare · a year ago
But you said you haven't tried Aider, how can you say it's a "leap in UX"?

My own tool `gptme` lets the agent interactively read/collect context too (as does Anthropic in their latest minimal-harness submission to SWE-bench), it's nothing novel.

jahooma · a year ago
I'm just saying what users of Codebuff have said about us compared to competitors.

You should try Codebuff and see for yourself how it reads files! It's not simply a tool call. We put a lot of work into it.

jahooma commented on Launch HN: Codebuff (YC F24) – CLI tool that writes code for you    · Posted by u/jahooma
Oras · a year ago
In any YC application, they request a list of competitors and why your product is better! Curious about what OP listed as competitors in the application.
jahooma · a year ago
Here's my application: https://manicode.notion.site/Manicode-YC-application-c52f592...

I listed: Cursor, Devin, Codium, Augment, Greptile, Lovable.dev, Aider.chat, mentat.ai, devlo.ai, etc

So I did mention Aider. I was definitely aware that it existed, I just hadn't used it.

u/jahooma

KarmaCake day188October 10, 2013View Original