Background: I’m a startup founder turned investor. I taught myself (bad) PHP in 2000, and picked up Ruby on Rails in 2011. I’d guess 2015 was the last time I wrote a line of Ruby professionally. Last month, I decided to use Windsurf to build a Rails 8 API backend and React front-end app, using OpenAI's realtime API for voice-to-voice responses. Over the last few days, I also used Claude Code and Gemini 2.5 Pro for some of the trickier features. 35,000 LoC later, this is what I built!
The site uses function-calling to navigate the site in realtime as you chat with the voice assistant, which I think is pretty neat.
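For the curious, here's a rough sketch of what that navigation tool could look like. This is illustrative only: the function name, fields, and the exact schema the Realtime API expects are assumptions, not the app's actual code.

```ruby
require "json"

# Hypothetical "navigate_to" tool the voice assistant can call.
# The exact tool-schema shape differs between OpenAI's Chat Completions
# and Realtime APIs, so treat the field names here as illustrative.
NAVIGATE_TOOL = {
  type: "function",
  name: "navigate_to",                 # assumed name, not from the real app
  description: "Navigate the user to a page on the site",
  parameters: {
    type: "object",
    properties: {
      path: { type: "string", description: "Client-side route, e.g. /recipes/r_abc123" }
    },
    required: ["path"]
  }
}.freeze

# When the model emits a navigate_to call, the front end parses the
# arguments and pushes the route; here we just decode and print it.
def handle_function_call(name, arguments_json)
  return unless name == "navigate_to"
  args = JSON.parse(arguments_json)
  puts "assistant navigation -> #{args["path"]}"
end
```

In practice the React client listens for the function-call event, changes the route, and reports the result back to the model so the conversation keeps flowing.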
For the long version, see https://tomblomfield.com/post/778601470234918912/vibecoding-...
I'd love any feedback you have!
Demo video of the voice assistant: https://www.youtube.com/watch?v=kRhVc9D5kcg
Generate and edit new recipes: https://www.youtube.com/watch?v=VwwZF6dHcHg
35 kLOC is quite a bit. I wonder how straightforward and maintainable this app ended up being. That would require taking a look at the sources. While good Rails code tends to be very terse, the frontend may be quite voluminous.
> I believe within a couple of months, when things like log tailing and automated testing and native version control get implemented
This sounds a bit too optimistic, especially around automated testing, but yes, eventually all of this will be there.
> an extremely powerful tool for even non-technical people to write production-quality apps
But why would non-technical people even think in terms of log tailing and version control, any more than they think about the gauge of the wiring in their walls, or the kind of modulation their Wi-Fi devices use? For a really non-technical audience to make good use of such tools, it won't be enough for the AI to be a competent coder. The AI will have to become a competent architect and a competent senior SWE, translating from product-management language to software-development language without even surfacing it when not explicitly asked. It's going to be quite a challenge to make that work, and work about as reliably as a human team.
I have entire codebases of embedded software in C, without the shortcuts of modern programming languages, in way fewer than 35k lines.
I think people will have to recalibrate on this. The extra LOC goes toward features and details that you otherwise would not build, because they are too code- and time-intensive for most projects. It just won't matter anymore.
> But why would non-technical people even think in terms of log tailing and version control
They won't! They won't have to. The obvious good stuff that everyone thinks the AI tool should be able to do will just work, because the people building the tools will, obviously, focus on making it work.
I can't really imagine producing that much code in that short amount of time and holding any of it in my head. I'd bet money there's code in there that does the same thing in two different ways, leading to all kinds of little inconsistencies that make this code worthless in any serious context.
Probably the main value engineers bring to a maintenance project is context. I wonder what happens when we fully cede context to the machines...
Today, I got a request at work for a feature ("let's offer coupons!") that I thought would take a week. That was until I found out that another engineer wrote most of the code last year, and it'd take him a day to dust off.
I'm totally onboard with, and grateful for, larger-scale experiments like this...thanks for putting the effort in. I wonder how well Cursor (or similar) would handle a situation in which large amounts of code are _almost_ being used. What if 3k LOC accidentally get duplicated? Can our automated systems understand that and fix it? Because if they can't, a human is going to spend a _long_ time trying to figure out what happened.
Over the next 18 months, I expect we'll hear a few stories of the LLM accidentally reimplementing an entire feature in a separate code path. It's a whole new class of bugs! :D
I think in the end AI will be a more advanced tool, but a tool nonetheless. Like methodologies, principles, good practices, etc., it only works if you use it with care, added thought, and adaptation to your case. DRY is a great principle, but sometimes it's better to repeat yourself, for one reason or another. And these are the tradeoffs that a human in the loop should be making, imho.
I agree. When I read these articles on vibe coding, I can't help thinking that these guys are basking in the glory of the impressive maze they've built around themselves. Of course, running these things in production and having them reach the state of legacy code is an entirely different matter. Building a maze is one thing; having to run around in it is an entirely different challenge.
It's like one of those world expos: everything looks fantastic, but the moment the event ends everything just crumbles.
The app literally exposes his OpenAI key.
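For what it's worth, the usual fix is to keep the real key server-side and hand the browser a short-lived client secret instead. A minimal sketch of what that could look like in a Rails controller, assuming OpenAI's ephemeral Realtime session endpoint and a standard Rails setup (check the current API docs for the exact endpoint and response shape):

```ruby
require "net/http"
require "json"

# Sketch only: exchanges the server-side API key for a short-lived
# client secret the browser can use, so the real key never ships.
class RealtimeSessionsController < ApplicationController
  def create
    uri = URI("https://api.openai.com/v1/realtime/sessions")
    req = Net::HTTP::Post.new(uri)
    req["Authorization"] = "Bearer #{ENV.fetch("OPENAI_API_KEY")}" # stays on the server
    req["Content-Type"]  = "application/json"
    req.body = { model: "gpt-4o-realtime-preview" }.to_json

    res = Net::HTTP.start(uri.host, uri.port, use_ssl: true) { |http| http.request(req) }

    # Only the ephemeral client_secret is forwarded to the browser.
    render json: JSON.parse(res.body).slice("client_secret"), status: res.code.to_i
  end
end
```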
I believe I'm going to need a new oven...
https://www.recipeninja.ai/recipe/r_ttOB5xyqpOLXCL/gluten-fr...
LLMs are super useful, but currently the primary use case is teaching, not doing. For this reason, I think ChatGPT is really just as good as an AI-enabled editor (or use both if you don't mind paying for two subscriptions).
Also, vibe coding has a parallel-review aspect: while the code is being generated, you are doing live review and steering it in the right direction. So, depending on your experience, the end product can be a bad mess or a wonderful, maintainable piece of work.
The issue with seasoned SWEs is that the moment a mistake (or bad pattern) is made, the baby gets thrown out with the bathwater.
For a tiered app like the one presented, 35k LOC is not really that impressive if you think about it. A generic React-based front end will easily need a large number of lines due to the modular component structure, the various hooks, and tests (which easily make up 25-40% of the LOC). A business layer will also have many layers of abstraction and numerous implementations to move data between layers.
Vibe coding shines when you let it build one block at a time, limit the scope well, and focus. Also, 2-3 weeks is a lot of time to write 35k LOC: at the start of any new project the LOC generation rate is very high, but in the maintenance phase it falls significantly as smaller changes become more common.
I'm just being honest. For my use case, I would be much better off if LLMs could just do everything.
Lots of apps are quite repetitive: for building APIs, for example, you generate one controller and then ask the tool to generate more using the first one as a pattern. For the frontend you do the same for forms or lists.
Tests are often quite good, but I think they were already great even back in the first ChatGPT release.
With this strategy, and the fact that some patterns are quite verbose (albeit understandable for an AI or a reader), it is quite easy to get to a big LoC count while still maintaining consistency.
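To make the "one controller as a pattern" point concrete, the kind of Rails API controller you'd write once and then ask the tool to clone for other resources might look roughly like this (the resource and field names are made up for illustration, not taken from the app):

```ruby
# Illustrative "pattern" controller that an LLM can copy for other resources.
class Api::RecipesController < ApplicationController
  before_action :set_recipe, only: %i[show update destroy]

  def index
    render json: Recipe.order(created_at: :desc).limit(50)
  end

  def show
    render json: @recipe
  end

  def create
    recipe = Recipe.new(recipe_params)
    if recipe.save
      render json: recipe, status: :created
    else
      render json: { errors: recipe.errors.full_messages }, status: :unprocessable_entity
    end
  end

  def update
    if @recipe.update(recipe_params)
      render json: @recipe
    else
      render json: { errors: @recipe.errors.full_messages }, status: :unprocessable_entity
    end
  end

  def destroy
    @recipe.destroy
    head :no_content
  end

  private

  def set_recipe
    @recipe = Recipe.find(params[:id])
  end

  def recipe_params
    params.require(:recipe).permit(:title, :instructions)
  end
end
```

Multiply that by a dozen resources, plus matching specs and the corresponding React forms and lists, and you reach tens of thousands of lines without anything architecturally interesting happening.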
For code? Autocomplete on steroids is the killer-app.
The other things the LLMs give me tend to be over-engineered or overly verbose code, or similar.
I went through a lot of "Why are you also doing $FOO then $BAR? Doesn't seem necessary if we skip them and do $BAZ, which will make one or both of those redundant", with it responding "You're right! Let's use $BAZ instead".
And giving them code to make a small change to was pointless - they would often, but not always, make an incidental change far from the point where you asked for the change.
But autocomplete? That works just great and because I've already got context of the code I am writing I can check it in (at most) two seconds and move on.
Depending on the situation this can be invaluable. If you're experienced in the domain you probably know, broadly, what you need to do, but you might get a better result by reasoning through the best solution given the constraints and requirements you have. Or maybe you catch something obvious you'd missed when you write out the full context, which is a required step for getting a good output from the chatbot, and generally isn't a step you take when you aren't explaining your approach to someone else and don't want to be rigorous.
After seeing how people like Andrej Karpathy used vibe coding to generate applications https://x.com/karpathy/status/1903671737780498883?s=61 I realized that
you need to be clear about what you want the LLM to do: break the work down and give the LLM bite-sized tasks that each do one specific thing. Sometimes I had to tell it not to go and change random files just because it felt the need to refactor them.
> I struggle to find much utility in terms of actually writing code.
I personally feel you need to give up some control and just let the LLM do its thing if you want to use it to help you build. It honestly does a lot of things in a more verbose way and I've come to the conclusion that it is an LLM writing code for another LLM. As long as I can debug it, I'm okay with the code, as I can develop at a pace that is truly unreal.
I finished my "Recent" contexts feature in half a day today. Without the LLM, this would have taken me a week, I think. I would say 98% of my code in the past few months has been AI-generated. You can see a real-life workflow here:
https://app.gitsense.com/?chat=eece40e2-6064-46d2-9bf1-d868c...
I truly believe that if you provide an LLM with the right context, it can meet your functional specs 90% of the time. Note the emphasis on functional and not necessarily style. And if *YOU* architect your code properly, it should be 100% maintainable.
I do want to make it clear that what I am doing right now is not novel, but I believe most problems are not. If the problem is not well understood, it can be a challenge, like my chat bridge feature. That feature lets you import Git repos for chatting, but I will probably need to rewrite 50% of the LLM-generated code since the solution it built is not scalable.
Do you come across issues like this too or am I not prompting it correctly?
Does that mean it uses this expensive OpenAI audio model in the app? Aren't you worried this will bankrupt you if the app goes viral and isn't monetised?
Can you share your strategy here? Is it something like topping up a $2,000 OpenAI account as a kind of marketing expense so users can try it for free? Genuine question, since I'm planning to use the OpenAI audio API for another project, and this kind of pricing worries me a lot, even with a switch to the new mini transcribe and mini TTS models.
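For what it's worth, this is the back-of-envelope math I'd do before shipping something like this. The per-minute rates below are placeholders, not OpenAI's actual prices; plug in the current numbers from their pricing page:

```ruby
# Rough per-session cost estimate for realtime voice.
# NOTE: these rates are made-up placeholders for illustration only;
# substitute the current audio input/output prices before relying on this.
AUDIO_IN_PER_MIN  = 0.06  # USD per minute of user audio (placeholder)
AUDIO_OUT_PER_MIN = 0.24  # USD per minute of assistant audio (placeholder)

def session_cost(user_minutes:, assistant_minutes:)
  user_minutes * AUDIO_IN_PER_MIN + assistant_minutes * AUDIO_OUT_PER_MIN
end

# e.g. a 5-minute chat where the assistant talks for 2 of those minutes:
per_session = session_cost(user_minutes: 5, assistant_minutes: 2) # => 0.78
puts "per session:     $%.2f" % per_session
puts "1,000 users/day: $%.2f" % (per_session * 1_000) # adds up fast if it goes viral
```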
On a more serious note: I've found that for debugging difficult issues, o1 Pro is in a league of its own.
Claude Code's eagerness to do work will often fix things given enough time, especially for self-contained pieces of software, but I still find myself going to o1 Pro more often than I'd expect.
A coworker and I did a comparison the other day, where we fired up o1 Pro and Claude Code with the same refactor. o1 Pro one-shotted it, while Claude Code took a few iterations.
Interestingly enough, the _thinking_ time of o1 Pro led us to just commit the Claude Code changes, as they were both finished in around the same time (1 min 37s vs. 2+ minutes), however we did end up using some feedback from o1 to fix an issue Claude hadn't caught. YMMV
Or, was this mostly just an exercise in engineering/testing AI?
A second, minor problem with your website is that the images illustrating the recipes are AI-generated and of poor quality.
You can't solve those issues by throwing more AI at them... well, maybe the second one you can (AI images from later models are generally OK).