drusepth (u/drusepth)

drusepth commented on Grok Code Fast 1 x.ai/news/grok-code-fast-... · Posted by u/Terretta

Nuzzerino · a day ago

The article you linked talks about the voice personality prompt for "unhinged mode", which is an entertainment mode. It has nothing to do with the code writing model.

drusepth · a day ago

It's a comment about the company/brand behind the models, not the individual models themselves.

drusepth commented on Claude Sonnet will ship in Xcode developer.apple.com/docum... · Posted by u/zora_goron

mirkodrummer · 2 days ago

That is funny for how much is wrong. Ask the LLMs to vibe code a text editor and you'll get a React app using Supabase. Engineering !== Token prediction

drusepth · a day ago

I think this comment exposes an important point to make: people have different opinions of what "vibe coding" even means. If I were to ask an LLM to vibe code a text editor, I guarantee you I wouldn't get a React app using Supabase -- because I'd give it pages of requirements documentation and tell it not only what I want, but the important decisions on how to make it.

Obviously no model is going to one-shot something like a full text editor, but there's an ocean of difference between defining vibe coding as prompting "Make me a text editor" versus spending days/weeks going back and forth on architecture and implementation with a model while it's implementing things bottom-up.

Both seem like common definitions of the term, but only one of them will _actually_ work here.

drusepth commented on Claude Sonnet will ship in Xcode developer.apple.com/docum... · Posted by u/zora_goron

ZYbCRq22HbJ2y7 · 2 days ago

Do you really think so? Have you ever explored the source of something like:

https://github.com/JetBrains/intellij-community

drusepth · a day ago

I've worked in 3 different WYSIWYG editors for web and desktop applications over the years, lightly contributed to a handful of other open-source editors, and spent plenty of time building my own personal editors from scratch (and am currently using gpt-5 to fix my own human bugs in a rewrite of the Notebook.ai text editor that I re-re-implemented ~8 years ago).

Editors are incredibly complex and require domain knowledge to guide agents toward the correct architecture and implementation (and away from the usual naive pitfalls), but in my experience the latest models reason about and implement features/changes just fine.

drusepth commented on Grok Code Fast 1 x.ai/news/grok-code-fast-... · Posted by u/Terretta

drusepth · a day ago

Definitely fast, but initial use puts its quality at or below gpt-5-nano's. This might be a low-cost option for people who don't mind babysitting the output (or who work on very small projects), but claude/gpt-5/gemini all seem to deliver significantly higher quality for marginally more cost/time.

Since they emphasize only speed here, I wonder whether their workflows revolve more around the vibe practice of generating N solutions to a problem in parallel and selecting the "best". If so, it might still win out on speed (if it can reliably produce at least one higher-quality output, which remains to be seen), but it would also quickly lose any cost-margin benefit.
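The generate-N-and-pick-the-best workflow mentioned above can be pictured with a minimal sketch. Everything here is an assumption for illustration: `fake_generate` and `fake_score` are hypothetical stand-ins for a model call and a quality check, not any vendor's API. The point is only that parallel runs keep wall-clock time close to a single run while spend grows linearly with N.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Tuple


def best_of_n(
    generate: Callable[[int], str],
    score: Callable[[str], float],
    n: int,
) -> Tuple[str, List[str]]:
    """Run `generate` n times in parallel and return the top-scoring candidate."""
    with ThreadPoolExecutor(max_workers=n) as pool:
        # pool.map preserves input order, so candidates[i] came from seed i.
        candidates = list(pool.map(generate, range(n)))
    return max(candidates, key=score), candidates


def total_cost(cost_per_run: float, n: int) -> float:
    """Parallelism hides latency, not spend: cost scales linearly with n."""
    return cost_per_run * n


# Hypothetical stand-ins (assumptions, not a real model or evaluator).
def fake_generate(seed: int) -> str:
    return "x" * (seed + 1)


def fake_score(candidate: str) -> float:
    return float(len(candidate))


if __name__ == "__main__":
    best, candidates = best_of_n(fake_generate, fake_score, n=4)
    print(best, len(candidates), total_cost(0.5, 4))
```

In practice the scoring step (tests, a linter, a reviewer model, or a human) is the hard part; the sketch only shows why a cheaper per-run price can be eaten up once N runs are paid for.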

drusepth commented on Claude Sonnet will ship in Xcode developer.apple.com/docum... · Posted by u/zora_goron

throwawa14223 · 2 days ago

It's getting harder to find IDEs that properly boycott LLMs.

drusepth · 2 days ago

Ironically, you could probably vibe code your own.

drusepth commented on AI adoption linked to 13% decline in jobs for young U.S. workers: study cnbc.com/2025/08/28/gener... · Posted by u/pseudolus

londons_explore · 2 days ago

> Audit quality will continue to suffer

I wonder how much this actually matters? I understand that for an auditor, having a quality reputation matters. But if all audits from all firms are bad, how much would the world economy suffer?

Likewise for the legal profession, if all judges made twice the number of mistakes, how much would the world suffer?

drusepth · 2 days ago

> Likewise for the legal profession, if all judges made twice the number of mistakes, how much would the world suffer?

Is this hyperbole? The real question being asked here seems to be "would the world be worse off without deterministic checks and balances", and I think most people would agree that it would, no?

drusepth commented on AWS CEO says using AI to replace junior staff is 'Dumbest thing I've ever heard' theregister.com/2025/08/2... · Posted by u/JustExAWS

systemf_omega · 9 days ago

> B2B SaaS

Perhaps that's part of it.

People here work in all kinds of industries. Some of us are implementing JIT compilers, mission-critical embedded systems or distributed databases. In code bases like this you can't just wing it without breaking a million things, so LLM agents tend to perform really poorly.

drusepth · 9 days ago

On the other hand, fault-intolerant codebases are also often highly defined and almost always have rigorous automated tests already, which are two contexts in which coding agents specifically excel.

drusepth commented on Claude Sonnet 4 now supports 1M tokens of context anthropic.com/news/1m-con... · Posted by u/adocomplete

jpc0 · 18 days ago

> These tools are creating a huge amount of value...

> The people doing so don’t have a lot of time to comment about it on HN since we’re busy building…

“We’re so much more productive that we don’t have time to tell you how much more productive we are”

Do you see how that sounds?

drusepth · 18 days ago

I feel this, honestly. I get so much more work done (currently: building & shipping games, maintaining websites, managing APIs, releasing several mobile apps, and developing native desktop applications) managing 5x claude instances that the majority of my time is sucked up by just prompting whichever agent has finished and is ready for its next task(s), and there's a real feeling of lost productivity if any agent is left idle for too long.

The only time to browse HN left is when all the agents are comfortably spinning away.

drusepth commented on Claude Opus 4.1 anthropic.com/news/claude... · Posted by u/meetpateltech

generalizations · 25 days ago

Get a subscription and use claude code - that's how you get actual reasonable economics out of it. I use claude code all day on the max subscription and maybe twice in the last two weeks have I actually hit usage limits.

drusepth · 24 days ago

Is there any documentation on what the max sub usage limit is? A coworker tried it and was booted off Opus within just a couple of hours due to "high usage". I haven't made the jump, since I expect my $3k/mo of API usage would instantly blow past the limits of a $200/mo sub and I'd just be back on the API again. But if a sub could carve out $1k-2k of costs in exchange for a little time spent managing sub(s), it might be worth it.
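The break-even arithmetic in that comment can be written out as a rough sketch. The $3k/mo API spend, $200/mo subscription price, and $1k-2k displaced range are the figures from the comment; how much API usage a subscription would actually displace before hitting its limits is unknown, so `displaced` is an assumption.

```python
def net_monthly_saving(displaced_api_spend: float, subscription_price: float) -> float:
    """Saving = API spend the subscription absorbs, minus what the subscription costs."""
    return displaced_api_spend - subscription_price


API_SPEND = 3000.0   # current monthly API bill, from the comment
SUB_PRICE = 200.0    # monthly subscription price, from the comment

# Assumed range of API spend the subscription might absorb before its limits hit.
for displaced in (1000.0, 2000.0):
    saving = net_monthly_saving(displaced, SUB_PRICE)
    new_total = API_SPEND - displaced + SUB_PRICE
    print(f"displaced=${displaced:.0f} saving=${saving:.0f} new_total=${new_total:.0f}")
```

Under those assumptions the net saving is $800 to $1,800 per month, before counting the time spent managing the subscription(s).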

u/drusepth

Karma: 3713 · Cake day: June 10, 2011

About

Chroma Golem co-founder (2023-Present); Indent Labs founder and maker of Notebook.ai (2016-Present); I also run a few other niche sites for authors and/or game devs.

Feel free to contact me about any of the above at andrew@indentlabs.com (writing) or dru@chromagolem.com (gaming).