Not only did you build a minimal agent, you built the framework around it so anyone can build their own. I'm using Pi in the terminal, but I see you have web components. Any tips on creating a "Chat mode" where the messages are rendered as chat bubbles? It would be easier to use on mobile.
Memory comparison of AI coding CLIs (single session, idle):

- Codex: ~15 MB
- Claude Code: 360+ MB resident, 746 MB peak
That's a 24x to 50x difference for tools that do the same thing: send text to an API. vmmap shows Claude Code reserving 32.8 GB of virtual memory just for the V8 heap, with 45% malloc fragmentation and a peak footprint of 746 MB that never gets released: a classic leak pattern.
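If you want to check this on your own machine, `vmmap <pid>` on macOS gives the virtual-memory breakdown; for watching the resident footprint over time, here's a minimal psutil sketch (the polling loop and threshold choices are my illustration, not the author's tooling):

```python
import sys
import time

import psutil  # pip install psutil


def watch_rss(pid: int, interval: float = 5.0) -> None:
    """Poll a process's resident set size. A peak that never comes
    back down after the session goes idle suggests a leak."""
    proc = psutil.Process(pid)
    peak = 0
    while proc.is_running():
        rss = proc.memory_info().rss
        peak = max(peak, rss)
        print(f"rss={rss / 2**20:7.1f} MB  peak={peak / 2**20:7.1f} MB")
        time.sleep(interval)


if __name__ == "__main__":
    # Pass the PID of an idle CLI session, e.g. found via `pgrep -f claude`.
    watch_rss(int(sys.argv[1]))
```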
On my 16 GB Mac, a "normal" workload (2 Claude sessions + browser + terminal) pushes me into 9.5 GB swap within hours. My laptop genuinely runs slower with Claude Code than when I'm running local LLMs.
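For the swap numbers, psutil exposes the same counters macOS reports via `sysctl vm.swapusage`; a quick check (a sketch, not how the figure above was measured):

```python
import psutil

# System-wide swap usage, matching what macOS reports.
swap = psutil.swap_memory()
print(f"swap used: {swap.used / 2**30:.1f} GB of {swap.total / 2**30:.1f} GB "
      f"({swap.percent:.0f}%)")
```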
I get that shipping fast matters, but building a CLI with React and a full Node.js runtime is an architectural choice with consequences. Codex proves this can be done in 15 MB. Every Claude Code session costs me 360+ MB, and with MCP servers spawning per session, the footprint multiplies fast.
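The multiplication is easy to ballpark. A back-of-the-envelope sketch: only the 360 MB base and the two-session workload come from the text above; the per-MCP-server cost and server count are assumptions I made up for illustration:

```python
# Rough session footprint. Only BASE_MB and SESSIONS come from the comment;
# MCP_MB and MCP_SERVERS are assumed values, not measurements.
BASE_MB = 360       # observed per Claude Code session
SESSIONS = 2        # the "normal workload" above
MCP_MB = 100        # assumed cost per MCP server process
MCP_SERVERS = 3     # hypothetical count per session

total = SESSIONS * (BASE_MB + MCP_SERVERS * MCP_MB)
print(f"{total} MB")  # 2 * (360 + 300) = 1320 MB before the browser starts
```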