sagarpatil (u/sagarpatil)

sagarpatil commented on GLM 4.5 with Claude Code docs.z.ai/guides/llm/glm-... · Posted by u/vincirufus

sagarpatil · 4 months ago

I was blown away by this model. It was definitely comparable to sonnet 4. In some of my tests, it performed as good as Opus. I subscribed to their paid plan, and now the model seems dumb? I asked it to find and replace a string. It only made the change in one file. Codex worked fine. Can Z.ai confirm if this is the model we get through their API or is it quantized for Claude Code use?

sagarpatil commented on GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models [pdf] arxiv.org/pdf/2508.06471... · Posted by u/SerCe

sagarpatil · 4 months ago

I’ve been using it and I think it’s on par with sonnet.

sagarpatil commented on GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models [pdf] arxiv.org/pdf/2508.06471... · Posted by u/SerCe

nico · 4 months ago

How are you using glm-4.5? Are you consuming the api or running something like glm-4.5 air locally?

sagarpatil · 4 months ago

Not OP. Chutes.ai charges $0.20 per 1M tokens. I don’t think it uses caching though because I ended up burning $30 in an hour or two. I had to move back to Claude Code.

sagarpatil commented on Ask HN: What trick of the trade took you too long to learn? · Posted by u/unsupp0rted

sagarpatil · 5 months ago

Claude Code recently showcased how powerful it can be when you don’t have to memorize commands. My AI agent works similarly. It finds the right CLI commands instead of relying on Playwright or an MCP server to perform tasks. What’s interesting is that even the agent doesn’t know many commands upfront; it simply uses the help option to discover what’s available.

sagarpatil commented on OpenAI’s Windsurf deal is off, and Windsurf’s CEO is going to Google theverge.com/openai/70599... · Posted by u/rcchen

extr · 5 months ago

IMO other than the Microsoft IP issue, I think the biggest thing that has shifted since this acquisition was first in the works is Claude Code has absolutely exploded. Forking an IDE and all the expense that comes with that feels like a waste of effort, considering the number of free/open source CLI agentic tools that are out there.

Let's review the current state of things:

- Terminal CLI agents are several orders of magnitude less $$$ to develop than forking an entire IDE.

- CC is dead simple to onboard (use whatever IDE you're using now, with a simple extension for some UX improvements).

- Anthropic is free to aggressively undercut their own API margins (and middlemen like Cursor) in exchange for more predictable subscription revenue + training data access.

What does Cursor/Windsurf offer over VS Code + CC?

- Tab completion model (Cursor's remaining moat)

- Some UI niceties like "add selection to chat", and etc.

Personally I think this is a harbinger of where things are going. Cursor was fastest to $900M ARR and IMO will be fastest back down again.

sagarpatil · 5 months ago

I strongly agree with you. I’m more of a CLI guy, and Claude Code just works. Most good projects have a CLI anyway (gcloud, GitHub CLI, Vercel, etc.). I prefer CLI vs MCP’s. I’m on the $200 plan, and it’s absolutely worth it (never thought I’d say this for a CLI app).

sagarpatil commented on Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model twitter.com/Kimi_Moonshot... · Posted by u/c4pt0r

sagarpatil · 5 months ago

All the AI models are no using em-dashes. ChatGPT keeps using them even after explicitly told not to. Anybody know what’s up with these models?

sagarpatil commented on Gemini CLI blog.google/technology/de... · Posted by u/sync

AJ007 · 6 months ago

Current best practice for Claude Code is to have heavy lifting done by Gemini Pro 2.5 or o3/o3pro. There are ways to do this pretty seamlessly now because of MCP support (see Repo Prompt as an example.) Sometimes you can also just use Claude but it requires iterations of planning, integration while logging everything, then repeat.

I haven't looked at this Gemini CLI thing yet, but if its open source it seems like any model can be plugged in here?

I can see a pathway where LLMs are commodities. Every big tech company right now both wants their LLM to be the winner and the others to die, but they also really, really would prefer a commodity world to one where a competitor is the winner.

If the future use looks more like CLI agents, I'm not sure how some fancy UI wrapper is going to result in a winner take all. OpenAI is winning right now with user count by pure brand name with ChatGPT, but ChatGPT clearly is an inferior UI for real work.

sagarpatil · 6 months ago

You might want to give this a try: https://github.com/opencode-ai/opencode

sagarpatil commented on Remote MCP Support in Claude Code anthropic.com/news/claude... · Posted by u/surprisetalk

yewenjie · 6 months ago

Does anybody know of a cross-platform LLM-frontend with sync that is also open-source? I am currently using the web version of LobeChat on macOS and Android, but it's quite slow and has some features missing.

sagarpatil · 6 months ago

https://chorus.sh/ has a BYOK version Openwebui

sagarpatil commented on Show HN: Claude Code Usage Monitor – real-time tracker to dodge usage cut-offs github.com/Maciek-roboblo... · Posted by u/Maciej-roboblog

joshmlewis · 6 months ago

For a reference point, it says my max session limit in the past was ~337,492 tokens and I have the Max20 plan and 99% use Opus.

My total tokens used since I started using Claude Code on May 27th was 1,374,439,311 worth around $3397.34.

sagarpatil · 6 months ago

1. Don’t you hit rate limits on Opus? Don’t you find it slow compared to sonnet?

sagarpatil commented on Show HN: EnrichMCP – A Python ORM for Agents github.com/featureform/en... · Posted by u/bloppe

sagarpatil · 6 months ago

Interesting…