Let's see: we have Claude Code vs. Claude the API vs. Claude the website, and they're totally different from each other? One is command line, one integrates into your IDE (which IDE?) and one is just browser based, I guess. Then you have the different pricing plans, Free, Pro, and Max? But then there's also Claude Team and Claude Enterprise? These are monthly plans that only work with Claude the Website, but Claude Code is per-request? Or is it Claude API that's per-request? I have no idea. Then you have the models: Claude Opus and Claude Sonnet, with various version numbers for each?? Then there's Cline and Cursor and GOOD GRIEF! I just want to putz around with something in VSCode for a few hours!
> There's an obvious question looming here — if the models got so confused, how did they consistently pass the reconciliation checks we described above? It may seem like the ability to make forward progress is a good proxy for task understanding and skill, but this isn't necessarily the case. There are ways to hack the validation check – inventing false transactions or pulling in unrelated ones to make the numbers add up.
This is hilarious. I wonder if someone is unintentionally committing fraud by blindly trusting LLMs with accounting. Or even worse, I bet that some governments are already trying to use LLMs to make accounting validators. My government sure wants to shove LLMs into digital government services.
I get that, but what are the particular examples of such jurisdictions? For example, when I run the linter that fixes my code formatting, no one will think that I did not create it. What about autogenerated code? Is it not copyright-protected?
I am using Claude Code almost exclusively. I am using the Claude Pro subscription and it allows Claude Code usage, with limits on the number of prompts per 5 hours, according to their site. I have not hit these limits yet even though I use this full-time, daily.
With other tools, do I have to pay API based costs or are there ways to use my subscription? As I see it, the API costs add up quickly. That means we can be stuck with a few tools from the top tier model companies.
This is untrue; many tried. Almost every major publisher has its own launcher. The problem with them all is they absolutely suck. Even Epic Games Store, the biggest competitor with the most money poured into it, is ridiculously bad in almost every way. Aside from the lack of network effect, it just misses most of the QOL features, is slow, ugly, and very unpleasant to use. Almost universal agreement in the PC gaming community is that EGS is a bootloader for free games that it throws at the user. Every time I use EGS, I am constantly amazed by how bad it is, despite probably tens of millions of investments.
The second point that the article completely misinterprets is Microsoft's role. Microsoft (Xbox specifically) is hands down the closest company to beating Steam in its own game. PC game pass provides a constant stream of very good games available on day one for dirt cheap. The work that Microsoft is doing on optimizing Windows for games in general and for handheld consoles in particular is very promising (see Xbox Ally X). This is the threat that Valve faces. Not just a better store, but the absence of a store and "buying games" in general. For example, I intended to buy The Outer Worlds 2 on launch. Now that I know it will be available day one on Game Pass, there is almost zero chance that I will buy it on Steam or anywhere else.