jacurtis (u/jacurtis)

jacurtis commented on Claude Code on the web anthropic.com/news/claude... · Posted by u/adocomplete

the_duke · 2 months ago

IMO gpt5-codex medium is much better as soon as the task becomes slightly complex, or the context grows a bit.

Sora 4.5 tends to randomly hallucinate odd/inappropriate decisions and goes to make stupid changes that have to be patched up manually.

jacurtis · 2 months ago

Yes Sora hallucinates significantly more than Claude.

I find that Codex generally requires me to remove code to get to what I want, whereas Claude I tend to use what it gives me and I add to it. Whether this is from additional prompting or from manual typing, i just find that codex requires removal to get to desired state, and Claude requires adding to get to desired state. I prefer adding incrementally than removing.

jacurtis commented on Claude Code on the web anthropic.com/news/claude... · Posted by u/adocomplete

swah · 2 months ago

I haven't been able to get anything done with Codex. Claude Code is fast and "gets it". Also does better at running and testing its own stuff.

Its very odd because I was hoping they were very on par.

jacurtis · 2 months ago

The last time I used them both side by side was a month ago, so unless its significantly improved in the past month, I am genuinely surprised that someone is making the argument that Codex is competitive with ClaudeCode, let alone it somehow being superior.

ClaudeCode is used by me almost daily, and it continues to blow me away. I don't use Codex often because every time I have used it, the output is next to worthless and generally invalid. Even if it does get me what I eventually want, it will take much more prompting for me to get the functioning result. ClaudeCode on the other hand gets me good code from the initial prompt. I'm continually surprised at exactly how little prompting it requires. I have given it challenges with very vague prompts where it really exceeds my expectations.

jacurtis commented on Claude Code on the web anthropic.com/news/claude... · Posted by u/adocomplete

jswny · 2 months ago

I find Codex CLI to be very good too, but it’s missing tons of features that I use in Claude Code daily that keep me from switching full time.

- Good bash command permission system

- Rollbacks coupled with conversation and code

- Easy switching between approval modes (Claude had a keybind that makes this easy)

- Ability to send messages while it’s working (Codex just queues them up for after it’s done, Claude injects them into the current task)

- Codex is very frustrating when I have to keep allowing it to run the same commands over and over, Claude this works well when I approve it to run a command for the session

- Agents (these are very useful for controlling context)

- A real plan mode (crucial)

- Skills (these are basically just lazy loaded context and are amazing)

- The sandboxing in codex is so confusing, commands fail all the time because they try to log to some system directory or use internet access which is blocked by default and hard to figure out

- Codex prefers python snippets to bash commands which is very hard to permission and audit

When Codex gets to feature parity, I’ll seriously look at switching, but until then it’s just a really good model wrapped in an okay harness

jacurtis · 2 months ago

Yeah I think the argument is the tooling vs agent. Maybe the OpenAI agent is performing better now, but the tooling is significantly better from anthropic.

The anthropic (ClaudeCode) tooling is best-in-class to me. You listed many features that I have become so reliant on now, that I consider them the Ante that other competitors need to even be considered.

I have been very impressed with the Anthropic agent for code generation and review. I have found the OpenAI agent to be significantly lacking by comparison. But to be fair, the last time I used OpenAI's agent for code was about a month ago, so maybe it has improved recently (not at all unreasonable in this space). But at least a month ago when using them side-by-side the codex CLI was VERY basic compared to the wealth of features and UI in the ClaudeCode CLI. The agents for Claude were also so much better than OpenAI, that it wasn't even close. OpenAI has always delivered me improper code (non-working or invalid) at a very high rate, whereas Claude is generally valid code, the debate is just whether it is the desired way to build something.

jacurtis commented on Claude Sonnet will ship in Xcode developer.apple.com/docum... · Posted by u/zora_goron

Archonical · 4 months ago

This is great. I've been using Xcode with a separate terminal to run Claude Code, which has been a painful setup.

jacurtis · 4 months ago

Agreed. Claude Code is an amazing experience with Jetbrains IDEs, but for some reason Xcode just hates having claude directly edit the files.

jacurtis commented on Are OpenAI and Anthropic losing money on inference? martinalderson.com/posts/... · Posted by u/martinald

dcre · 4 months ago

Good point, very possible that Altman is excluding free tier as a marketing cost even if it loses more than they make on paid customers. On the other hand they may be able to cut free tier costs a lot by having the model router send queries to gpt-5-mini where before they were going to 4o.

jacurtis · 4 months ago

This is very true. ChatGPT has a very generous free tier. I used to pay for it, but realized I was never really hitting the limits of what is needed to pay for it.

However, at the same time, I was using Claude much less, really preferring the answers from it most of the time, and constantly being hit with limits. So guess what I did. I cancelled my OpenAI subscription and moved to Anthropic. Not only do i get Claude Code, which OpenAI really has no serious competitor for.

I still use both models but never run into problems with OpenAI, so i see no reason to pay for it.

jacurtis commented on Are OpenAI and Anthropic losing money on inference? martinalderson.com/posts/... · Posted by u/martinald

BlindEyeHalo · 4 months ago

Why wouldn't you factor in training? It is not like you can train once and then have the model run for years. You need to constantly improve to keep up with the competition. The lifespan of a model is just a few months at this point.

jacurtis · 4 months ago

In a recent episode of Hard Fork podcast, the hosts discussed an on-the-record conversation they had with Sam Altman from OpenAI. They asked him about profitability and he claimed that they are losing money mostly because of the cost of training. But as the model advances, they will train less and less. Once you take training out of the equation he claimed they were profitable based on the cost of serving the trained foundation models to users at current prices.

Now, when he said that, his CFO corrected him and said they aren't profitable, but said "it's close".

Take that with a grain of salt, but thats a conversation from one of the big AI companies that is only a few weeks old. I suspect that it is pretty accurate that pricing is currently reasonable if you ignore training. But training is very expensive and the reason most AI companies are losing money right now.

jacurtis commented on Perplexity is using stealth, undeclared crawlers to evade no-crawl directives blog.cloudflare.com/perpl... · Posted by u/rrampage

EGreg · 4 months ago

1. I actually disagree. I think teasers should be free but websites should charge micropayments for their content. Here is how it can be done seamlessly, without individuals making decisions to pay every minute: https://qbix.com/ecosystem

2. This also intersects with copyright law. Ingesting content to your servers en masse through automation and transforming it there is not the same as giving people a tool (like Safari Reader) they can run on their client for specific sites they visit. Examples of companies that lost court cases about this:

  Aereo, Inc. v. American Broadcasting Companies (2014)
  TVEyes, Inc. v. Fox News Network, LLC (2018)
  UMG Recordings, Inc. v. MP3.com, Inc. (2000)
  Capitol Records, LLC v. ReDigi Inc. (2018)
  Cartoon Network v. CSC Holdings (Cablevision) (2008)
  Image Search Engines: Perfect 10 v. Google (2007)

That last one is very instructive. Caching thumbnails and previews may be OK. The rest is not. AMP is in a copyright grey area, because publishers choose to make their content available for AMP companies to redisplay. (@tptacek may have more on this)

3. Putting copyright law aside, that's the point. Decentralization vs Centralization. If a bunch of people want to come eat at an all-you-can-eat buffet, they can, because we know they have limited appetites. If you bring a giant truck and load up all the food from all all-you-can-eat buffets in the city, that's not OK, even if you later give the food away to homeless people for free. You're going to bankrupt the restaurants! https://xkcd.com/1499/

So no. The difference is that people have come to expect "free" for everything, and this is how we got into ad-supported platforms that dominate our lives.

jacurtis · 4 months ago

I think this is the world we are going to. I'm not going to get mired in the details of how it would happen, but I see this end result as inevitable (and we are already moving that way).

I expect a lot more paywalls for valuable content. General information is commoditized and offered in aggregated form through models. But when an AI is fetching information for you from a website, the publisher is still paying the cost of producing that content and hosting that content. The AI models are increasing the cost of hosting the content and then they are also removing the value of producing the content since you are just essentially offering value to the AI model. The user never sees your site.

I know Ads are unpopular here, but the truth is that is how publishers were compensated for your attention. When an AI model views the information that a publisher produces, then modifies it from its published form, and removes all ad content. Then you now have increased costs for producers, reduced compensation in producing content (since they are not getting ad traffic), and the content isn't even delivered in the original form.

The end result is that publishers now have to paywall their content.

Maybe an interesting middle-ground is if the AI Model companies compensated for content that they access similar to how Spotify compensates for plays of music. So if an AI model uses information from your site, they pay that publisher a fraction of a cent. People pay the AI models, and the AI models distribute that to the producers of content that feed and add value to the models.

jacurtis commented on Man wearing metallic necklace dies after being sucked into MRI machine bbc.com/news/articles/cx2... · Posted by u/brudgers

SketchySeaBeast · 5 months ago

Given that the chain drug him across the room, I can imagine that the actual death might be quite grisly - if it can cause a man to be "hurled towards the machine" it's possible it was worse than a mere strangulation, and that sort of detail isn't really required in the article.

jacurtis · 5 months ago

There is a video of it floating around for the morbidly curious. I won't link it here. It is very NSFL. I was accidently shown it while scrolling instagram and wish I hadn't seen it.

He is able to talk, you can make out his words, but he is clearly choking or being strangled. He was fully sucked into the machine. There was a very strong guy trying with everything to pull him out. He made some pretty sad and harrowing words when he realized he wasn't going to make it. Again, the video is out there if you really want to see it. I do NOT recommend it though.

jacurtis commented on What happens when housing prices go down? clmarohn.substack.com/p/w... · Posted by u/chmaynard

onlyrealcuzzo · 5 months ago

If you drop interest rates to 0%, it's hard to argue you're not bailing out everyone who's in debt (which is every home owner with a mortgage).

Sure, maybe you are PRIMARILY bailing out banks.

But you are also bailing out everyone with a mortgage and enough brain cells to refinance and cut their by-far-largest monthly expense by 30-50%.

jacurtis · 5 months ago

Both you and the parent are correct.

The "bailout" for consumers is that they lower interest to 0%. That's what we did in 2007. If people can refinance their homes from 7% to ~2% then they save a fortune and it spurs buyers back into the market and current homeowners to move around and shuffle inventory.

Of course the Parent comment is also correct because banks get bailed out by low interest rates, but the government also bailed out several banks directly. Corporate bailouts are always a debatable topic. In one way we should let bad businesses fail, they failed because of the risks and choices they made and bailing them out is just inviting those mistakes to happen again. But on the flip side, consumers do need banks (as much as we refuse or hate to admit it). Yes banks make money off of us, but we as consumers also need banks. Which is why bailouts get approved.

We have seen this movie before. I'm not sure why everyone is debating the ending. We watched and lived the ending. It wasn't pretty in the middle there, but the market eventually recovered. Here we are getting ready to rewind and watch the movie again.

jacurtis commented on Test Postgres in Python Like SQLite github.com/wey-gu/py-pgli... · Posted by u/wey-gu

ForHackernews · 6 months ago

Is this about testing Python that uses Postgres?

Because if you're really interested in testing postgres you can test PG in PG: https://pgtap.org/

jacurtis · 6 months ago

It is for testing python projects that connect to postgres.

So often what you do is when unit testing, you need to test CRUD-type operations between the application and the database. So you generally have to spin up a temporary database just to delete it at the end of the tests. Commonly this is done with SQLite during testing, even if you built the application for postgres in production. Because it is fast and easy and you can interact with it as a file, not requiring connection configurations and added bloat.

But then sometimes your app gets complicated enough that you are using Postgres features that SQLite doesn't have comparables to. So now you need a PG instance just for testing, which is a headache.

So this project bridges the gap. Giving you the feature completedness and consistency across envs of using postgres, but with the conveniences of using SQLite.