Now I just test against a real database in a Docker container. I have over 1k tests that run in about 1.5 minutes. I’m pretty happy with that.
I guess given that, testing isn’t quite the use case for this (for me). Wonder what else this could be used for.
A short run at a small toy app makes me feel like Opus 4.5 is a bit slower than Sonnet 4.5 was, but that could also just be the day-one load it's presumably under. I don't think Sonnet was holding me back much, but it's far too early to tell.
> For Claude and Claude Code users with access to Opus 4.5, we’ve removed Opus-specific caps. For Max and Team Premium users, we’ve increased overall usage limits, meaning you’ll have roughly the same number of Opus tokens as you previously had with Sonnet. We’re updating usage limits to make sure you’re able to use Opus 4.5 for daily work. These limits are specific to Opus 4.5. As future models surpass it, we expect to update limits as needed.
The biggest catalyst for this is knowing that most visual testing tools do their image diffing with one of just one or two libraries. I worked at Percy for four years (before and after the acquisition), and it was ImageMagick (at the time). Argos CI uses odiff, and lots of other platforms use these sorts of libraries too.
I wanted something that would allow me to visually test my live sites, with live dynamic data, and not have to plaster ignore regions all over the place. And I think I have!
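For anyone who hasn't looked inside these tools, here's a rough sketch of what plain per-pixel diffing boils down to. This is only the basic idea, not how ImageMagick or odiff are actually implemented. It assumes the images are already decoded into rows of RGB tuples, and `diff_ratio` and `tolerance` are names I made up for illustration.

```python
from typing import List, Tuple

# Assumption: an image is a list of rows, each row a list of (R, G, B) tuples.
Pixel = Tuple[int, int, int]
Image = List[List[Pixel]]


def diff_ratio(a: Image, b: Image, tolerance: int = 0) -> float:
    """Return the fraction of pixels that differ between two same-sized images.

    A pixel counts as different when any channel differs by more than
    `tolerance`. Hypothetical helper, for illustration only.
    """
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("images must have the same dimensions")

    total = sum(len(row) for row in a)
    if total == 0:
        return 0.0

    changed = 0
    for row_a, row_b in zip(a, b):
        for pa, pb in zip(row_a, row_b):
            # Compare channel by channel; one channel over tolerance is enough.
            if any(abs(ca - cb) > tolerance for ca, cb in zip(pa, pb)):
                changed += 1
    return changed / total
```

With a comparison like this, any dynamic content (a timestamp, a rotating banner) flips pixels and fails the check, which is why ignore regions end up everywhere.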
codesign -dv /Applications/Obsidian.app
Executable=/Applications/Obsidian.app/Contents/MacOS/Obsidian
Identifier=md.obsidian
Format=app bundle with Mach-O universal (x86_64 arm64)
CodeDirectory v=20500 size=759 flags=0x10000(runtime) hashes=13+7 location=embedded
Signature size=8975
Timestamp=Sep 29, 2025 at 12:22:41 PM
Info.plist entries=39
TeamIdentifier=6JSW4SJWN9
Runtime Version=15.4.0
Sealed Resources version=2 rules=13 files=23
Internal requirements count=1 size=172
Also, I love OSS as much as the next person, but not everything needs to be.
If something's wrong with your car's head unit firmware or Android Auto connection or whatever, of course you'd have a technician look at it?
Pretty much, yeah. I race SCCA and build race cars. That's exactly why I want nothing to do with these: you don’t own it. You’re leasing hardware that’s hogtied to the software.