Readit News logoReadit News
jsharf commented on Claude 4 System Card   simonwillison.net/2025/Ma... · Posted by u/pvg
aabhay · 3 months ago
Given the cited stats here and elsewhere as well as in everyday experience, does anyone else feel that this model isn’t significantly different, at least to justify the full version increment?

The one statistic mentioned in this overview where they observed a 67% drop seems like it could easily be reduced simply by editing 3.7’s system prompt.

What are folks’ theories on the version increment? Is the architecture significantly different (not talking about adding more experts to the MoE or fine tuning on 3.7’s worst failures. I consider those minor increments rather than major).

One way that it could be different is if they varied several core hyperparameters to make this a wider/deeper system but trained it on the same data or initialized inner layers to their exact 3.7 weights. And then this would “kick off” the 4 series by allowing them to continue scaling within the 4 series model architecture.

jsharf · 3 months ago
I want to also mention that the previous model was 3.7. 3.7 to 4 is not an entire increment, it’s theoretically the same as 3 -> 3.3, which is actually modest compared to the capability jump I’ve observed. I do think Anthropic wants more frequent, continuous releases, and using a numeric version number rather than a software version number is their intent. Gradual releases give society more time to react.
jsharf commented on Mass market non-fiction has bad incentives   borretti.me/article/non-f... · Posted by u/bumbledraven
Insanity · 7 months ago
Sounds more like the author just had bad experiences with non fiction? There are plenty of non fiction books which are good and where the book doesn’t just feel like filler. Some that come to mind quickly :

- bad blood (about theranos)

- SPQR

- King Leopold’s Ghost

- Lost in Math

- masters of doom

jsharf · 7 months ago
He's not saying that all non fiction is bad, just that the incentives are misaligned, and to be fair at least in my experience, there are a lot of popular non-fiction books where each chapter is repetitive, and I feel the whole thing could have been written in 2-3 chapters, if publishing a 30-page nonfiction book wasn't taboo
jsharf commented on Phased Array Microphone (2023)   benwang.dev/2023/02/26/Ph... · Posted by u/bglazer
jsharf · 9 months ago
Wow, you can refocus the direction after the audio is recorded!

This would be cool to mix with VR, so you could hear different conversations as you move around a virtual room

jsharf commented on Alaska CEO: We found many loose bolts on our Max planes   nbcnews.com/business/busi... · Posted by u/aurareturn
RandomLensman · 2 years ago
Why would it need to be an "engineering culture" as opposed to, for example, a "quality culture"? And even if you want to former, why does it need an engineer to create that culture? Does a good movie director also need to be a good writer or actor?
jsharf · 2 years ago
I assume it’s easier to find an engineer who went to engineering school to learn how to build airplanes that are safe than it is to find an MBA who went to business school to learn how to build planes that are safe. (It’s not about the knowledge but about the root desire)

Similarly, I assume it’s harder to find an engineer who went into the field purely for money.

I do think on average engineers will prioritize safety (since they likely understand failure modes and production and long tail statistics better. We literally have to take engineering ethics classes), at the cost of doing a worse job at running the business. But when the business requires this level of safety, that IS doing a good job.

jsharf commented on Boeing 737-900ER: Second model to be inspected after 737 MAX 9 blowout   bbc.co.uk/news/world-us-c... · Posted by u/mindracer
imglorp · 2 years ago
A failed door plug might kill several people. A failed rudder will kill all of them.
jsharf · 2 years ago
I think planes can still fly with the rudder loose? If the bolt falls out and it loses control, wind will push it into the neutral position and then flying will still be possible with other control surfaces? But I guess if the pilots don't know and it happens suddenly at a critical moment or if the bolt causes the rudder to get jammed, then that would be really bad. But I assume it falling out would result in the rudder loosely returning to neutral...
jsharf commented on My AI Timelines Have Sped Up (Again)   alexirpan.com/2024/01/10/... · Posted by u/hcarlens
idopmstuff · 2 years ago
I totally agree with this - it always seems like sort of a weird omission when people talk about AGI doing 95% of work without any discussion of robotics. It seems very obvious that you need bipedal robots (or a lot of different types of robots) to hit that number. Do these AGI predictions assume that we'll have those by the time they hit AGI? Or is the word "knowledge" supposed to be implied before the work "work"? Either way, it's clearly inaccurate to say that AI on its own can do all productive work without physical embodiment.
jsharf · 2 years ago
If we have intelligent AI that can automate programming, then making really good robots will not be a problem. While not trivial, actuators and power systems are not the reason why we don’t have robots that can do all manual labor for us. Software is the reason, and the same kind of software that’s learning to code (machine learning) can also be adapted to washing dishes, folding clothing, doing craft labor or previously human manufacturing jobs.

Accelerating programming and information jobs also means accelerating the creation of robots that can do these trade jobs

jsharf commented on Now I can just print that video   blog.forret.com/2023/12/0... · Posted by u/pforret
jsharf · 2 years ago
Recommend passing the speech-to-text narration through a round of GPT4 API to correct for any transcription errors (use some prompt giving context that it's speech to text)
jsharf commented on Improving deep sleep may prevent dementia, study finds   monash.edu/news/articles/... · Posted by u/clouddrover
halfmatthalfcat · 2 years ago
Funny how you say avoid Vit D, yet another poster swears taking Vit D before bed solved their issue.
jsharf · 2 years ago
Suggests there's other variables involved, like time of day taken, other supplements taken simultaneously, metabolic processes, diet, and maybe even the placebo effect.
jsharf commented on Show HN: PFAS.report – Measure the forever chemicals in your blood via Quest   pfas.report/... · Posted by u/mdrew
haskellandchill · 2 years ago
Are you giving someone bad blood in that case?
jsharf · 2 years ago
Well, assuming they likely have some PFAS already in their system, you might just be giving them blood with the same concentration of PFAS that they already have.
jsharf commented on Micromouse: The fastest maze-solving competition [video]   youtube.com/watch?v=ZMQbH... · Posted by u/zdw
newaccount74 · 2 years ago
> the difficulty in setting up a physical maze for the robot

Couldn't you just buy a sheet of plywood, some wood strips, and a bit of wood glue? I mean, setting up a maze will take some time, sure, but it's hardly difficult. Or am I missing something?

jsharf · 2 years ago
They need to be dimensioned and finished quite accurately. Most plywood has a slight bend to it. At the speeds and accelerations needed to be competitive, any imperfection would mean your car will fly off the track or hit a wall.

u/jsharf

KarmaCake day387April 14, 2014View Original