- bad blood (about theranos)
- SPQR
- King Leopold’s Ghost
- Lost in Math
- masters of doom
- bad blood (about theranos)
- SPQR
- King Leopold’s Ghost
- Lost in Math
- masters of doom
This would be cool to mix with VR, so you could hear different conversations as you move around a virtual room
Similarly, I assume it’s harder to find an engineer who went into the field purely for money.
I do think on average engineers will prioritize safety (since they likely understand failure modes and production and long tail statistics better. We literally have to take engineering ethics classes), at the cost of doing a worse job at running the business. But when the business requires this level of safety, that IS doing a good job.
Accelerating programming and information jobs also means accelerating the creation of robots that can do these trade jobs
Couldn't you just buy a sheet of plywood, some wood strips, and a bit of wood glue? I mean, setting up a maze will take some time, sure, but it's hardly difficult. Or am I missing something?
The one statistic mentioned in this overview where they observed a 67% drop seems like it could easily be reduced simply by editing 3.7’s system prompt.
What are folks’ theories on the version increment? Is the architecture significantly different (not talking about adding more experts to the MoE or fine tuning on 3.7’s worst failures. I consider those minor increments rather than major).
One way that it could be different is if they varied several core hyperparameters to make this a wider/deeper system but trained it on the same data or initialized inner layers to their exact 3.7 weights. And then this would “kick off” the 4 series by allowing them to continue scaling within the 4 series model architecture.