It feels like Auto-GPT, BabyAGI, and the like were simply ahead of their time.
Part of it is the snappier, more minimal UX, but the pure efficacy also seems consistently better. Claude does its best work in CC. I'm sure the same is true of Codex.
I am an ML researcher at Cursor and worked on this project. Would love to hear any feedback you may have on the model, and I can answer questions about the blog post.
I wish Apple would take gaming more seriously and make GPTK a first-class citizen, like Proton on Linux.
In my case, for software development, I'd be happy with an entry-level MacBook Air (now with a minimum of 16GB) for $999.
I think it's not too far-fetched to imagine standards, cultures, guardrails, compliance, etc. being documented and versioned but, more importantly, verifiable and applicable. In natural language, no code needed.