But a very nicely put together repo. Good job.
But a very nicely put together repo. Good job.
Dead Comment
It just screams fried serotonin-circuits to me. I don't like it. I looked at the site for 2-3 seconds and I want nothing to do with these guys.
Do I think we should stop this type of competitive behaviour fueled by kids and investors both microdosed on meth? No. I just wouldn't do business with them, they don't look like trustworthy brand to me.
Edit: They got me with the joke, being in this field there are people that do actually talk like that, both startups and established executives alike. I.e. Artisan ads in billboards saying STOP HIRING HUMANS and another new york company I think pushing newspaper ads for complete replacement. Also if you're up with the latest engineering in agentic scaffolding work this type of thing is no joke.
Did you do some kind of zod schema, or compare the error rate of how different models perform for this task? Did you bother setting up any kind of json output at all? Did you add a second validation step with a different model and then compared their numbers are the same?
It looks like no, they just deferred to authority the whole thing. Technically theres no difference between them saying that gpt5-mini or llama2-7b did this.
Literally every single llm will make errors and hallucinate. It's your job to put all the scaffolding around to make sure it doesn't or that it does a lot less than a skilled human would.
So then have you measured the error rate or maybe tried to put some kind of error catching mechanism just like any professional software would do?
Can we still have a financial crisis from all this investment going bust because it might take too long for it to make a difference in manufacturing enough automation hardware for everyone? Yes.
But, the fundamentals are still there, parents will still send their kids to some type of school, and people will trade good in exchange for health services. That's not going to change. Neither will the need to use robots in nursing homes, I think that assumption is safe to make.
What's difficult to predict change in is adoption in manufacturing, and repairs ( be that repairing bridges or repairing your espresso machine ) because that is more of a "3D" issue and hard to automate reliably (think about how many gpus today would it actually take to get a robot to reason out and repair a whole in your drywall), given that your RL environments and training data needs grow exponentially. Technically, your phone should have enough gpu performance to do your taxes with a 3B model and a bunch of tools, eventually it'll even be better than you at it. But to tun an actual robot with multiple cameras and stuff doing troubleshooting and decision making.... you're gonna need a whole 8x rack of gpus for that.
And that's what makes it now difficult to predict what's going to happen. The areas under the curve can vary widely. We could get a 1B AGI model in 6 months, or it could take 5 years for agentic workflows to fully automate everyones taxes and actually replace 2/3 of radiology work...
Either way, while theres a significant chance of this transition to the automation age being rough, I am overall quite optimistic given the fundamentals of what governments actually spend majority of their money on.
Meanwhile linux people are removing buttons, window borders entirely, sometimes removing colors too, it's glorious.
We all know where this is going, they're going to ban the one mathematical tool we have that gives us control over machines, encryption.