Autocomple is also automatically triggered when you place your cursor inside the code.
Autocomple is also automatically triggered when you place your cursor inside the code.
https://eval.16x.engineer/evals/image-analysis
For them to roll out a browser extension must mean that they have found a walkaround or alternative method to solve the vision performance.
What they want is to get rid of apps like YouTube Vanced that are making them lose money (and other Play Store apps)
there are plenty of other benchmarks that disagree with these, with that said. from my experience most of these benchmarks are trash. use the model yourself, apply your own set of problems and see how well it fairs.
I also publish my own evals on new models (using coding tasks that I curated myself, without tools, rated by human with rubrics). Would love you to check out and give your thoughts:
Example recent one on GPT-5:
https://eval.16x.engineer/blog/gpt-5-coding-evaluation-under...
All results:
"You publish TypeScript source, and JSR handles generating API docs, .d.ts files, and transpiling your code for cross-runtime compatibility."
Still would have been nice to have this for private packages.
This makes Deno/Bun much more attractive alternatives
Anthropic probably has 80% of AI coding model market share. That's a trillion dollar market.