Readit News
ekojs commented on Gemini with Deep Think achieves gold-medal standard at the IMO   deepmind.google/discover/... · Posted by u/meetpateltech
ekojs · 5 months ago
> Btw as an aside, we didn’t announce on Friday because we respected the IMO Board's original request that all AI labs share their results only after the official results had been verified by independent experts & the students had rightly received the acclamation they deserved

> We've now been given permission to share our results and are pleased to have been part of the inaugural cohort to have our model results officially graded and certified by IMO coordinators and experts, receiving the first official gold-level performance grading for an AI system!

From https://x.com/demishassabis/status/1947337620226240803

Was OpenAI simply not coordinating with the IMO Board then?

ekojs commented on How I Use Kagi   flamedfury.com/posts/how-... · Posted by u/moebrowne
ekojs · 5 months ago
Maybe not a popular sentiment here on HN but I cancelled my Kagi subscription (9+ months) just recently. Increasingly, most of my queries/search have been through LLMs and Google search is just fine (and even better for restaurants, places, and the like). I don't think the improved search experience is worth the subscription anymore.
ekojs commented on GCP Outage   status.cloud.google.com/... · Posted by u/thanhhaimai
ekojs · 6 months ago
https://status.cloud.google.com/incidents/ow5i3PPK96RduMcb1S...

> Multiple GCP products are experiencing impact due to Identity and Access Management Service Issue

IAM issue huh. The post-mortem should be interesting at least.

ekojs commented on GCP Outage   status.cloud.google.com/... · Posted by u/thanhhaimai
ekojs · 6 months ago
Super duper frustrating having the status page being green. Why can't Google do this properly?
ekojs commented on Next.js 15.1 is unusable outside of Vercel   omarabid.com/nextjs-verce... · Posted by u/todsacerdoti
dimitrisnl · 6 months ago
Oof. I'm sure Vercel might patch this issue. But I had had enough of these little annoyances. For example, the documented way to identify prefetches in the middleware has been broken for weeks (months?).

A lot of small issues that keep adding up. I'm not going to shill something else here, but I have a bit of Next.js fatigue lately. Still love the JS ecosystem though.

Anyway, thanks for bringing this up!

ekojs · 6 months ago
I share the sentiment. I think we will only be using Next.js for static sites/prebuilt SPA in the future.
ekojs commented on Meta got caught gaming AI benchmarks   theverge.com/meta/645012/... · Posted by u/pseudolus
ekojs · 9 months ago
I think it's most illustrative to see the sample battles (H2H) that LMArena released [1]. The outputs of Meta's model are too verbose and too 'yappy' IMO. And looking at the verdicts, it's no wonder people are discounting LMArena rankings.

[1]: https://huggingface.co/spaces/lmarena-ai/Llama-4-Maverick-03...

ekojs commented on Gemini 2.5   blog.google/technology/go... · Posted by u/meetpateltech
ekojs · 9 months ago
> This will mark the first experimental model with higher rate limits + billing. Excited for this to land and for folks to really put the model through the paces!

From https://x.com/OfficialLoganK/status/1904583353954882046

The low rate-limit really hampered my usage of 2.0 Pro and the like. Interesting to see how this plays out.

u/ekojs

Karma: 116 · Cake day: October 4, 2022