Readit News logoReadit News
grej commented on Gemini 3.0 spotted in the wild through A/B testing   ricklamers.io/posts/gemin... · Posted by u/ricklamers
CaptainOfCoit · 2 months ago
> The longer a chat goes, it gets worse very quickly.

This has been the same for every single LLM I've used, ever, they're all terrible at that.

So terrible that I've stopped going beyond two messages in total. If it doesn't get it right at the first try, its more and more unlikely to get it right for every message you add.

Better to always start fresh, iterate on the initial prompt instead.

grej · 2 months ago
Yes agree, but it seems gemini drops off more quickly than other foundation models for some reason.
grej commented on Gemini 3.0 spotted in the wild through A/B testing   ricklamers.io/posts/gemin... · Posted by u/ricklamers
grej · 2 months ago
My strange observation is that Gemini 2.5 Pro is maybe the best model overall for many use cases, but starting from the first chat. In other words, if it has all the context it needs and produces one output, it's excellent. The longer a chat goes, it gets worse very quickly. Which is strange because it has a much longer context window than other models. I have found a good way to use it is to drop the entire huge context of a while project (200k-ish tokens) into the chat window and ask one well formed question, then kill the chat.
grej commented on Fireman Sam (Commodore 64)   retrovania-vgjunk.blogspo... · Posted by u/jandeboevrie
grej · 3 months ago
I never played this, but it reminds me the C64 Ghostbusters game which I loved!
grej commented on Improved Gemini 2.5 Flash and Flash-Lite   developers.googleblog.com... · Posted by u/meetpateltech
grej · 3 months ago
I love the gemini models and think Google has done a great job on them, but no model series I use seems to get context rot more in long conversations. Which seems strange given the longer context.
grej commented on Choose Your Own Adventure   filfre.net/2025/09/choose... · Posted by u/naves
grej · 3 months ago
I absolutely adored these books as a kid! Spend every dime of bookfair money on them every year and used to beg my parents to take me to the library to check out others.

I love the framing of them in this article as the gateway drug to interactive entertainment.

grej commented on Stephen Miller's Quota Likely Drove Korean Arrests in Immigration Raid   forbes.com/sites/stuartan... · Posted by u/gok
grej · 3 months ago
In addition to Korea being one of our most important military allies in the world, you need batteries for military drones, and the US is way behind in the development of a domestic manufacturing supply chain for next gen batteries.

So now we know clearly that nationalist xenophobia the true most important priority for this administration. Or at least, more important than either the domestic economic interests of their own base or strategic national security interests.

grej commented on Tau² benchmark: How a prompt rewrite boosted GPT-5-mini by 22%   quesma.com/blog/tau2-benc... · Posted by u/blndrt
grej · 3 months ago
DSPy was ahead of its time and still underutilized.
grej commented on Tesla remotely deactivates rapper's vehicle for singing about the Cybertruck?   threads.com/@brittainfors... · Posted by u/Analemma_
grej · 4 months ago
Wow, if this is true this is some serious Black Mirror line crossing.
grej commented on The dead need right to delete their data so they can't be AI-ified, lawyer says   theregister.com/2025/08/0... · Posted by u/rntn
grej · 5 months ago
Out - A scammer convincing a grandmother to send money using an AI generated voice of their grandchild asking them for money

In - A legal ad tech company using an AI generated deceased grandmother to ask their grandchild to purchase a product

grej commented on Mexico to US livestock trade halted due to screwworm spread   usda.gov/about-usda/news/... · Posted by u/burnt-resistor
grej · 5 months ago
The US successfully eradicated screwworms here in 1966 with a brilliant integrated sterile insect technique - I think the very first use of it (and had previously funded helping other countries control it also). But if we had another outbreak spread, I doubt there's any shred of competence left in this current gutted federal government to do anything like that again. Maybe they can have the new ICE folks try to deport the screwworm flies.

u/grej

KarmaCake day6083October 1, 2013
About
Technologist, Engineer, Strategist & Consultant / Developing Machine Learning, Applied AI & Analytics Solutions
View Original