JacobAsmuth commented on “Erdos problem #728 was solved more or less autonomously by AI”   mathstodon.xyz/@tao/11585... · Posted by u/cod1r
aidenn0 · 2 months ago
How do you verify that the AI translation to Lean is a correct formalization of the problem? In other fields, generative AI is very good at making up plausible sounding lies, so I'm wondering how likely that is for this usage.
JacobAsmuth · 2 months ago
You read it yourself :)

If you and the AI agree on the translation of the problem, and Lean agrees with the solution, then you're done.
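
A minimal Lean 4 sketch of that division of labor, using a toy statement as a stand-in (this is not the Erdős #728 formalization, and the name `toy_add_comm` is made up for illustration). The part a human has to read and agree with is the statement; the proof after `:=` is what Lean checks.

```lean
-- Toy stand-in, NOT the actual Erdős #728 statement.
-- A human reads this line and decides whether it says what the problem says.
theorem toy_add_comm (a b : Nat) : a + b = b + a := by
  -- Everything from here on is checked by Lean's kernel,
  -- so it does not matter who (or what) wrote it.
  exact Nat.add_comm a b
```

If the statement is a faithful translation and the file compiles without `sorry`, the remaining trust is in the statement and in Lean itself, not in the prover that produced the proof.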

JacobAsmuth commented on Don't fall into the anti-AI hype   antirez.com/news/158... · Posted by u/todsacerdoti
dom96 · 2 months ago
> As a programmer, I want to write more open source than ever, now.

I want to write less. Just knowing that LLMs are going to be trained on my code makes me feel more strongly than ever that my open source contributions will simply be stolen.

Am I wrong to feel this? Is anyone else concerned about this? We've already seen some pretty strong evidence of this with Tailwind.

JacobAsmuth · 2 months ago
This is why I never got into open source in the first place. I was worried that new programmers might read my code, learn how to program, and then start independently contributing to the projects I know and love, significantly devaluing my contributions.
JacobAsmuth commented on GPT-5.2-Codex   openai.com/index/introduc... · Posted by u/meetpateltech
freedomben · 3 months ago
The cybersecurity angle is interesting, because in my experience OpenAI stuff has gotten terrible at cybersecurity because it simply refuses to do anything that can be remotely offensive (as in the opposite of "defensive"). I really thought we as an industry had learned our lesson that blocking "good guys" (aka white-hats) from offensive tools/capabilities only empowers the gray-hat/black-hats and puts us at a disadvantage. A good defense requires some offense. I sure hope they change that.
JacobAsmuth · 3 months ago
So in general you think that making frontier AI models more offensive in black hat capabilities will be good for cybersecurity?
JacobAsmuth commented on Gemini 3 Pro: the frontier of vision AI   blog.google/technology/de... · Posted by u/xnx
hodder · 3 months ago
As a consumer I typed this into "Gemini". The behind the scenes model selection just adds confusion.

If "AI" trust is the big barrier to widespread adoption of these products, Alphabet soup isn't the solution (pun intended).

JacobAsmuth · 3 months ago
It works fine for me. https://imgur.com/a/MKNufm1
JacobAsmuth commented on OpenAI declares 'code red' as Google catches up in AI race   theverge.com/news/836212/... · Posted by u/goplayoutside
woeirua · 3 months ago
Wait, shouldn't their internal agents be able to do all this work by now?
JacobAsmuth · 3 months ago
They have a stated goal of an AI researcher for 2028. Several years away.
JacobAsmuth commented on Gemini 3 Pro Model Card [pdf]   storage.googleapis.com/de... · Posted by u/virgildotcodes
patates · 4 months ago
It says it's been trained from scratch. I wonder if it will have the same undescribable magic that makes me spend an hour every day with 2.5. I really love the results I can get with 2.5 pro. Google eventually limiting aistudio will be a sad day.

Also I really hoped for a 2M+ context. I'm living on the context edge even with 1M.

JacobAsmuth · 4 months ago
AIStudio now accepts an API key. Unlimited usage :)
JacobAsmuth commented on Gemini 3 Pro Model Card [pdf]   storage.googleapis.com/de... · Posted by u/virgildotcodes
mynti · 4 months ago
It is interesting that Gemini 3 beats every other model on these benchmarks, mostly by a wide margin, but not on SWE-Bench. Sonnet is still king here, and all three look to be basically on the same level. Kind of wild to see them hit such a wall when it comes to agentic coding.
JacobAsmuth · 4 months ago
50% of the CLs in SWE-Bench Verified are from the Django codebase. So if you're a big contributor to Django you should care a lot about that benchmark. Otherwise the difference between models is ±2 tasks done correctly. I wouldn't worry too much about it. Just try it out yourself and see if it's any better.
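
For scale, here is a rough sketch of what a 2-task gap means as a score difference. It assumes the 500-task size of SWE-Bench Verified; the function name `tasks_to_points` is made up for illustration.

```python
def tasks_to_points(task_diff: int, total_tasks: int = 500) -> float:
    """Convert a difference in solved tasks into percentage points.

    total_tasks defaults to 500, the size of SWE-Bench Verified.
    """
    if total_tasks <= 0:
        raise ValueError("total_tasks must be positive")
    return 100.0 * task_diff / total_tasks


if __name__ == "__main__":
    # A 2-task gap on a 500-task benchmark, in percentage points.
    print(tasks_to_points(2))
```

So a gap of 2 tasks is well under one percentage point on the leaderboard.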
JacobAsmuth commented on Gemini 3   blog.google/products/gemi... · Posted by u/preek
markdog12 · 4 months ago
Ah, I should have mentioned it was a slow motion video.

> The default FPS it's analyzing video at is 1

Source?

JacobAsmuth · 4 months ago
https://ai.google.dev/gemini-api/docs/video-understanding#cu...

"By default 1 frame per second (FPS) is sampled from the video."
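
To see why that matters for slow motion, here is a back-of-the-envelope sketch. The helper names and the 240 fps capture / 30 fps playback figures are illustrative assumptions, and the exact rounding the API uses when picking frames is not something I've verified.

```python
import math


def sampled_frame_count(duration_s: float, sample_fps: float = 1.0) -> int:
    """Approximate number of frames sampled from a clip of the given length."""
    if duration_s < 0 or sample_fps <= 0:
        raise ValueError("duration must be >= 0 and sample_fps > 0")
    return math.ceil(duration_s * sample_fps)


def real_time_gap_s(capture_fps: float, playback_fps: float,
                    sample_fps: float = 1.0) -> float:
    """Real-world seconds between two sampled frames of a slow-motion clip.

    One sample every 1/sample_fps seconds of playback covers
    playback_fps/capture_fps as much real time.
    """
    if min(capture_fps, playback_fps, sample_fps) <= 0:
        raise ValueError("all rates must be positive")
    return (1.0 / sample_fps) * (playback_fps / capture_fps)


if __name__ == "__main__":
    # Hypothetical 10-second clip, shot at 240 fps and played back at 30 fps.
    print(sampled_frame_count(10))   # frames the model would see at 1 FPS
    print(real_time_gap_s(240, 30))  # real seconds between those frames
```

Under those assumptions the model sees about ten frames of a 10-second clip, each an eighth of a real second apart, so anything that changes faster than that in playback time is easy to miss unless you raise the sampling rate.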

u/JacobAsmuth

Karma: 11 · Cake day: November 18, 2025