JacobAsmuth (u/JacobAsmuth)

JacobAsmuth commented on “Erdos problem #728 was solved more or less autonomously by AI” mathstodon.xyz/@tao/11585... · Posted by u/cod1r

aidenn0 · 2 months ago

How do you verify that the AI translation to Lean is a correct formalization of the problem? In other fields, generative AI is very good at making up plausible sounding lies, so I'm wondering how likely that is for this usage.

JacobAsmuth · 2 months ago

You read it yourself :)

If you and the AI agree on the translation of the problem, and lean agrees with the solution, then you're done.

JacobAsmuth commented on Don't fall into the anti-AI hype antirez.com/news/158... · Posted by u/todsacerdoti

dom96 · 2 months ago

> As a programmer, I want to write more open source than ever, now.

I want to write less, just knowing that LLM models are going to be trained on my code is making me feel more strongly than ever that my open source contributions will simply be stolen.

Am I wrong to feel this? Is anyone else concerned about this? We've already seen some pretty strong evidence of this with Tailwind.

JacobAsmuth · 2 months ago

This is why I never got into open source in the first place. I was worried that new programmers might read my code, learn how to program, and then start independently contributing the the projects I know and love - significantly devaluing my contributions.

JacobAsmuth commented on GPT-5.2-Codex openai.com/index/introduc... · Posted by u/meetpateltech

freedomben · 3 months ago

The cybersecurity angle is interesting, because in my experience OpenAI stuff has gotten terrible at cybersecurity because it simply refuses to do anything that can be remotely offensive (as in the opposite of "defensive"). I really thought we as an industry had learned our lesson that blocking "good guys" (aka white-hats) from offensive tools/capabilities only empowers the gray-hat/black-hats and puts us at a disadvantage. A good defense requires some offense. I sure hope they change that.

JacobAsmuth · 3 months ago

So in general you think that making frontier AI models more offensive in black hat capabilities will be good for cybersecurity?

JacobAsmuth commented on Gemini 3 Pro: the frontier of vision AI blog.google/technology/de... · Posted by u/xnx

hodder · 3 months ago

As a consumer I typed this into "Gemini". The behind the scenes model selection just adds confusion.

If "AI" trust is the big barrier for widespread adoption to these products, Alphabet soup isn't the solution (pun intended).

JacobAsmuth · 3 months ago

It works fine for me. https://imgur.com/a/MKNufm1

JacobAsmuth commented on OpenAI declares 'code red' as Google catches up in AI race theverge.com/news/836212/... · Posted by u/goplayoutside

woeirua · 3 months ago

Wait, shouldn't their internal agents be able to do all this work by now?

JacobAsmuth · 3 months ago

They have a stated goal of an AI researcher for 2028. Several years away.

JacobAsmuth commented on Gemini 3 Pro Model Card [pdf] storage.googleapis.com/de... · Posted by u/virgildotcodes

patates · 4 months ago

It says it's been trained from scratch. I wonder if it will have the same undescribable magic that makes me spend an hour every day with 2.5. I really love the results I can get with 2.5 pro. Google eventually limiting aistudio will be a sad day.

Also I really hoped for a 2M+ context. I'm living on the context edge even with 1M.

JacobAsmuth · 4 months ago

AIStudio now accepts an API key. Unlimited usage :)

JacobAsmuth commented on Gemini 3 Pro Model Card [pdf] storage.googleapis.com/de... · Posted by u/virgildotcodes

mynti · 4 months ago

It is interesting that the Gemini 3 beats every other model on these benchmarks, mostly by a wide margin, but not on SWE Bench. Sonnet is still king here and all three look to be basically on the same level. Kind of wild to see them hit such a wall when it comes to agentic coding

JacobAsmuth · 4 months ago

50% of the CLs in SWE-Bench Verified are the DJango codebase. So if you're a big contributor to Django you should care a lot about that benchmark. Otherwise the difference between models is +-2 tasks done correctly. I wouldn't worry too much about it. Just try it out yourself and see if its any better.

JacobAsmuth commented on Gemini 3 blog.google/products/gemi... · Posted by u/preek

markdog12 · 4 months ago

Ah, I should have mentioned it was a slow motion video.

> The default FPS it's analyzing video at is 1

Source?

JacobAsmuth · 4 months ago

https://ai.google.dev/gemini-api/docs/video-understanding#cu...

"By default 1 frame per second (FPS) is sampled from the video."

u/JacobAsmuth

KarmaCake day11November 18, 2025View Original