makin (u/makin) - Readit News

makin commented on GPT-5 openai.com/gpt-5/... · Posted by u/rd

It is frequently suggested that once one of the AI companies reaches an AGI threshold, they will take off ahead of the rest. It's interesting to note that at least so far, the trend has been the opposite: as time goes on and the models get better, the performance of the different company's gets clustered closer together. Right now GPT-5, Claude Opus, Grok 4, Gemini 2.5 Pro all seem quite good across the board (ie they can all basically solve moderately challenging math and coding problems).

As a user, it feels like the race has never been as close as it is now. Perhaps dumb to extrapolate, but it makes me lean more skeptical about the hard take-off / winner-take-all mental model that has been pushed.

Would be curious to hear the take of a researcher at one of these firms - do you expect the AI offerings across competitors to become more competitive and clustered over the next few years, or less so?

makin · a month ago

Companies are collections of people, and these companies keep losing key developers to the others, I think this is why the clusters happen. OpenAI is now resorting to giving million dollar bonuses to every employee just to try to keep them long term.

makin commented on Cognition (Devin AI) to Acquire Windsurf cognition.ai/blog/windsur... · Posted by u/alazsengul

makin · 2 months ago

I was a bit confused as to what "Cognition" was, but they're the makers of Devin (edit: that just got added to the title, for reference), so that makes sense. Just buying the competition, the only surprise is they had more money to spend than the big ones.

makin commented on Certain names make ChatGPT grind to a halt, and we know why arstechnica.com/informati... · Posted by u/rbanffy

sinuhe69 · 9 months ago

I don't understand why they don't let another model "test the waters" first to see if the output of the main model could have a potential legal issue or not. I think it's easy to train an model specifically for this kind of categorization, and it doesn't even require a large network, so it can be very fast and efficient.

If the "legal advisor" detects a potential legal problem, ChatGPT will issue a legal disclaimer and a warning, so that it doesn't have to abruptly terminate the conversation. Of course, it can do a lot of other things, such as lowering the temperature, raising the BS detection threshold, etc., to adjust the flow of the conversation.

It can work, and it would be better than a hard-coded filter, wouldn't it?

makin · 9 months ago

They already do this, it's the moderation model.[1]

This name thing is an additional layer on top of that, maybe because training the model from zero per name (or fine tuning the system message to include an increasingly big list of names that it could leak) is not very practical.

[1] https://platform.openai.com/docs/guides/moderation/overview

makin commented on AI Advent of Code: Implementing Papers leetarxiv.com/... · Posted by u/muragekibicho

Arainach · 9 months ago

This project does not seem associated with Advent of Code (https://adventofcode.com/). Whether or not it's literal trademark infringement, I dislike the name as it's obviously confusing. "AI Paper Advent Calendar" would reuse a generic term. "Advent of Code" is a specific project.

EDIT: "Advent of Code" is a registered trademark and you should change your name. https://adventofcode.com/2024/about

makin · 9 months ago

I think it's a bit absurd to care about trademark infringement of a social activity like this, but it should have been "Advent of AI Code" at least.

makin commented on Megalopolis is baffling and plainly nuts but worth it thespectator.com/book-and... · Posted by u/mdp2021

currymj · a year ago

i can understand walking out due to disgust or confusion but “bored” is hard for me to comprehend.

makin · a year ago

Personally I didn't find the movie boring overall, but there were around five too many romance scenes between Driver and Emmanuel's characters that didn't seem to move anything forward, the kind of scenes that usually get cut for redundancy.

makin commented on I Am Tired of AI ontestautomation.com/i-am... · Posted by u/Liriel

sovietmudkipz · a year ago

I am tired and hungry…

The thing I’m tired of is elites stealing everything under the sun to feed these models. So funny that copyright is important when it protects elites but not when a billion thefts are committed by LLM folks. Poor incentives for creators to create stuff if it just gets stolen and replicated by AI.

I’m hungry for more lawsuits. The biggest theft in human history by these gang of thieves should be held to account. I want a waterfall of lawsuits to take back what’s been stolen. It’s in the public’s interest to see this happen.

makin · a year ago

I'm sorry if this is strawmanning you, but I feel you're basically saying it's in the public's interest to give more power to Intellectual Property law, which historically hasn't worked out so well for the public.

makin commented on Llama 3.2: Revolutionizing edge AI and vision with open, customizable models ai.meta.com/blog/llama-3-... · Posted by u/nmwnmw

minimaxir · a year ago

Off topic/meta, but the Llama 3.2 news topic received many, many HN submissions and upvotes but never made it to the front page: the fact that it's on the front page now indicates that moderators intervened to rescue it: https://news.ycombinator.com/from?site=meta.com (showdead on)

If there's an algorithmic penalty against the news for whatever reason, that may be a flaw in the HN ranking algorithm.

makin · a year ago

The main issue was that Meta quickly took down the first announcement, and the only remaining working submission was the information-sparse HuggingFace link. By the time the other links were back up, it was too late. Perfect opportunity for a rescue.

makin commented on Grok-2 Beta Release x.ai/blog/grok-2... · Posted by u/meetpateltech

Lerc · a year ago

What does irreversibly mean in this context? It seems like negative connotations are implied, but I feel like it's like irreversibly baking a cake.

makin · a year ago

Once the data is "compressed" into the model it cannot be easily removed without starting the training over.

makin commented on Tony Hawk's Pro Strcpy icode4.coffee/?p=954... · Posted by u/ndiddy

makin · a year ago

A bit of a shame about the exploit applying to THUG PRO. The mod is played to this day, since the more competitive side of the Tony Hawk franchise has been dead for almost twenty years (with the exception of the THPS1+2 remake, which was but a blip in the scene).

The mod itself is over 10 years old now, and I think the original developers are gone, explaining why no one was interested in fixing it when Ryan reported it. But this means that now the mod is unusable, no one is going to want to risk a full privilege exploit taking over their PC.

Hopefully this article reaches someone who's a bit more interested in patching the mod.