Readit News logoReadit News
Workaccount2 commented on Evaluating LLMs for my personal use case   darkcoding.net/software/p... · Posted by u/goranmoomin
rplnt · 8 hours ago
> Almost all models got almost all my evaluations correct

I find this the most surprising. I have yet to cross 50% threshold of bullshit to possibly truth. In any kind of topic I use LLMs for.

Workaccount2 · an hour ago
Would you be willing to share some of those chats?
Workaccount2 commented on The use of LLM assistants for kernel development   lwn.net/Articles/1032612/... · Posted by u/Bogdanp
DiabloD3 · 20 hours ago
No, I'm quite aware of how LLMs work. They are statistical models. They have, however, already been caught reproducing source material accurately. There is, inherently, no way to actually stop that if the only training data for a given output is a limited set of inputs. LLMs can and do exhibit extreme overfitting.

As for the Anthropic lawsuit, the piracy part of the case is continuing. Most models are built with pirated or unlicensed inputs. The part that was decided on, although the decision imo was wrong, only covers if someone CAN train a model.

At no point have I claimed you can't train one. The question is can you distribute one, and then use one. An LLM is not simplistic enough to be considered a phonebook, so they can't just handwave that away.

Saying an LLM can do that is like saying an artist can make a JPEG of a Batman symbol, and that's totally okay for them to distribute because the JPEG artifacts are transformative. LLMs ultimately are just a clever way of compressing data, and compressors are not transformative under the law, but possessing a compressor is not inherently illegal, nor is using one on copyrighted material for your own personal use.

Workaccount2 · an hour ago
They will just put a dumb copyright filter on the output, a la YouTube or other hosting services.

Again, it's illegal for artists to recreate copyright, it's not illegal for them to see it or know it. It's not like you cannot hire a guy because he can perfectly visualize Pikachu in his head.

The conflation of training on copyright being equivalent to distribution of copyright is so disingenuous, and thankfully the courts so far recognize that.

Workaccount2 commented on The use of LLM assistants for kernel development   lwn.net/Articles/1032612/... · Posted by u/Bogdanp
DiabloD3 · a day ago
Yes! All of those things DO pose existential copyright risks if they use them to violate copyright!. We're both on the same page.

If you have a VHS deck, copy a VHS tape, then start handing out copies of it, I pick up a copy of it from you, and then see, lo and behold, it contains my copyrighted work, I have sufficient proof to sue you and most likely win.

If you train an LLM on pirated works, then start handing out copies of that LLM, I pick up a copy of it, and ask it to reproduce my work, and it can do so, even partially, I have sufficient proof to sue you and most likely win.

Technically, even involving "which license" is a bit moot, AGPLv3 or not, its a copyright violation to reproduce the work without license. GPL just makes the problem worse for them: anything involving any flavor of GPLv3 can end up snowballing with major GPL rightsholders enforcing the GPLv3 curing clause, as they will most likely also be able to convince the LLM to reproduce their works as well.

The real TL;DR is: they have not discovered an infinite money glitch. They must play by the same rules everyone else does, and they are not warning their users of the risk of using these.

BTW, if I was wrong about this, (IANAL after all), then so are the legal departments at companies across the world. Virtually all of them won't allow AGPLv3 programs in the door just because of the legal risk, and many of them won't allow the use of LLMs with the current state of the legal landscape.

Workaccount2 · a day ago
I think you are confused about how LLMs train and store information. These models aren't archives of code and text, they are surprisingly small, especially relative to the training dataset.

A recent anthropic lawsuit decision also reaffirms that training on copyright is not a violation of copyright.[1]

However outputting copyright still would be a violation, the same as a person doing it.

Most artists can draw a batman symbol. Copyright means they can't monetize that ability. It doesn't mean they can't look at bat symbols.

[1]https://www.npr.org/2025/06/25/nx-s1-5445242/federal-rules-i...

Workaccount2 commented on What is going on right now?   catskull.net/what-the-hel... · Posted by u/todsacerdoti
DrillShopper · 2 days ago
I can barely get GitHub Copilot to output a functional 50 line program, let alone a 5 KLOC program that actually works.
Workaccount2 · 2 days ago
Try Claude or Gemnini pro, copilot is like the dollar store steak of LLMs. Gemini will go up to 8K LOC if you really optimize the context, but that's about the limit. You can use it free in aistudio[1]

[1]https://aistudio.google.com/prompts/new_chat

Workaccount2 commented on Waymo granted permit to begin testing in New York City   cnbc.com/2025/08/22/waymo... · Posted by u/achristmascarl
meagher · 2 days ago
This is great long term for having cars that follow traffic laws since human drivers in NYC are awful (kill/injure pedestrians, bikers, and other street users all the time).

Not so great for getting cars out of NYC and pedestrianizing more of the city/moving towards more “low traffic neighborhoods” as I imagine Waymo and other similar companies are going to fight against these efforts.

Edit: Lots of people talking about human drivers taking advantage of self-driving cars being more cautious/timid. Good news is that once you have enough self-driving cars on the road, it probably slows down/calms other traffic (see related research on speed governors).

Workaccount2 · 2 days ago
Believe it or not, NYC is actually the safest city in the country for pedestrians and bicyclists.[1]

[1]https://www.wagnerreese.com/most-dangerous-cities-cyclists-p...

Workaccount2 commented on What about using rel="share-url" to expose sharing intents?   shkspr.mobi/blog/2025/08/... · Posted by u/edent
biggestfan · 2 days ago
Does anyone even use share buttons? I always just copy the link, and it seems that anyone I see sharing things does the same. It feels more like a way for the social media companies to advertise/track, and those sites have been sending less and less traffic for years, so I wonder why every site still has them.
Workaccount2 · 2 days ago
People use apps nowadays to do everything, the only way to get links out of apps is share buttons.
Workaccount2 commented on What is going on right now?   catskull.net/what-the-hel... · Posted by u/todsacerdoti
ascendantlogic · 2 days ago
> the future is in everyone being able to make their own app.

Everyone can do their own plumbing and electrical work in their homes too. For some people it works out, for others it's still better to pay someone else to do it for them.

Workaccount2 · 2 days ago
I don't think basic software apps have anywhere near the risk profile of electrical or plumbing work.

I'm pretty comfortable letting my mom vibecode a plant watering tracker. Not so much wiring up a distribution box.

Workaccount2 commented on Waymo granted permit to begin testing in New York City   cnbc.com/2025/08/22/waymo... · Posted by u/achristmascarl
nickpinkston · 2 days ago
Is this the first time Waymo is doing winter / snow testing at scale?

I think some of the Pittsburgh-based self-driving firms may have tried this, but unaware how far they got.

Workaccount2 · 2 days ago
We'll see what happens when there is snow in the forecast. They might just call them all back for the storm.
Workaccount2 commented on What is going on right now?   catskull.net/what-the-hel... · Posted by u/todsacerdoti
DrillShopper · 2 days ago
I'll believe it when I see it with my own eyes, otherwise these words read more like sales copy than technological discovery.
Workaccount2 · 2 days ago
If you haven't seen an LLM output a functional 2K or even 5K LOC program at this point, you probably never will.

The problem space of average people problems that can be addressed with <5K LOC is massive. The only real barrier is having to go through an IDE, but that will almost certainly be solved in the near future, it already sort of is with Canvas features.

Workaccount2 commented on CEO pay and stock buybacks have soared at the largest low-wage corporations   ips-dc.org/report-executi... · Posted by u/hhs
_DeadFred_ · 2 days ago
Telling the majority of our population they are dumb for engaging in our current social contract isn't going to work out well for society long term.
Workaccount2 · 2 days ago
Those with a 1000x return almost universally recognize that they have the talent to not work for someone else. Or they do work for someone else, but make far, far, more than a house painter to do it.

Those are probably less than 1% of the population. The majority don't even approach 2x, much less 1000x.

u/Workaccount2

KarmaCake day13940March 24, 2021View Original