I'm an author, and I've confirmed that 3 of my books are in the 500K dataset.
Thus, I stand to receive about $9,000 as a result of this settlement.
I think that's fair, considering that two of those books received advances under $20K and never earned out. Also, while I'm sure that Anthropic has benefited from training its models on this dataset, that doesn't necessarily mean that those models are a lasting asset.
You won't be put in jail if you breach copyright in almost any country, at least not just for downloading content from LibGen or torrents. If you are talking about Swartz, he was facing jail for wire fraud and hacking, not for breaching copyright.
Just an FYI that it's closer to $6,750 (Anthropic pays $9,000, but roughly 25% is likely to go to the attorneys; the exact number is up to the court).
Can't help but feel the reporting about $3,000/work is going to leave a lot of authors disappointed when they receive ~$2,250, even if they'd have been perfectly happy if that was the number they initially saw.
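For anyone wondering where those two figures come from, here is the back-of-the-envelope arithmetic as a small Python sketch. The $3,000/work, 3 works, and 25% fee are just the numbers quoted in this thread, and the fee percentage is ultimately up to the court:

```python
# Rough payout math for the reported settlement figures.
# Assumptions (not from the settlement itself): $3,000 per work,
# 3 works in the class for this author, and a 25% attorney fee award.
PER_WORK = 3_000
WORKS = 3
ATTORNEY_FEE = 0.25          # the court sets the actual percentage

gross = PER_WORK * WORKS     # the $9,000 headline number
net = gross * (1 - ATTORNEY_FEE)
print(f"gross: ${gross:,.0f}, net: ${net:,.0f}, per work: ${net / WORKS:,.0f}")
# gross: $9,000, net: $6,750, per work: $2,250
```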
Do they sell their books for more than $3000 per copy? In that case it isn't fair. Otherwise they are getting a windfall because of Anthropic's stupidity in not buying the books.
In my opinion, as a class member you should push for three things:
1. Getting the maximum statutory damages for copyright infringement, which would be something like $150,000 per work for willful infringement; you can be generous and call their training and reproduction of your works a single instance, though it's probably many more than that.
2. An admission of wrongdoing plus withdrawal from the market and permanent deletion of all models trained on infringed works.
3. A perpetual agreement to only train new models on content licensed for such training going forward, with safeguards to prevent wholesale reproduction of works.
It’s no less than what they would do if they thought you were infringing their copyrights. It’s only fair that they be subject to the same kind of serious penalties, instead of something they can write off as a slap on the wrist.
>Also, while I'm sure that Anthropic has benefited from training its models on this dataset
I thought that they didn't use this data for training; the "crime" here was making the copies.
>I think that's fair, considering that two of those books received advances under $20K and never earned out.
I don't understand your logic here: if they never earned out, that means you were already "overpaid" compared to what they were worth in the market. Shouldn't fairness mean this extra bonus goes first to cover the unmet earnout?
As I understand, this case is not about training but about illegitimately sourcing the books, so unless you sell your books at $3k per copy, I don't see how it is fair.
What's more fair is for Anthropic to put 5% of their preferred shares, at their most recent valuation, into a pool that the authors of these books can make a claim against. For 18 months, any author in this cache of books can claim their ownership and rights to their proportional share of the pool among all claimants.
Perhaps tokenize all of the books and assign proportionally for token count of each publication.
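As a rough illustration only, here is what that pro-rata split could look like in Python. The pool size, author names, and token counts are all made-up placeholders for the commenter's hypothetical, not anything in the actual settlement:

```python
# Hypothetical: split a fixed pool of shares among claimants in
# proportion to the token count of each claimed publication.
def allocate_pool(pool_shares: float, token_counts: dict[str, int]) -> dict[str, float]:
    total_tokens = sum(token_counts.values())
    return {author: pool_shares * tokens / total_tokens
            for author, tokens in token_counts.items()}

# Example: a made-up pool of 50,000 shares (e.g. 5% of a hypothetical
# 1,000,000 preferred shares) and three hypothetical claimants.
claims = {"author_a": 120_000, "author_b": 80_000, "author_c": 200_000}
print(allocate_pool(50_000, claims))
# {'author_a': 15000.0, 'author_b': 10000.0, 'author_c': 25000.0}
```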
For you it might be okay, but there are others who are probably losing way too much money because of what happened. Anthropic needs to pay 5x to 10x more; it needs to set a deterrent.
> that doesn't necessarily mean that those models are a lasting asset.
It remains to be seen, but typically this forms a moat. Other companies can't bring together the investment resources to duplicate the effort and they die.
The only reasons why this wouldn't be a moat:
1. Too many investment dollars and companies chasing the same goal, and none of them consolidate. (Non-consolidation feels impractical.)
2. Open source / commoditize-my-complement offerings that devalue foundation models. We have a few of these, but the best still require H100s and they're not building product.
I think there's a moat. I think Anthropic is well positioned to capitalize from this.
It is not about $9k for your knowledge in that book. It is $9k for taking you out.
The faster they are able to grab and process data, the less chance you have to make money from your work.
The money is irrelevant if we allow them to break the law. They might even pay you $9k for those books, but you might never get anything again, because they will have made copyright useless.
My understanding is this settlement is about the MANNER in which Anthropic acquired the text of the books. They downloaded illegal copies of the books.
There were no issues with the physical copies of books they purchased and scanned.
I believe the issue of USING these texts for AI training is a separate issue/case(s).
Penalties can be several times actual damages, and substantial similarity includes MP3 files and other lossy forms of compression which don’t directly look like the originals.
The entire point of deep learning is to copy aspects from training materials, which is why it’s unsurprising when you can reproduce substantial material from a copyrighted work given the right prompts. Proving damages for individual works in court is more expensive than the payout but that’s what class action lawsuits are for.
Given that books can be imitated by humans with no compensation, this isn't as strong an argument as you think. Moreover AFAIK the training itself has been ruled legal, so Anthropic could have theoretically bought the book for $20 (or whatever) and be in the clear, which would obviously bring less revenue than the $9k settlement.
While I'm sure it feels good and validating to have this called copyright infringement, and be compensated, it's a mixed blessing at best. Remember, this also means that your works will owe compensation to anyone you "trained" off of. Once we accept that simply "learning from previous copyrighted works to make new ones" is "infringement", then the onus is on you to establish a clean creation chain, because you'll be vulnerable to the exact same argument, and you will owe compensation to anyone whose work you looked at in learning your craft.
It's a good thing that laws can be different for AI training and human consumption. And I think the blog post you linked makes that argument, too, so I'm not sure why you'd contort it into the idea that humans will be compelled to attribute/license information that has inspired them when creating art.
LLMs cannot create copyrightable works. Only humans can do that [0]. So LLMs are not making new copyrightable works.
[0] not because we're so amazingly more creative. But because copyright is a legal invention, not something derived from first principles, and has been defined to only apply to human creations. It could be changed to apply to LLM output in the future.
An infinitely scaling, for-profit commercial product designed to replace every creative by applying software processing to previous works is treated very differently than a sentient human being and their process of creativity.
The fact AI proponents can't see that is insane. Reminds me of the quote:
"It is difficult to get a man to understand something, when his salary depends upon his not understanding it."
Name should sound familiar to those who follow tech law; he presided over Oracle v Google, along with Anthony Levandowski's criminal case for stealing Waymo tech for Uber.
He actually does understand most of what he is ruling on which is a welcome surprise. Not just legal jargon but also the technical spirit of what is at stake.
He's also the one who called bullshit when Oracle tried to claim that Java's function signatures were so novel they should be eligible for copyright. (Generally, arts are copyrightable and engineering is not - there's a creativity requirement.)
They tried to say `rangeCheck(length, start, end)` was novel. He spat back that he'd written equivalent utility functions as a hobbyist hundreds of times!
Art versus engineering is a very dangerous generalization of the law. There is a creativity requirement for copyrightability, but it's an explicitly low bar. Search query "minimal degree of creativity".
The Supreme Court decision in Oracle v Google skipped over copyrightability and addressed fair use. Fair use is a legal defense, applying only in response to a finding of infringement, which can only be found if the material is copyrightable. So the way the Supreme Court made its decision was weird, but it wasn't about the creativity requirement.
Comments so far seem to be focusing on the rejection without considering the stated reasons for rejection. AFAICT Alsup is saying that the problems are procedural (how do payouts happen, does the agreement indemnify Anthropic from civil “double jeopardy”, etc), not that he’s rejecting the negotiated payout. Definitely not a lawyer but it seems to me like the negotiators could address the rejection without changing any dollar numbers.
Yes, exactly. The article is pretty clear that it’s rejected without prejudice and that a few points need to be ironed out before he gives a preliminary approval. I suspect a lot of folks didn’t read much/any of TFA.
I do wonder if all of the kinks will be smoothed out in time. Not a lawyer either, but the timeline to create the longer list is a bit tight, and it generally feels like we could see an actual rejection, or at least a stretched-out process that goes on for a few more months before approval.
Exactly. The judge is doing exactly what he's designed to do in a civil case -- help forge an agreement between the parties that doesn't come back to bite anyone in the future. The last thing a judge wants is a case getting reopened and relitigated a year from now because there was a "bug" in the settlement.
Good. Approving this would have set a concerning precedent.
Edit: My stance on information freedom and copyright hasn't changed since Aaron Swartz's death in 2013. Intellectual property laws, patents, copyright, and similar protections feel outdated and serve mainly to protect established interests. Despite widespread piracy making virtually all media available immediately upon release, content creators and media companies continue to grow and profit. Why should publishers rely on century-old laws to restrict access?
Because whenever anyone argues that all creative and knowledge works should be freely available, accessible without compensating the creators, they conveniently leave out software and the people who make it.
Moreover, IP law protects plenty of people who aren’t “established interests”. You just, perhaps, don’t know them.
Would it actually set any kind of legal precedent, or just establish a sort of cultural vibe baseline? I know Anthropic doesn't have to admit fault, and I don't know if that establishes anything in either direction. But I'm not from the US, so I wouldn't want to pretend to have intimate knowledge of its system.
The number of bizarre, contradictory inferences this settlement asks you to make - no matter your stance on the wider question - is wild.
The settlement doesn't set any kind of precedent at all.
The existing rulings in the case establish "persuasive" precedent (i.e. future cases are entirely free to disagree and rule to the contrary) - notably including the part about training on legally acquired copies of books (e.g. from a book store) being fair use.
Only appeals courts establish binding precedent in the US (and only for the courts under them). A result of this case settling is that it won't be appealed, and thus won't establish any binding precedent one way or another.
> The number of bizarre, contradictory inferences this settlement asks you to make - no matter your stance on the wider question - is wild.
In an economy where ideas have value, it seems logical that we should have property protection, much like we do for physical goods. It's easy to argue "ideas should be freely shared", but if an idea takes 20 years and $100M to develop, and there are no protections for ideas, then no one will take the time to develop them. Most modern technology we have is due to copyright/patents (drugs, electronics, entertainment, etc.), because without those protections, no one would have invested the time and energy to develop it in the first place.
I believe you are probably only looking at the current state of the world and seeing how it "stifles competition" or "hampers innovation". Those allegations are probably true to some extent, especially in specific cases, but it's also missing the fact that without those protections, the tech likely wouldn't be created in the first place (and so you still wouldn't be able to freely use the idea, since the person who invented it wouldn't have).
This is a kind of strange example, since (for drugs, at least) the discovery tends to be government-funded research, and the safety is shown with private money.
The USSR went to space without those protections. It's not like property protections are the only thing that has driven invention.
MIT licenses are also pretty popular, as are Creative Commons licenses.
People also do things that don't make a lot of money, like teaching elementary school. It costs a ton of money to make and run all those schools, but without any intellectual property being created that can be sold or rented out.
I don't believe that nobody would want to build much of the things we have now if there wasn't IP around them. Making and inventing things is fun.
> but if an idea takes 20 years and $100M dollars to develop, and there are no protections for ideas, then no one will take the time to develop them
This sounds trivially true but I have some trouble reconciling it with reality. For example the Llama models probably cost more than this to develop but are made freely available on GitHub. So while it’s true that some things won’t be built, I think it’s also the case that many things would still be built.
I appreciate you giving the parent comment a fair chance.
As a society we’re having trouble defining abstract components of the self (consciousness, intelligence, identity) as is. What makes the legislative notion of an idea and its reification (what’s actually protected under copyright laws) secure from this same scrutiny? Then patent rights. And what do you think may happen if the viability of said economy comes into question afterwards?
It's just a fiction to allow something freely copiable - pure information - to be treated as if it were a commodity. If the AI firms have only a single redeeming feature, then it is that in them the copyright mafia finally has to face someone their own size, rather than driving little people to suicide, as they did to Aaron Swartz.
Only people who don't create anything say that. Every musician and every author I know (including myself) thinks they should have some rights concerning the distribution and sale of the products of their work. Why should a successful book author be forced to live on charity?
The judge IIRC found that training models using copyrighted materials was fair use. I disagree. Furthermore this will be a problem for anyone who generates text for a living. Eventually LLMs will undercut web, news, and book publishing because LLMs capture the value and don't pay for it. The ecosystem will be harmed.
The only problem the judge found here was training on pirated texts.
How do any of these AI companies protect authors from users uploading full PDFs, or even plaintext, of anything? Aren't the same piracy concerns real even if they train on what users are providing?
I bet you could get a court to say it was legally identical.
I think the Aereo case, and Scalia's dissent, are super relevant here. It's when the court decided to go with vibes, instead of facts. The inevitable result of that (which Scalia didn't predict) was selective enforcement.
edit: so what I really mean is that I bet you could get a court to say whatever you wanted about it if you were far wealthier and more influential than your opponents.
It's not similar at all because you can't get the book back out of the LLM like you can out of Dropbox. Copyright law is concerned with outputs, not with inputs. If you could make a machine that could create full exact copies of books without ever training on or copying those books, that would still be infringement.
This settlement has nothing to do with any criminal liability Anthropic might have, only tort liability (and it involves damages, not fines).
I wouldn't get fined $7,000 for illegally downloading 3 books, for example - much less. Although if I'm a repeat offender it can go up to prison, I think.
It may be fair to you but how about other authors? Maybe it's not fair at all to them.
> Perhaps tokenize all of the books and assign proportionally for token count of each publication.
Doesn't that mean the money should go to your publisher instead of you?
Where can I check if I'm eligible?
It is just another opinion.
Infringement was supposed to imply substantial similarity. Now it is supposed to mean statistical similarity?
The suit isn't about Anthropic training its models using copyrighted materials. Courts have generally found that to be legal.
The suit is about Anthropic procuring those materials from a pirated dataset.
The infringement, in other words, happened at the time of procurement, not at the time of training.
If it had procured them from a legitimate source (e.g. licensed them from publishers) then the suit wouldn't be happening.
> Once we accept that simply "learning from previous copyrighted works to make new ones" is "infringement", then the onus is on you to establish a clean creation chain, because you'll be vulnerable to the exact same argument, and you will owe compensation to anyone whose work you looked at in learning your craft.
This point was made earlier in this blog post:
https://blog.giovanh.com/blog/2025/04/03/why-training-ai-can...
HN discussion of the post: https://news.ycombinator.com/item?id=43663941
> Name should sound familiar to those who follow tech law; he presided over Oracle v Google, along with Anthony Levandowski's criminal case for stealing Waymo tech for Uber.
His orders and opinions are, imo, a success story of the US judicial system. I think this is true even if you disagree with them
> The number of bizarre, contradictory inferences this settlement asks you to make - no matter your stance on the wider question - is wild.
What contradictions do you see? I don't see any.
Sometimes these companies specifically seek out a settlement to avoid setting a legal precedent when they feel they would lose.
Since the violation is detected via model output, it doesn't matter what the input method is.