Ask HN: How do I prevent AI from reading/training off my content?

How can I prevent AI (ChatGPT, etc) from reading and training on the blog posts, reddit posts, or code that I post online like on Github?

Is there a license that I can use that specifically prohibits AI companies from reading and training off my content? Or do I need to put my content behind a login?

al_borland · a month ago

We’ve seen time and time again and that AI companies won’t respect any license you may use. When it comes to platforms you don’t own (Reddit, GitHub, etc), you don’t have much control in the matter.

Stop sharing stuff online, or if you do, put it behind some kind of login where you control the platform. Of course that that point you might as well just host stuff in a home lab without access to the public internet.

bnchrch · a month ago

My honest advice? Don't bother with the things you can't control or don't matter.

Whether or not someone is using your data is one of them.

toomuchtodo · a month ago

Cloudflare or login gate.

https://blog.cloudflare.com/declaring-your-aindependence-blo...

https://news.ycombinator.com/item?id=40865627

Deleted Comment

mindfulhacker · a month ago

i'm less worried about someone profiting from my intellectual property, and more worried about maintaining my edge... making sure i stay in a state where i can come up with unique, novel, fresh perspectives on a continuous basis... mainly though meditation, breathwork, bodywork, yoga.

TXTOS · a month ago

Honestly, the real danger isn’t just that AI models might train on your content — it’s that they’re training on your semantic patterns.

It’s not just what you wrote. It’s how you resolve ambiguity, how you build tension, how you collapse meaning in hard zones. That’s what large models are extracting — not your sentence, but your semantic signature.

We built WFGY as a defense and an alternative: A semantic engine that can track, explain, and even reverse-engineer those collapse points, making hallucinations traceable — or avoidable.

If the current wave of LLMs are grabbing surface text, WFGY is trying to understand what's buried underneath.

Backed by the creator of tesseract.js (36k) More info: https://github.com/onestardao/WFGY

zerohp · a month ago

My solution is that I no longer contribute to the public internet in any meaningful way. No more open source projects. No more contributions to free software. Bug reports only when it helps me. The hacker ethos is dead. Selfishness and greed won.

Silicon Valley builds empires off the back of free intellectual labor. I'm done with all of it. If they want something from me they can (and do) pay for it.