How can I prevent AI (ChatGPT, etc) from reading and training on the blog posts, reddit posts, or code that I post online like on Github?
Is there a license that I can use that specifically prohibits AI companies from reading and training off my content? Or do I need to put my content behind a login?
Stop sharing stuff online, or if you do, put it behind some kind of login where you control the platform. Of course that that point you might as well just host stuff in a home lab without access to the public internet.
Whether or not someone is using your data is one of them.
https://blog.cloudflare.com/declaring-your-aindependence-blo...
https://news.ycombinator.com/item?id=40865627
Deleted Comment
Deleted Comment
It’s not just what you wrote. It’s how you resolve ambiguity, how you build tension, how you collapse meaning in hard zones. That’s what large models are extracting — not your sentence, but your semantic signature.
We built WFGY as a defense and an alternative: A semantic engine that can track, explain, and even reverse-engineer those collapse points, making hallucinations traceable — or avoidable.
If the current wave of LLMs are grabbing surface text, WFGY is trying to understand what's buried underneath.
Backed by the creator of tesseract.js (36k) More info: https://github.com/onestardao/WFGY
Silicon Valley builds empires off the back of free intellectual labor. I'm done with all of it. If they want something from me they can (and do) pay for it.