This much is obvious, but they seem to be satisfied with theory over practicality.
Anyway I'm just ranting b/c they haven't paid me.
How about an off-the-wall algorithm to estimate how much each scraped input ends up influencing the bigger picture, as a way to work toward answering the copyright question?
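To make that idea a bit more concrete, here is a rough sketch of one way it could look: leave-one-out influence, where you refit without each input and see how much the held-out error moves. This is only an illustration under heavy assumptions. The "model" is a toy 1-D least-squares line, and the names `fit_line`, `mse`, and `leave_one_out_influence` are made up for this example. Nothing here is what any lab actually does, and refitting once per input would not scale to real training sets.

```python
from typing import List, Tuple

Point = Tuple[float, float]  # (input, target) pair standing in for one scraped item


def fit_line(data: List[Point]) -> Tuple[float, float]:
    """Ordinary least squares for y = a*x + b on 1-D data (toy stand-in for training)."""
    n = len(data)
    mean_x = sum(x for x, _ in data) / n
    mean_y = sum(y for _, y in data) / n
    var_x = sum((x - mean_x) ** 2 for x, _ in data)
    if var_x == 0:
        # All x values identical: fall back to predicting the mean.
        return 0.0, mean_y
    cov = sum((x - mean_x) * (y - mean_y) for x, y in data)
    a = cov / var_x
    return a, mean_y - a * mean_x


def mse(model: Tuple[float, float], data: List[Point]) -> float:
    """Mean squared error of the fitted line on a dataset."""
    a, b = model
    return sum((a * x + b - y) ** 2 for x, y in data) / len(data)


def leave_one_out_influence(train: List[Point], held_out: List[Point]) -> List[float]:
    """Score each training item by how much held-out error changes when it is removed.

    Positive score: error went up without the item.
    Negative score: error went down without the item.
    """
    base = mse(fit_line(train), held_out)
    scores = []
    for i in range(len(train)):
        reduced = train[:i] + train[i + 1:]
        scores.append(mse(fit_line(reduced), held_out) - base)
    return scores


# Hypothetical data: four points on y = 2x plus one outlier.
train = [(0.0, 0.0), (1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 20.0)]
held_out = [(5.0, 10.0), (6.0, 12.0)]
scores = leave_one_out_influence(train, held_out)
for point, score in zip(train, scores):
    print(point, round(score, 2))
```

In this toy data the outlier is the one item whose removal lowers held-out error, so it gets a negative score. Turning scores like these into anything resembling a per-source copyright accounting is a much harder and open question.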
To say they're better than the compute that OpenAI or Google are throwing at the problem is just plain wrong.
I left the ad industry the moment I realised my skills and talents are better used informing people than lying to them.
This thread is not at all comparing the ethical issues of AI with local anything. You're conflating your solution with another problem.
Meta/OpenAI/Google can fuck up a lot because of all their compute, but ultimately we learn from that, since the scientists doing the research at those companies would instantly bail if they couldn't publish papers on their techniques to show how clever they are.
Arguing that open source is the answer is an agenda for making your competition spin its wheels.
If you wouldn't mind reviewing https://news.ycombinator.com/newsguidelines.html and taking the intended spirit of the site more to heart, we'd be grateful.