Three comments here:
1. I wonder how many of the 20M prompts got a thumbs up or down. I don't think people click that a lot. Unless the UI enforces it. I haven't used Gemini, so I might be unaware.
2. Judging a single response might be not enough to tell if watermarking is acceptable or not. For instance, imagine the watermarking is adding "However," to the start of each paragraph. In a single GPT interaction you might not notice it. Once you get 3 or 4 responses it might stand out.
3. Since when Google is happy with measuring by self declared satisfaction? Aren't they the kings of A/B testing and high volume analysis of usage behavior?
These two sentences next to each other don't make much sense. Or are misleading.
Yeah. I know. Only the client is open source and it calls home.
(See^^^)
I guess the article could be called Falsehoods Programmers Assume of Programming Language Syntaxes.
However idiotic it may be - people are far more willing to pay $$$$$ to have a broken leg treated than they are willing to pay $ for salt or sand to put on their icy sidewalk.