LLMs don't try to scam or fool you; LLM providers do.
Remember how Grok bragged that Musk had the “potential to drink piss better than any human in history” and was the “ultimate throat goat,” whose “blowjob prowess edges out” Donald Trump’s. Grok also posited that Musk was more physically fit than LeBron James, and that he would have been a better recipient of the 2016 porn industry award than porn star Riley Reid.
I had a chuckle reading all of these.
There is a vast gap between the output happening to be what you expect and the code actually being correct.
That is, in a way, also the fundamental issue with LLMs: They are designed to produce “expected” output, not correct output.
I didn't mean they get it right the first time, or that the output is correct. I mean that you can run it and test it to see whether it does what you want, in the way you want.
The same cannot be said of other topics such as medical advice, life advice, etc.
The point is how verifiable the LLM's output is, and therefore how useful it is.