You need to enable JavaScript to run this app.
Readit News
Posted by
u/mike_hearn
9 days ago
A summary of recent AI research (2016)
blog.plan99.net/the-scien...
CGMthrowaway
·
9 days ago
Funny how there was a lot of concerns then about reward hacking, something I never hear anyone talk about with current AI
jhurliman
·
9 days ago
I think it just got folded under the umbrella concept of model alignment. And it moved from theoretical discussions to practical daily struggles with LLMs deleting failing unit tests