You need to enable JavaScript to run this app.
Readit News
Posted by
u/let_tim_cook_
10 months ago
Digital Agent outperforms o1 by 15% – trained with new RL-variant similar to R1
arxiv.org/abs/2502.01600...
No comments