https://soda-inria.github.io/carte/https://arxiv.org/pdf/2402.16785
The paper includes a comparison to TabPFN v1 (among others), noting the lack of categorical & missing values handling which v2 now seems to have. Would be curious to see an updated comparison.
[1] https://github.com/tensorflow/decision-forests [2] https://github.com/google/yggdrasil-decision-forests
Bonus question: are the stats you're mentioning publically available?
https://arxiv.org/abs/2207.01848