I'm loath to defend Mullenweg but WordPress.com is not 43% of the internet. WordPress (the software) is used by 43% of websites, WordPress.com is used by a few million, making it smaller than Wix, Squarespace and even other WordPress hosts like WPEngine.com. Automattic has no relationship with a meaningful amount of end-user data.
Is there a repo of this website?
It would be good to have for preservation purposes.
Wordpress.anything in general must be a data juristional nightmare. Every plugin has access to UGC and could be sending bits of that anywhere.