What's the benchmark for how long something can be pre-1.0? Seems like a nonsense argument.
Something can be pre-1.0 as long as there are no stability guarantees.
There is a link to a previous post by the same author (within the first ten words even!), which contains the context you're looking for.
EDIT: This has since been fixed in link, so it is outdated.
Do you know you could just use the parsing engine that renders the PDF to get the output? I mean, why raster it, OCR it, and then use AI? Sounds creating a problem to use AI to solve it.
And that is before we even get into text structure, because as everyone knows, reading text is easier if things like paragraphs, columns and tables are preserved in the output. And guess what, if you just use the parsing engine for that, then what you get out is a garbled mess.