Wow, OCR is now basically a general domain. I remember when I spent like a year trying to create one for receipts. Took me 6 months of data curation to prepare.
Yes, and its not just OCR (Optical Character Recognition), it understands layouts, captures signatures, charts, watermarks etc so way beyond just characters
Excited to share Nanonets-OCR2, a state-of-the-art suite of models designed for advanced image-to-markdown conversion and Visual Question Answering (VQA).
Nice job, the scores are superb.
Live Demo -> https://docstrange.nanonets.com/
Blog -> https://nanonets.com/research/nanonets-ocr-2/