Readit News logoReadit News
ntkris commented on Mistral OCR   mistral.ai/fr/news/mistra... · Posted by u/littlemerman
vikp · 9 months ago
I ran a partial benchmark against marker - https://github.com/VikParuchuri/marker .

Across 375 samples with LLM as a judge, mistral scores 4.32, and marker 4.41 . Marker can inference between 20 and 120 pages per second on an H100.

You can see the samples here - https://huggingface.co/datasets/datalab-to/marker_comparison... .

The code for the benchmark is here - https://github.com/VikParuchuri/marker/tree/master/benchmark... . Will run a full benchmark soon.

Mistral OCR is an impressive model, but OCR is a hard problem, and there is a significant risk of hallucinations/missing text with LLMs.

ntkris · 9 months ago
This is awesome. Have you seen / heard of any benchmarks where the data is actually a structured JSON vs. markdown?
ntkris commented on Show HN: Tile.run – Extract structured data from any document via API   tile.run/... · Posted by u/ntkris
rco8786 · a year ago
> We found that getting to accuracy that is reliable enough for automation is challenging.

This is in the problem description of your pitch, and leads me to believe that tile.run has been solving this problem. Is that right?

> Coming Soon:

> - Improved accuracy

Can you expand more?

I have a large need for this sort of tooling, but accuracy is my primary concern.

ntkris · a year ago
Yes, we needed to solve the problem for our other product (https://kili.so). We spent a lot of time getting accuracy up for dense and multi-page invoices. Then realised other teams have this need as well so decided to ship the API.

On the accuracy point, given our work so far we believe we are best in class in terms of accuracy for document extraction. We've also set up a system of evaluations internally that allow us to keep iterating and improving (hence us mentioning that we want to continue working on it).

ntkris commented on Show HN: Tile.run – Extract structured data from any document via API   tile.run/... · Posted by u/ntkris
namanyayg · a year ago
Offtopic but I'm so confused, how and why are there so many players in this space? Who even are the customers?
ntkris · a year ago
Not off topic at all!

I can only speak to our experience. Once you get under the hood, you find that this is a hard problem to solve.

There are also a lot of workflows that involve documents in every sector and every function. In other words, the opportunity is massive.

For our product, our customers are either internal engineering teams or folks building products that require document extraction but don’t want to invest time in it.

ntkris commented on Ask HN: Recommendations for London founder / startup meetups?    · Posted by u/tmitchel2
ntkris · a year ago
Are you just looking to meet other folks building or have a specific goal in mind (e.g. finding a cofounder)? I can recommend accordingly
ntkris commented on Show HN: Boards – Automate document-heavy tasks   kili.so/... · Posted by u/ntkris
SebRollen · a year ago
Do you have an API?
ntkris · a year ago
Not yet, but we're open to creating one
ntkris commented on Show HN: Boards – Automate document-heavy tasks   kili.so/... · Posted by u/ntkris
isawczuk · a year ago
If I understand correctly this tool focuses on ability to create easily extractors from documents. I'm wondering who is your target audience? If it's a company operations (accounting, procurement, etc),you missed it, they want to setup once then use it and access documents. If it's for developers to quickly build extractors for different company ops, then I'm missing API, scoring, integrations.
ntkris · a year ago
We're focussed on company operations (accounting, procurement etc).

The tool is totally self serve and does allow you to set up, upload and access documents.

We clearly need to call that out more so will add this to the landing page

ntkris commented on Show HN: Boards – Automate document-heavy tasks   kili.so/... · Posted by u/ntkris
pedalpete · a year ago
Looking at the pricing, a "credit" is dependent on the size and complexity of the content, but you haven't provided me with any idea what size or complexity mean, so it's a complete black box.
ntkris · a year ago
Great feedback, we will make this more clear

u/ntkris

KarmaCake day18November 10, 2020View Original