This is in the problem description of your pitch, and leads me to believe that tile.run has been solving this problem. Is that right?
> Coming Soon:
> - Improved accuracy
Can you expand on that?
I have a large need for this sort of tooling, but accuracy is my primary concern.
On the accuracy point: given our work so far, we believe we're best in class for document extraction. We've also set up a system of internal evaluations that lets us keep iterating and improving (hence the mention that we want to continue working on it).
I can only speak to our experience. Once you get under the hood, you find that this is a hard problem to solve.
There are also a lot of workflows that involve documents in every sector and every function. In other words, the opportunity is massive.
For our product, our customers are either internal engineering teams or folks building products that require document extraction but don’t want to invest time in it.
The tool is fully self-serve and lets you set up, upload, and access documents on your own.
We clearly need to call that out more, so we'll add it to the landing page.
Across 375 samples with an LLM as a judge, Mistral scores 4.32 and marker 4.41. Marker can run inference at 20 to 120 pages per second on an H100.
You can see the samples here - https://huggingface.co/datasets/datalab-to/marker_comparison...
The code for the benchmark is here - https://github.com/VikParuchuri/marker/tree/master/benchmark... I'll run a full benchmark soon.
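For anyone unfamiliar with "LLM as a judge", here's a rough sketch of the idea: send the reference text and the OCR output to a model and ask for a 1-5 faithfulness score, then average over samples. The prompt, judge model, and function names below are my own illustrative assumptions, not the actual benchmark code linked above:

```python
# Minimal sketch of LLM-as-judge scoring for OCR output.
# Prompt wording, judge model, and data layout are illustrative
# assumptions, not the linked marker benchmark code.
import re
from statistics import mean

from openai import OpenAI  # pip install openai

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

JUDGE_PROMPT = """Rate how faithfully the OCR output reproduces the
reference text, on a scale of 1 (unusable) to 5 (perfect).
Reply with a single number.

Reference:
{reference}

OCR output:
{candidate}"""


def judge_score(reference: str, candidate: str) -> float:
    """Ask the judge model for a 1-5 faithfulness score."""
    resp = client.chat.completions.create(
        model="gpt-4o",  # assumed judge model
        messages=[{
            "role": "user",
            "content": JUDGE_PROMPT.format(reference=reference,
                                           candidate=candidate),
        }],
        temperature=0,  # deterministic scoring
    )
    match = re.search(r"[1-5](?:\.\d+)?", resp.choices[0].message.content)
    return float(match.group()) if match else 0.0


def mean_score(samples: list[tuple[str, str]]) -> float:
    """Average judge score over (reference, ocr_output) pairs."""
    return mean(judge_score(ref, out) for ref, out in samples)
```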
Mistral OCR is an impressive model, but OCR is a hard problem, and there is a significant risk of hallucinations/missing text with LLMs.