Readit News logoReadit News
pierre commented on Show HN: Price Per Token – LLM API Pricing Data   pricepertoken.com/... · Posted by u/alexellman
pierre · a month ago
Main issue is that token are not equivalent across provider / models. With huge disparity inside provider beyond the tokenizer model:

- An image will take 10x token on gpt-4o-mini vs gpt-4.

- On gemini 2.5 pro output token are token except if you are using structure output, then all character are count as a token each for billing.

- ...

Having the price per token is nice, but what is really needed is to know how much a given query / answer will cost you, as not all token are equals.

pierre commented on Ask HN: Who is hiring? (March 2025)    · Posted by u/whoishiring
pierre · 6 months ago
LlamaIndex | Senior/Staff Software Engineer (LlamaParse) | San-Francisco, CA | Remote | Full-time | $100K – $300K + Equity | https://www.llamaindex.ai/careers

LlamaIndex is building a platform for AI agents that can find information, synthesize insights, generate reports, and take actions over the most complex enterprise data.

We are seeking an exceptional engineer to join our growing LlamaParse team. Will work at the intersection of document processing, machine learning, and software engineering to push the boundaries of what's possible in document understanding. As a key member of a focused team, will have significant impact on our product's direction and technical architecture.

We are also hiring for a range of other roles, see our career page:

- Backend Software Engineer

- Forward Deploy Engineer

- Founding AI Engineer

- Open Source Engineer Python

- Founding Lead Product Manager

- Platform Engineer

- Senior Developer Relation Engineer

- Senior / Staff Backend Engineer

- Product Marketing Manager

pierre commented on Putting Andrew Ng's OCR models to the test   runpulse.com/blog/putting... · Posted by u/ritvikpandey21
serjester · 6 months ago
Personally I find it frustrating they called it "agentic" parsing when there's nothing agentic about it. Not surprised the quality is lackluster.
pierre · 6 months ago
If you want to try agentic parsing we added support for sonnet-3.7 agentic parse and gemini 2.0 in llamaParse. cloud.llamaindex.ai/parse (select advanced options / parse with agent then a model)

However this come at a high cost in token and latency, but result in way better parse quality. Hopefully with new model this can be improved.

pierre commented on A free, unlimited online PDF converter with Privacy focus   quicklypdf.com... · Posted by u/nicbars
cess11 · 8 months ago
How did you figure that out? Couldn't it be Poppler as well?
pierre · 8 months ago
I read the output header, and see the Artifex (mutools / gs team) headers
pierre commented on A free, unlimited online PDF converter with Privacy focus   quicklypdf.com... · Posted by u/nicbars
pierre · 8 months ago
This is a nice UI for end users, however it seems to be a seems wrapper on top of mutool, which is distributed as AGPL. If you want to process PDF locally, legally and safely you should use their CLI instead.
pierre commented on Show HN: PDF to MD by LLMs – Extract Text/Tables/Image Descriptives by GPT4o   github.com/yigitkonur/swi... · Posted by u/yigitkonur35
pierre · a year ago
Parsing docs using LVM is the way forward (also see OCR2 paper released last week, people are having ablot of success parsing with fine tunned Qwen2).

The hard part is to prevent the model ignoring some part of the page and halucinations (see some of the gpt4o sample here like the xanax notice:https://www.llamaindex.ai/blog/introducing-llamaparse-premiu...)

However this model will get better and we may soon have a good pdf to md model.

pierre commented on Show HN: LlamaExtract, a tool to automatically extract schema from documents   github.com/run-llama/llam... · Posted by u/pierre
BinaryBrain · a year ago
Does it handles multiple input documents for extraction?
pierre · a year ago
yes, you can pass an array of path to the extract function.

u/pierre

KarmaCake day1060June 1, 2014
About
pierre at pld.io
View Original