Readit News logoReadit News
dar8919 commented on GPT 4.5 level for 1% of the price   twitter.com/Baidu_Inc/sta... · Posted by u/decide1000
folli · 6 months ago
Hijacking this thread: what's currently the cheapest way to get structured data out of a PDF?

I assume there's some reasonable tool out there to convert PDFs to Markup and than feed it to some LLM API with okay costs (Gemini? DeepSeek?). Any suggestions?

dar8919 · 6 months ago
https://mistral.ai/news/mistral-ocr , recent release. Its been a step function improvement for my pipelines
dar8919 commented on OpenAI Sales Agent Demo   twitter.com/charliebitda/... · Posted by u/pr337h4m
vekker · 7 months ago
Meh, pretty basic. Most SaaS businesses have something like this in place already.

Closing inbound leads is relatively easy, since they've already shown active interest... The challenge I'm struggling with is (cold) lead generation: finding leads (and how to contact them) that match well with the service you're offering.

There are a lot of dubious scraping tools and B2B lead databases, but I feel like it should now be relatively easy to build a reliable web crawler & lead generator ... Does anyone know state of the art open source tools or services for this?

dar8919 · 7 months ago
Curious why open source? At the end of the day, lead gen is mostly about data curation. You’re either paying for access to curated feeds or spending time/money building your own pipeline.
dar8919 commented on A powerful tool for converting speech into text   app.trintai.com/... · Posted by u/allamaso
dar8919 · a year ago
Can you share more about how it compares with other popular tools in this space? Very hard to tell from your homepage
dar8919 commented on Snowflake Launches Text-Embedding Model for Retrieval Use Cases   snowflake.com/blog/introd... · Posted by u/xkgt
dar8919 · a year ago
Wow, shows very good performance on my wikipedia dataset. Incredible that companies are open sourcing so much good stuff. Hope this trend continues.
dar8919 commented on When Will the GenAI Bubble Burst?   garymarcus.substack.com/p... · Posted by u/isaacfrond
aurareturn · a year ago
Here's one of many example use cases we found for GPT4 API:

Our sales people request invoices from a potential customer. On those invoices are our competitor's services and price. Invoices can come in PDF, png, jpeg, excel, csv, email formats. Content formatting can come in random forms. Pricing breakdowns are also non-standard across invoices. We have matching services and our own prices.

The goal is to find similar services where we charge less. In the past, our sales people would spend hours combing through those invoices. We wrote a prompt for GPT4, fed in our services and prices, and asked it to find services we could potentially replace as well as our profit margin. It took us a day to write this prompt. The results were outstanding and GPT4 gave accurate results. We even asked it to package it up in a PDF for us to send to the potential customers. On a Monday morning, we started on the prompt. By Tuesday morning, we got it working well enough that we were confident ship it to a few of our sales people to test.

This will save our company hundreds of thousands each year and we can get back to the potential customer much faster than before - increasing the likelihood of a sale.

If we had to program this like normal software, it'd probably take months to get it right with dedicated engineering resources to account for new invoice edge cases. Chances are, engineering would never even prioritize this feature for our sales people because there is simply no economical way to account for so many different invoice edge cases.

I believe we're just getting started. If we get GPT6 in two years and massive improvement in inference cost and context size, it's going to change everything we do. Heck, even GPT4 with 100x context size and 100x lower cost per inference would be transformative.

If this is a bubble, I'd like to live in it. I believe that many businesses have found use cases similar to the impact of ours. But they're just not broadcasting them to the internet in order to keep it a business advantage.

dar8919 · a year ago
Thanks for the example and that sounds really solid cost savings and definitely agree with the trend that it is here to stay.

For invoice parsing (various formats), are you just using GPT4V? When GPT4V initially came out, i benchmarked it against an out of the box invoice parser from Google Cloud (https://cloud.google.com/document-ai) on 16 documents and it was much better accuracy wise. For ex: i'd get results parsing 10,100 as 101100 (no comma).

Curious if you saw problems like this in your pipeline or if its gotten much better since?

dar8919 commented on How do you learn to create captivating 3D animation with AI?   sustainablehorizons.ai/... · Posted by u/dar8919
dar8919 · 2 years ago
Hi HN! I recently came across an really impressive 3D animation on Sustainable Horizons website: https://sustainablehorizons.ai/ (I have no affiliation)

The level of detail is so good. Website claims using Generative AI (and three.js).

I was curious if anyone had pointers on how something a AI-powered 3D animation authoring workflow would look like. Any guidance or pointers would be greatly appreciated!

Deleted Comment

u/dar8919

KarmaCake day188August 17, 2013
About
akhilravidas.com

CTO, Flyflow (YC S24)

View Original