There appears to be a pattern. Unmet need is identified: "I want ChatGPT -- but able to read PDFs" or "I want ChatGPT -- but able to do research and produce lengthy reports." Startup gets funding for this and, if they're lucky, releases a rough beta that leans heavily on the OpenAI API. Two months later OpenAI launches a better, much more polished and seamless version, which is integrated into ChatGPT itself.
I had briefly considered forming a Legal AI startup (going so far as to download the fulltext of every legal ruling ever made in the US -- something like 400GB) but then o3/DeepResearch got so good that it became apparent that there'd be little point.
Steve Hsu claims to have solved hallucinations in a customer service context, which might be the only "startup-type" idea that has a head-start over the giants.
I sit on an AI evaluation committee for a huge law firm (It's just a regular old consulting gig) - we get so much inbound from (mostly kids) folks trying to build wrappers for some aspect of legal workflow, but behind the scenes thomson reuters is slowly adding everything they're going to need to software they have been using for 10 years now.
In many fields there is no moat. It’s an execution battle and it comes down to question: can the startup innovate faster and get to the customers or can the incumbent defend its existing distribution well enough.
Microsoft owns GitHub and VSCode yet cursor was able to out execute them. Legora is moving very quickly in the legal space. Not clear yet who will win.
They do exist, but not in the simplistic "AI but for X" you described. The article even explicitly says there were zero of those companies in that round.
You can see "AI startups" examples there, A company that manages your business outbound communication with lots of AI features. AI powered code generation for business operations, Accounting services with lots of AI features, business finance software with lots of AI features
This is the way. Use predictable logic and the control of automation with AI as augmentation. So not AI on the control loop, but AI as a helper. Who manages to find solutions that will find the sweet spot of this setup, will be the next success.
Yes, they do, because ultimately software UI is what gets regular people to use things.
To a technical user there may be little/no difference to you between prompt engineering into a chat box vs. clicking a button with premade text slots.
But to the average non-programmer, a chat app like ChatGPT is somewhat pigeonholed into the chat format, and so use cases that don’t lend themselves to this interface will be outcompeted by specific apps that do.
"I had briefly considered forming a Legal AI startup (going so far as to download the fulltext of every legal ruling ever made in the US -- something like 400GB) but then o3/DeepResearch got so good that it became apparent that there'd be little point."
I think it absolutely makes sense. ChatGPT's strength is how generalised it is as a tool, but openAI will never able to adapt the platform to every single use case. You can absolutely use it to learn a language for instance, but a great AI language learning platform needs a better tailored UI, it needs all kinds of non-AI functionality around it like idk a spaced repetition system, it might need to integrate into other platforms, and good prompting to be effective. AI isn't the product itself, but a component to try to solve a problem. And honestly I wish more startups focused less on simply "AI" and more on the problems it should solve.
If for nothing else openAI won't be able to market itself for every single use case, and so long as people aren't using chatGPT for some use case (even if it could perform the task) there's still an opening.
But any use case that gets large enough and makes money will be absorbed by openai direct, based on the market developed by the startup. OpenAI is using the Amazon model. Let someone else spend the money figuring out which market segments are profitable, then steal them with their inherently better access to the platform.
PACER exists at the federal level. Otherwise you have to piece it together from each jurisdiction yourself, defeating any anti-scraping measures in the process. Unless someone happens to have made such a dataset available via torrent at some point?
I'm doubtful. Remember when Google said their strategy was AI First? Baidu too? I'm old enough to remember that the criticism then was along the line "AI is technology. What problems do you want to solve?". The line of thinking seems still relevant to me today.
Thing is, OpenAI/Anthropic/etc. are demonstrably taking up those ideas. There actually were (are?) AI PDF reader startups and AI research assistant startups. (And AI coding startups, AI video startups, image analysis startups, etc.)
"AI startups," if they make sense, seem to have a very short shelf-life. They're either overtaken by the continuing improvement in LLM context windows, or, if there's a real and general unmet need for what they offer, the giants will tend to integrate it.
Because of a mix of comparative advantage and opportunity cost. Google as an entity absolutely dwarfs those other companies, and competes at that scale. Airbnb’s annual revenues are lower than Googles annual r&d spend. Google’s “wins” need to move the needle on a $2tn valuation, and an Airbnb size win doesn’t do that.
I see a pattern with AI companies. They always try to solve a really hard and not very useful problem. It's the same as with self driving car companies ten years ago: If you believe self driving tech is ripe for commercialization, the reasonable thing to do is something capital intensive and a special case where the technology most likely to succeed. For instance, heavy trucks automatically following others in formations for long drives. Saves gas, money, and potentially personnel.
There is a clear business case and buying large trucks is already a capex play. Then slowly work your way through more complex logistic problems from there. But no! The idea to sell was clearly the general problem including small cars that drive children to school through a suburban ice storm with lots of cyclists. Because that's clearly where the money is?
It's the same with AI. The consumer case is clearly there, people are easily impressed by it, and it is a given that consumers would pay to use it in products such as Illustrator, Logic Pro, modelling software etc. Maybe yet another try in radiology image processing, the death trap of startups for many decades now, but where there is obvious potential. But no! We want to design general purpose software -- in general purpose high level languages intended for human consumption! -- not even generating IR directly or running the model itself interactively.
If the technology really was good enough to do this type of work, why not find a specialized area with a few players limited by capex? Perhaps design a new competitive CPU? That's something we already have both specifications and tests for, and should be something a computer could do better than human. If an LLM could do a decent job there, it would easily be a billion dollar business. But no, let's write Python code and web apps!
AI allows for exquisite demos, demos that tantalize the audience into thinking of the infinite potential of the technology, that stunning vision expands and expands until the universe of potential overwhelms the dreamer into a state of terminal fantasy. So it is always a solution looking for a problem. There are cases where the two meet more realistically and a valuable impactful company develops it.
Agreed, the agents people are building are not solving the real issues.
The other thing people have been trying to do is build general agents e.g. Manus.
I just think this misses the key value add that agents can add at the moment.
A general agent would need to match the depth of every vertical agent, which is basically AGI. Until we reach AGI, verticalized agents for specific real
issues will be where the money/value is at.
That's exactly the approach to NLP which these super-successful LLMs are contradicting. They are generalists who can best with ease customized software developed over many years in all the subfields of NLP.
The railroad can’t have individual cars break off from the line to go to arbitrary warehouses, stores, and residences.
The railroad is an amazingly low cost way to move tonnage, if you’re going from a place where the railroad stops to another place where the railroad stops. There aren’t really companies that _could_ be using rail and aren’t.
But it just isn’t cost effective in many cases once you add in last-mile costs. If we built more rail (politically infeasible), you might see more usage but ultimately you still suffer from needing at least one locomotive per train.
Hah. Sort of. But the big difference is the railroad doesn't let anyone else use it. A regular road can support cars, trucks, truck convoys and maybe even bikes or pedestrians. A railroad can support trains.
> Zero LLM evaluation, observability, or tooling companies in the Series-A data.
This makes sense. The entire engineering/tooling field is so gonna change. Picking a winner makes isn't really possible. Most people are just starting to solve real problems with it and starting to build patterns that are not complete nonesense. But it will still change a lot
> “AI for X” verticals are surprisingly narrow.
I think that makes sense too. Those were a significant part of the initial hype. A lot of people promising that they'll take a "generic" LLM (which you all have seen how already smart that is) but now train it specifically on parenting, or trivia, or your emails, or your help center. It's a service type that will continue to exist. Perhaps it needs to tailor to a specific enterprise scenarios to gain traction as a startup. Though the need for these companies to manage the privacy concerns of the customers with their ability to inspect and look at the data and clean it might not be fully solved yet.
> Reducto - Reducto is an AI-driven API that specializes in converting unstructured documents like PDFs and images into structured data.
This is an example of the type of companies where "extracting LLM relevant context from X" and are relevant for any company doing the "AI for X" schtick or enterprise doing AI development on their own. This company is specifically about PDF and images, but we probably gonna see others that are for videos, archives, isos, msoffice docs, and even the ultimate holy grail of "universal binary => very rich structured data" API.
> Developer Tools & Infrastructure
The picks in this category are the most perplexing to me.
A too-narrow approach after Apple beefed it? Nobody knows how to bring AI to market yet but OpenAI, Anthropic, and Google. Long shots are one thing, but all the ideas I've heard for b2c AI so far are mostly more like pipe dreams. Look for a Zynga play once the field starts opening up for that in maybe a year or so, would be what I'd try to do.
"after Apple beefed it?" ... what? Apple's inability to improve their OS is somehow an indictment of B2C AI offerings in a general sense?
You seem unfamiliar with the space, there are plenty of players outside of OpenAI, Anthropic, and Google bringing AI to the consumer space: https://a16z.com/100-gen-ai-apps-4/
Consumer AI is arguably doing better than enterprise where 99% of the spend is poorly scaling undertakings that don't deliver on even 1/10th of their cost.
B2C has gone mostly 'free' which means it is either relying on shady business models, sensitive to regulation enforcement and so a risky investment, or a numbers game which requires significant upfront investment with a 'hit' business model return.
In both cases backing 1 company with significant investment is not rational.
3. YC network becomes large enough that startups can exist purely to serve other YC startups. (B2YC)
4. A new accelerator is launched which aims to fund YC companies that serve other YC companies. (YC4YC)
5. ?
Mostly joking, but I do sometimes look at the social media accounts of people in YC / Silicon Valley and wonder if they are living in an increasingly insular world. I think they would benefit from stepping outside of that into the greater world economy more deliberately.
It is also a sign of where something is in its cycle - when engines were first invented they laboured in mines originally, then moving traction engines/tractors, then trains - it was a long time before the average person owned an engine for their own use.
One way to interpret this might be that in consumer products, it's easier for incumbents to add AI to improve an already well-marketed product than to build and market one from scratch.
Yeah, and I think it’s also simply that inference with strong models is expensive.
OpenAI is lighting boatloads of money on fire to provide the ChatGPT free version. Same with Google for their search results AI, and perplexity which has also raised a lot. Unless you can raise a billion and find a unique wedge, it’s hard to even be in the game.
You can try to use small cheap models, but people will notice that free ChatGPT is 10x better.
How is it YC's fault the consumer apps failed to raise a Series A?
I personally have a consumer AI product that had 3 competitors get into YC, and they just didn't perform very well:
- One has so little distribution the only sign of life in the last 3 months was that they updated their landing page.
- Another released a disappointing app, didn't really iterate on it, and eventually pivoted into being a legal AI answering machine after that flopped.
- The third took down their app shortly after YC and pivoted to a content creation site for YT channels... then randomly let their site start going down, ignoring the customers, and doesn't seem to be doing anything anymore.
Meanwhile some competitors that didn't get into YC are now at 7 figure MRR (I'm at a measly 5 figure MRR). So it's not like the space these apps were in is as disastrous as these comments are making them out to be: YC took a chance and unfortunately these teams just weren't the right teams.
Hard to beat the chat interface when it comes to consumer products if you ask me. Pre-AI I often wished I could just talk to an application rather than try to figure out how the buttons the developers had chosen to wire up mapped onto what I was trying to achieve.
You would rather have a thing that solves a specific problem in a completely reliable way. An application that knows what you want to do because there is only one thing to do in the universe. AI can write it but never be it.
I don't want to sound like the infamous dropbox comment, but isn't Reducto just an LLM function call?
This was trickier 18 months ago, but every major LLM provider has solid support for this now. You can just drop an API call to Google, OpenAI, etc. your existing pipeline. What am I missing? Maybe the selling point was batch, but all LLM providers have a batch product now too.
To measure what's working for B2B YC companies, especially over the course of two years period you need to answer the following questions:
a) what are the growth rate when measuring customers which are *not* YC companies?
b) what is the churn rate for that same group.
Measuring by Series A (Assuming investors are the compass not the users - aka the market), is completely anti-YC the way I perceive the YC philosophy from afar.
Just raising as much money as you can when you can from even great firms is how so many founders get themselves in big trouble down the road and reduce their exit optionality or introduce deathly signaling risk. But if you can take lots of secondary to derisk and build that nest egg I guess have at it
I am curious about how this compares to past years.
I was pretty shocked that of 275 companies in the Winter 2023 batch, only 12 have received Series A deals. Granted, I know a huge part of that is that the VC environment has just collapsed due to the end of the ZIRP era, but those numbers at least sound pretty brutal to me.
There appears to be a pattern. Unmet need is identified: "I want ChatGPT -- but able to read PDFs" or "I want ChatGPT -- but able to do research and produce lengthy reports." Startup gets funding for this and, if they're lucky, releases a rough beta that leans heavily on the OpenAI API. Two months later OpenAI launches a better, much more polished and seamless version, which is integrated into ChatGPT itself.
I had briefly considered forming a Legal AI startup (going so far as to download the fulltext of every legal ruling ever made in the US -- something like 400GB) but then o3/DeepResearch got so good that it became apparent that there'd be little point.
Steve Hsu claims to have solved hallucinations in a customer service context, which might be the only "startup-type" idea that has a head-start over the giants.
Microsoft owns GitHub and VSCode yet cursor was able to out execute them. Legora is moving very quickly in the legal space. Not clear yet who will win.
You can see "AI startups" examples there, A company that manages your business outbound communication with lots of AI features. AI powered code generation for business operations, Accounting services with lots of AI features, business finance software with lots of AI features
To a technical user there may be little/no difference to you between prompt engineering into a chat box vs. clicking a button with premade text slots.
But to the average non-programmer, a chat app like ChatGPT is somewhat pigeonholed into the chat format, and so use cases that don’t lend themselves to this interface will be outcompeted by specific apps that do.
Has the download now been deleted
Will it be shared with others
500GB/1TB of storage is not expensive
Were any permissions obtained prior to download
If for nothing else openAI won't be able to market itself for every single use case, and so long as people aren't using chatGPT for some use case (even if it could perform the task) there's still an opening.
Where can I find this?
The "opinions" are what you want.
These are huge files heavily compressed, so they're quite difficult to handle.
I'm doubtful. Remember when Google said their strategy was AI First? Baidu too? I'm old enough to remember that the criticism then was along the line "AI is technology. What problems do you want to solve?". The line of thinking seems still relevant to me today.
Google has all the technical infrastructure, talent and everything to make something like AirBnB, Docusign and hell even intellij. Why not?
"AI startups," if they make sense, seem to have a very short shelf-life. They're either overtaken by the continuing improvement in LLM context windows, or, if there's a real and general unmet need for what they offer, the giants will tend to integrate it.
[1] https://workspace.google.com/resources/esignature/
There is a clear business case and buying large trucks is already a capex play. Then slowly work your way through more complex logistic problems from there. But no! The idea to sell was clearly the general problem including small cars that drive children to school through a suburban ice storm with lots of cyclists. Because that's clearly where the money is?
It's the same with AI. The consumer case is clearly there, people are easily impressed by it, and it is a given that consumers would pay to use it in products such as Illustrator, Logic Pro, modelling software etc. Maybe yet another try in radiology image processing, the death trap of startups for many decades now, but where there is obvious potential. But no! We want to design general purpose software -- in general purpose high level languages intended for human consumption! -- not even generating IR directly or running the model itself interactively.
If the technology really was good enough to do this type of work, why not find a specialized area with a few players limited by capex? Perhaps design a new competitive CPU? That's something we already have both specifications and tests for, and should be something a computer could do better than human. If an LLM could do a decent job there, it would easily be a billion dollar business. But no, let's write Python code and web apps!
The other thing people have been trying to do is build general agents e.g. Manus.
I just think this misses the key value add that agents can add at the moment.
A general agent would need to match the depth of every vertical agent, which is basically AGI. Until we reach AGI, verticalized agents for specific real issues will be where the money/value is at.
Congratulations, you just reinvented the railroad.
The railroad is an amazingly low cost way to move tonnage, if you’re going from a place where the railroad stops to another place where the railroad stops. There aren’t really companies that _could_ be using rail and aren’t.
But it just isn’t cost effective in many cases once you add in last-mile costs. If we built more rail (politically infeasible), you might see more usage but ultimately you still suffer from needing at least one locomotive per train.
This makes sense. The entire engineering/tooling field is so gonna change. Picking a winner makes isn't really possible. Most people are just starting to solve real problems with it and starting to build patterns that are not complete nonesense. But it will still change a lot
> “AI for X” verticals are surprisingly narrow.
I think that makes sense too. Those were a significant part of the initial hype. A lot of people promising that they'll take a "generic" LLM (which you all have seen how already smart that is) but now train it specifically on parenting, or trivia, or your emails, or your help center. It's a service type that will continue to exist. Perhaps it needs to tailor to a specific enterprise scenarios to gain traction as a startup. Though the need for these companies to manage the privacy concerns of the customers with their ability to inspect and look at the data and clean it might not be fully solved yet.
> Reducto - Reducto is an AI-driven API that specializes in converting unstructured documents like PDFs and images into structured data.
This is an example of the type of companies where "extracting LLM relevant context from X" and are relevant for any company doing the "AI for X" schtick or enterprise doing AI development on their own. This company is specifically about PDF and images, but we probably gonna see others that are for videos, archives, isos, msoffice docs, and even the ultimate holy grail of "universal binary => very rich structured data" API.
> Developer Tools & Infrastructure
The picks in this category are the most perplexing to me.
You seem unfamiliar with the space, there are plenty of players outside of OpenAI, Anthropic, and Google bringing AI to the consumer space: https://a16z.com/100-gen-ai-apps-4/
Consumer AI is arguably doing better than enterprise where 99% of the spend is poorly scaling undertakings that don't deliver on even 1/10th of their cost.
In both cases backing 1 company with significant investment is not rational.
1. YC startups target consumers. (B2C)
2. YC startups target businesses. (B2B)
3. YC network becomes large enough that startups can exist purely to serve other YC startups. (B2YC)
4. A new accelerator is launched which aims to fund YC companies that serve other YC companies. (YC4YC)
5. ?
Mostly joking, but I do sometimes look at the social media accounts of people in YC / Silicon Valley and wonder if they are living in an increasingly insular world. I think they would benefit from stepping outside of that into the greater world economy more deliberately.
OpenAI is lighting boatloads of money on fire to provide the ChatGPT free version. Same with Google for their search results AI, and perplexity which has also raised a lot. Unless you can raise a billion and find a unique wedge, it’s hard to even be in the game.
You can try to use small cheap models, but people will notice that free ChatGPT is 10x better.
I personally have a consumer AI product that had 3 competitors get into YC, and they just didn't perform very well:
- One has so little distribution the only sign of life in the last 3 months was that they updated their landing page.
- Another released a disappointing app, didn't really iterate on it, and eventually pivoted into being a legal AI answering machine after that flopped.
- The third took down their app shortly after YC and pivoted to a content creation site for YT channels... then randomly let their site start going down, ignoring the customers, and doesn't seem to be doing anything anymore.
Meanwhile some competitors that didn't get into YC are now at 7 figure MRR (I'm at a measly 5 figure MRR). So it's not like the space these apps were in is as disastrous as these comments are making them out to be: YC took a chance and unfortunately these teams just weren't the right teams.
You would rather have a thing that solves a specific problem in a completely reliable way. An application that knows what you want to do because there is only one thing to do in the universe. AI can write it but never be it.
How about it just working? No need to ask. The way a great assistant just makes the things you need and want happen.
This was trickier 18 months ago, but every major LLM provider has solid support for this now. You can just drop an API call to Google, OpenAI, etc. your existing pipeline. What am I missing? Maybe the selling point was batch, but all LLM providers have a batch product now too.
It isn't a no-brainer. Many founders in the new era are weighing bootstrapping, seed-strapping, and VC money without a clear answer.
If you can see a path to growing MMs of revenue with little need for staff, you may just not go do a follow-on round.
Deleted Comment
I was pretty shocked that of 275 companies in the Winter 2023 batch, only 12 have received Series A deals. Granted, I know a huge part of that is that the VC environment has just collapsed due to the end of the ZIRP era, but those numbers at least sound pretty brutal to me.