It seems like such a crowded space and there are many tools doing document extraction, I wonder if there's anything particular pulling more attention into the space?
I would not underestimate any of these players in the space.
Thoughts?
On the other hand, I do agree with you that it is quite a big challenge for Fivetran to try and become Snowflake.
I think consolidation in the space has been coming for quite some time now and this merger only confirms what us, along with many others, have been saying: the data tooling is in a miserable state and we had to glue together a bunch of different tools that don't work with each other.
At this point, I think it is quite obvious that Fivetran is going for Snowflake/Databricks's market share. They own the ingestion for many companies already, and they will offer a managed data lake product in order to compete with the data giants. By owning the means of bringing the data in (Fivetran) as well as the transformation layer (dbt/sqlmesh) they will aim to get ahead of Snowflakes of the world.
I think it'll be a win for the data community if they maintain and continue investing into the existing tooling, as they are running in quite a few places already, especially dbt core running in a self-managed way. I certainly hope they won't try to squeeze revenue for the sake of it from their combined users.
It's an interesting time to be in the space, and it feels great to be one of the few independent players in the market.
I can see myself deploying something like this for some of our internal usecases, however the deployment options around such tooling seem to be rather complicated, especially when it comes to the need for RAG and stuff. Would I need to host something else on top of Upsonic for these sorts of usecases?
Golang just gets bogged down in irrelevant details way too easily for this.