It's perhaps "unfair" but it's also extremely pragmatic.
Agreed.
> and communicating across processes often requires making expensive copies of data
SharedMemory [0] exists. Never understood why this isn’t used more frequently. There’s even a ShareableList which does exactly what it sounds like, and is awesome.
[0]: https://docs.python.org/3/library/multiprocessing.shared_mem...
In contrast, no one thinks about what happens if a thread dies independently because the failure mode is joint.
There are some examples[0] of enabling DuckDB to manage distributed workloads, but these are pretty experimental.
The product is still centered Spark, but most companies don't want or need Spark and a combination of Iceberg and DuckDB will work for 95% of companies. It's cheaper, just as fast or faster and way easier to reason about.
We're building a data platform around that premise at Definite[0]. It includes everything you need to get started with data (ETL, BI, datalake).
You are incorrect.
> Apple Inc has assembled $14 billion worth of iPhones in India in fiscal 2024, Bloomberg News reported on Wednesday.
That design allows arbitrary nested JSON data to be indexed using inverted indexes on top a variation of B-trees called Bw-trees, and seems like a nice way of indexing data automatically in a way that preserves the ability to do both exact and range matching on arbitrarily nested values.
Not sure if the query capabilities and syntax match azure docdb but the basic functionality should be workable.
If a transaction log is replayed, then an identical set of relations will be obtained. Ergo, the log is the prime form of the database.
It’s that simple.
This also demonstrates the type of things Google could do with Gemini integrated into Google Docs if they step up their game a bit.
Honestly I’m scratching my head on OpenAI’s desire to double down on building out their consumer B2C use cases rather than truly focussing on being the infrastructure/API provider for other services to plug into. If I had to make a prediction, I think OpenAI will end up being either an infrastructure provider OR a SaaS, but not both, in the long-term (5-10 yrs from now).