Readit News logoReadit News
sla99 commented on Apache Hudi vs. Delta Lake vs. Apache Iceberg Lakehouse Feature Comparison   onehouse.ai/blog/apache-h... · Posted by u/bhasudha
marsupialtail_2 · 3 years ago
I think the blog post should point out very early that Onehouse is a Hudi company. There are some other recent benchmarks published in CIDR by Databricks that might paint a different picture: https://petereliaskraft.net/res/cidr_lakehouse.pdf
sla99 · 3 years ago
It looks like the benchmarks used the latest versions of Delta and Iceberg, but chose a version of Hudi that is over 6 months old. Hudi v0.12.2 is more advanced than v0.12.0 which the benchmark did not consider. As the Databricks CIDR paper states, and as mentioned in the Onehouse article, Hudi by default is optimized for UPSERTs vs INSERTs and is a 1-line config change that is appropriate for a true apples-apples comparison. See both: https://www.onehouse.ai/blog/apache-hudi-vs-delta-lake-trans... and https://github.com/brooklyn-data/delta/pull/2

u/sla99

KarmaCake day1January 14, 2023View Original