Kai Waehner’s Post

View profile for Kai Waehner, graphic
Kai Waehner Kai Waehner is an Influencer

Global Field CTO | Author | International Speaker | Follow me with Data in Motion

"Apache Iceberg – The Open Table Format for Lakehouse AND Data Streaming" => My latest blog post... Perfect weekend read :-) Every data-driven organization has operational and analytical workloads. A best of breed approach emerges with various data platforms, including #datastreaming, data lake, data warehouse and #lakehouse solutions and cloud services. An open table format framework like #ApacheIceberg is essential in the enterprise architecture to ensure reliable data management and sharing, seamless schema evolution, efficient handling of large-scale datasets and cost-efficient storage while providing strong support for ACID transactions and time travel queries. This blog post explores market trends, adoption of table format frameworks like Iceberg, #ApacheHudi, #DeltaLake and #ApacheXTable, and the product strategy of leading vendors of data platforms such as Snowflake, Databricks (Apache Spark), Confluent (Apache Kafka / Flink), Amazon Athena and Google BigQuery. Looking forward to your thoughts and feedback on one of the most interesting developments in the software industry... https://lnkd.in/edNst5wR

  • No alternative text description for this image

I still don't get what makes Iceberg so special, before all those table wars there was Kudu from Cloudera, which has never really taken off, and say Delta Lake - by feature-to-feature comparisons it's just better, why all off this fuzz about Iceberg?

Like
Reply
Kyle Weller

Head of Product @ Onehouse.ai | ex Azure Databricks

4w

It's sad that Iceberg isn't even well suited for Confluent or streaming, but because one other vendor chose to support only iceberg it forces the dominoes to fall. Paimon and Hud (even Delta to some degree) are miles ahead on performance and functionality for real-time scenarios, but seems few people want to talk about the technical engineering details... 🤦

Kai, curious, how would you fit Apache Paimon into this discussion, as another new kid on the block? G

Ruedi Blattmann

Managing Partner at LSCP Life Sciences Consulting Partners

4w

Kai, nice graph, thanks BUT, "you name it" has its limitations, as all the others: which enables your business purpose best ...

Like
Reply
Vishwajeet Dabholkar

Solutions Engineer| Prompt Engineer| GenAI | VectorsDBs | RAG Applications | LLM applicatios | Data Engineer | Python | PySpark | SQL | SingleStore Database |AWS | Databricks | Azure | REST API | Ex-TCS | Ex-Lumiq.ai

4w

SingleStore Also supports integration with Apache Iceberg (in public preview). You can now use SingleStore to seamlessly ingest data from and write back to Iceberg tables — in real time, with no additional tooling required. Check this out: https://www.singlestore.com/blog/bidirectional-integration-for-apache-iceberg/

Like
Reply
Chris Z.

Partner at Wing Venture Capital

3w

Your content is excellent!

Like
Reply
POOJA JAIN

Storyteller | Linkedin Top Voice 2024 | Senior Data Engineer@ Globant | Linkedin Learning Instructor | 2xGCP & AWS Certified | LICAP'2022

4w

Splendid post shared on APACHE ICEBERG. Kai Waehner Data engineers must have a look at the features it provides for data streaming and data lake house.

John K. Moran

Helping retailers solve their data problems.

4w

Your nuanced analysis of market trends and product strategies adds great depth to the discussion.

Andrew C. Madson

Data Doctor | Professor | 250k+ Subscribers

4w

Great post, Kai!

D. ABDUL GANI

Web Developement Enthusiast | Passionate About Learning IT Technologies | Final Year Student@Sri venkatesa perumal college of Engineering And Technology

4w

Love this

Like
Reply
See more comments

To view or add a comment, sign in

Explore topics