Lakehouse Days: March 2025, Bengaluru

Want to see e6data in action?

Learn how data teams power their workloads.

Get Demo
Get Demo

About the Event

Join us for an exclusive in-person event on “Introduction, optimization, integrations, table management, streaming data” hosted by e6data in Bengaluru!

Lakehouse Days - in collaboration with RisingWave, is designed specifically for data engineers, data architects, and senior software engineers who constantly seek to optimize their data architecture to make it more price-performant while delivering the best user experience. In this edition, we will dive deep into the internal architecture of open table formats like Apache Iceberg, merge-on-read query, serverless compaction and Iceberg table sharing with RisingWave, optimizing query performance, handling data transfers with Apache Arrow Flight, and Iceberg’s integration with GCP.

Lakehouse Days - in collaboration with RisingWave, is designed to enable fellow data geeks to meet, network, and have insightful discussions on the entropic world of data.

Register Now!

Reserve your spot through this link: https://lu.ma/pd0r4bmr?utm_source=website 

Venue - Accel LaunchPad, Koramangala

​Date and time - Mar 22, 2025, from 09:30 AM to 2:00 PM

Meet the Speakers

Rayees Pasha, CPO, RisingWave Labs

Topic: Streaming-first Approach to Iceberg with RisingWave

Summary: The session will provide an overview of the technical challenges of building a new Iceberg Table engine that is purpose-built for streaming workloads. The talk will highlight how RisingWave has built end-to-end key capabilities for Iceberg table management, including Iceberg’s merge-on-read query, Serverless Compaction, and Iceberg table sharing to allow direct queries from other engines. A key feature in this project is the native Iceberg compaction service written in Rust using Apache DataFusion and Apache Iceberg-Rust as foundational components.

Time: 09:30 - 10:15 AM IST

Ankur Ranjan, Sr Software Engineer, e6data

Topic: Apache Arrow Flight: Reshaping How We Handle Data Transfers

Summary: In this talk, we will explore how Apache Arrow Flight overcomes the challenges of traditional protocols like ODBC and JDBC by providing a columnar-native transport that maintains data in its original format throughout the transfer process. Arrow Flight promises to enhance analytical workloads and align perfectly with modern data architectures by eliminating unnecessary conversions and streamlining data transfers. Join us to discover how this innovative approach can substantially improve data processing efficiency.

Time: 10:30 - 11:15 AM IST

Sai Vineel Thamishetty, Sr Data Engineer, Walmart

Topic: Apache Iceberg with Google Cloud Platform (GCP)

Summary: This talk will explore the exciting developments with Apache Iceberg and its integration with Google Cloud Platform. Iceberg is now allowing users to store tables on Google Cloud Storage, which means we can use GCP’s scalable infrastructure alongside Iceberg’s performance enhancements. Popular data processing engines like Apache Spark and Trino have improved their support for Iceberg, making it easier for us to work with these tables directly in the cloud. There’s also a lot of buzz around improving interoperability with BigQuery, which could facilitate smoother data transfers and queries.

Time: 11:30 AM - 12:15 PM IST

Read more about Apache Iceberg

Share on
Table of contents:

Subscribe to our newsletter - Data Engineering ACID

Get 3 weekly stories around data engineering at scale that the e6data team is reading.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Share this article

Lakehouse Days: March 2025, Bengaluru

March 22, 2025 from 09:30 AM to 02:00 PM IST
/
Bengaluru
Lakehouse Days

About the Event

Join us for an exclusive in-person event on “Introduction, optimization, integrations, table management, streaming data” hosted by e6data in Bengaluru!

Lakehouse Days - in collaboration with RisingWave, is designed specifically for data engineers, data architects, and senior software engineers who constantly seek to optimize their data architecture to make it more price-performant while delivering the best user experience. In this edition, we will dive deep into the internal architecture of open table formats like Apache Iceberg, merge-on-read query, serverless compaction and Iceberg table sharing with RisingWave, optimizing query performance, handling data transfers with Apache Arrow Flight, and Iceberg’s integration with GCP.

Lakehouse Days - in collaboration with RisingWave, is designed to enable fellow data geeks to meet, network, and have insightful discussions on the entropic world of data.

Register Now!

Reserve your spot through this link: https://lu.ma/pd0r4bmr?utm_source=website 

Venue - Accel LaunchPad, Koramangala

​Date and time - Mar 22, 2025, from 09:30 AM to 2:00 PM

Meet the Speakers

Rayees Pasha, CPO, RisingWave Labs

Topic: Streaming-first Approach to Iceberg with RisingWave

Summary: The session will provide an overview of the technical challenges of building a new Iceberg Table engine that is purpose-built for streaming workloads. The talk will highlight how RisingWave has built end-to-end key capabilities for Iceberg table management, including Iceberg’s merge-on-read query, Serverless Compaction, and Iceberg table sharing to allow direct queries from other engines. A key feature in this project is the native Iceberg compaction service written in Rust using Apache DataFusion and Apache Iceberg-Rust as foundational components.

Time: 09:30 - 10:15 AM IST

Ankur Ranjan, Sr Software Engineer, e6data

Topic: Apache Arrow Flight: Reshaping How We Handle Data Transfers

Summary: In this talk, we will explore how Apache Arrow Flight overcomes the challenges of traditional protocols like ODBC and JDBC by providing a columnar-native transport that maintains data in its original format throughout the transfer process. Arrow Flight promises to enhance analytical workloads and align perfectly with modern data architectures by eliminating unnecessary conversions and streamlining data transfers. Join us to discover how this innovative approach can substantially improve data processing efficiency.

Time: 10:30 - 11:15 AM IST

Sai Vineel Thamishetty, Sr Data Engineer, Walmart

Topic: Apache Iceberg with Google Cloud Platform (GCP)

Summary: This talk will explore the exciting developments with Apache Iceberg and its integration with Google Cloud Platform. Iceberg is now allowing users to store tables on Google Cloud Storage, which means we can use GCP’s scalable infrastructure alongside Iceberg’s performance enhancements. Popular data processing engines like Apache Spark and Trino have improved their support for Iceberg, making it easier for us to work with these tables directly in the cloud. There’s also a lot of buzz around improving interoperability with BigQuery, which could facilitate smoother data transfers and queries.

Time: 11:30 AM - 12:15 PM IST

Read more about Apache Iceberg

Related posts

View All Posts

Related posts

Lakehouse Days
21st Dec 2024 from 8:45 AM to 1:30 PM IST
/
Bengaluru

Lakehouse Days: Dec 2024

Sachin Tripathi — Senior Data Engineer, EarnIn
Soumil Shah — Senior Software Engineer, Zeta Global
Vipul Bharat Marlecha — Senior Software Engineer, Netflix
Ankur Ranjan — Senior Software Engineer, e6data
Fenil Jain — Software Development Engineer, e6data
Lakehouse Days
27th July 2024 from 8:30 AM to 12:30 PM IST
/
Bengaluru

Lakehouse Views: July 2024

Vivek Bansal — Senior Software Engineer, Uber
Sudarsan Lakshmi Narasimhan — Engineering Team, e6data
Faiz Kothari — Senior Engineering Team, e6data
Sagar Prajapati — Founder, Geekcoders
Vishnu Vasanth — Founder & CEO, e6data
Lakehouse Days
17th August 2024 from 8:30 AM to 12:30 PM IST
/
Bengaluru

Lakehouse Days: August 2024

Sagar Sumit — Apache Hudi PMC & Senior Software Engineer, OneHouse
Ashutosh Kumar — Staff Engineer, PayPal
Sudarsan Lakshmi Narasimhan — Engineering Team, e6data
Kiran Nunna — Engineering Team, e6data
Vishnu Vasanth — Founder & CEO, e6data
View All Posts