Lakehouse Days: August 2024

Want to see e6data in action?

Learn how data teams power their workloads.

Get Demo
Get Demo

About the event

Join us for an exclusive in-person event on "Open table formats: Apache Iceberg, Delta, and Apache Hudi," hosted by e6data in collaboration with The Big Data Show. This meetup is designed specifically for data engineers, data architects, and senior software engineers who are constantly looking to optimise their data architecture to make it more price-performant while delivering the best user experience. In this edition, we will be deep-diving into the internal architecture of open table formats like Apache Iceberg, Delta, and Apache Hudi. We aim to raise awareness about these open-table formats and gain a deeper understanding of them.Lakehouse Days is designed to enable fellow data nerds to meet, network, and have insightful discussions on the entropic world of data.

Meet the speakers

Sagar Sumit, Apache Hudi PMC, Senior Software Engineer at OneHouse

Topic: Internals of Apache Hudi

Apache Hudi is a transactional data lake platform that brings database and data warehouse capabilities to the data lake. Hudi reimagines slow old-school batch data processing with a powerful new incremental processing framework for low latency minute-level analytics. This talk will focus on the internal architecture of Apache Hudi and demonstrate the internals of file layouts of Apache Hudi.

Time: 9:00 - 9:45 AM IST

Ashutosh Kumar, Staff Engineer at PayPal

Topic: Internals of Apache Iceberg

Apache Iceberg is an open-source high-performance format for huge analytic tables, which enables the use of SQL tables for big data while making it possible for engines like Spark, Trino, Flink, Presto, and e6data query engines. In this talk, we will discuss and demonstrate the data layer, the metadata layer, and metadata files, along with their subcomponents.

Time: 10:00 - 10:45 AM IST

Sudarsan Lakshmi Narasimhan and Kiran Nunna, Engineering team, e6data

Topic: Internals of Delta file format

Delta Lake is an open-source storage framework that enables building a format-agnostic Lakehouse architecture. In this talk, we will dive deep into Delta's internal file layout and what makes it so performant.

Time: 11:00 - 11:45 AM IST

Vishnu Vasanth, Founder & CEO, e6data

Topic: Panel discussion on emerging use cases and developments in the lakehouse ecosystem

Insights into the evolving landscape and emerging use cases centred around data lakehouse architecture, with emerging players in data catalogs, open table formats, query engines, and more by a curated panel of senior data architects and engineers from leading enterprises.

Time: 11:45 - 12:30 PM IST

Share on
Table of contents:

Subscribe to our newsletter - Data Engineering ACID

Get 3 weekly stories around data engineering at scale that the e6data team is reading.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Share this article

Lakehouse Days: August 2024

17th August 2024 from 8:30 AM to 12:30 PM IST
/
Bengaluru
Lakehouse Days

About the event

Join us for an exclusive in-person event on "Open table formats: Apache Iceberg, Delta, and Apache Hudi," hosted by e6data in collaboration with The Big Data Show. This meetup is designed specifically for data engineers, data architects, and senior software engineers who are constantly looking to optimise their data architecture to make it more price-performant while delivering the best user experience. In this edition, we will be deep-diving into the internal architecture of open table formats like Apache Iceberg, Delta, and Apache Hudi. We aim to raise awareness about these open-table formats and gain a deeper understanding of them.Lakehouse Days is designed to enable fellow data nerds to meet, network, and have insightful discussions on the entropic world of data.

Meet the speakers

Sagar Sumit, Apache Hudi PMC, Senior Software Engineer at OneHouse

Topic: Internals of Apache Hudi

Apache Hudi is a transactional data lake platform that brings database and data warehouse capabilities to the data lake. Hudi reimagines slow old-school batch data processing with a powerful new incremental processing framework for low latency minute-level analytics. This talk will focus on the internal architecture of Apache Hudi and demonstrate the internals of file layouts of Apache Hudi.

Time: 9:00 - 9:45 AM IST

Ashutosh Kumar, Staff Engineer at PayPal

Topic: Internals of Apache Iceberg

Apache Iceberg is an open-source high-performance format for huge analytic tables, which enables the use of SQL tables for big data while making it possible for engines like Spark, Trino, Flink, Presto, and e6data query engines. In this talk, we will discuss and demonstrate the data layer, the metadata layer, and metadata files, along with their subcomponents.

Time: 10:00 - 10:45 AM IST

Sudarsan Lakshmi Narasimhan and Kiran Nunna, Engineering team, e6data

Topic: Internals of Delta file format

Delta Lake is an open-source storage framework that enables building a format-agnostic Lakehouse architecture. In this talk, we will dive deep into Delta's internal file layout and what makes it so performant.

Time: 11:00 - 11:45 AM IST

Vishnu Vasanth, Founder & CEO, e6data

Topic: Panel discussion on emerging use cases and developments in the lakehouse ecosystem

Insights into the evolving landscape and emerging use cases centred around data lakehouse architecture, with emerging players in data catalogs, open table formats, query engines, and more by a curated panel of senior data architects and engineers from leading enterprises.

Time: 11:45 - 12:30 PM IST

Related posts

View All Posts

Related posts

Lakehouse Days
21st Dec 2024 from 8:45 AM to 1:30 PM IST
/
Bengaluru

Lakehouse Days: Dec 2024

Sachin Tripathi — Senior Data Engineer, EarnIn
Soumil Shah — Senior Software Engineer, Zeta Global
Vipul Bharat Marlecha — Senior Software Engineer, Netflix
Ankur Ranjan — Senior Software Engineer, e6data
Fenil Jain — Software Development Engineer, e6data
Lakehouse Days
27th July 2024 from 8:30 AM to 12:30 PM IST
/
Bengaluru

Lakehouse Views: July 2024

Vivek Bansal — Senior Software Engineer, Uber
Sudarsan Lakshmi Narasimhan — Engineering Team, e6data
Faiz Kothari — Senior Engineering Team, e6data
Sagar Prajapati — Founder, Geekcoders
Vishnu Vasanth — Founder & CEO, e6data
Lakehouse Days
17th August 2024 from 8:30 AM to 12:30 PM IST
/
Bengaluru

Lakehouse Days: August 2024

Sagar Sumit — Apache Hudi PMC & Senior Software Engineer, OneHouse
Ashutosh Kumar — Staff Engineer, PayPal
Sudarsan Lakshmi Narasimhan — Engineering Team, e6data
Kiran Nunna — Engineering Team, e6data
Vishnu Vasanth — Founder & CEO, e6data
View All Posts