Table of contents:

Share this article

Lakehouse Days: January 2025

Jan 18, 2025, from 09:45 AM to 2:00 PM IST

Bengaluru

Lakehouse Days

About the event

Join us for an exclusive in-person event on “Apache Iceberg: understanding the internals, performance, and future” hosted by e6data in Gurgaon!

This meetup is designed specifically for data engineers, data architects, and senior software engineers who constantly seek to optimize their data architecture to make it more price-performant while delivering the best user experience. In this edition, we will dive deep into the internal architecture of open table formats like Apache Iceberg, the consequence of the announcement of the AWS S3 tables for Apache Iceberg, and streaming ingestion to Iceberg using a Rust-based solution. We aim to raise awareness about these open-table formats and gain a deeper understanding.

Lakehouse Days is designed to enable fellow data nerds to meet, network, and have insightful discussions on the entropic world of data.
‍

Meet the speakers

Soumil Shah, Sr. Software Engineer at Zeta Global
‍Ankur Ranjan, Senior Softwate Engineer at e6data

Topic: A deep dive into the AWS S3 Tables since the announcement

In this session, Soumil and Ankur will dissect and discuss AWS’s recent announcement of Amazon S3 Tables – a fully managed Apache Iceberg Table offering by AWS, optimized for analytics workloads. They will discuss the consequences of the announcement and how it will shape the Lakehouse world.

Time: 09:45 - 10:30 AM IST

‍

Sachin Tripathi, Senior Data Engineer at EarnIn

Topic: Apache Iceberg 101

This discussion covers key features such as iceberg's ACID-like transactions ,time travel, schema evolution, hidden partitioning, and catalogs. It also offers insights into optimizing analytics, managing metadata, and ensuring interoperability across multi-engine ecosystems, highlighting their advantages.

Time: 10:45 - 11:30 AM IST

‍

Karthic Rao, Principal Engineer at e6data
Shreyas Mishra, Software Development Engineer at e6data

Topic: Streaming ingestion to Apache Iceberg using a rust-based solution

Apache Iceberg is an open-source high-performance format for huge analytic tables that enables using SQL tables for big data while making it possible for engines like Spark, Trino, Flink, Presto, and e6data query engines. In this talk, Karthic and Shreyas will explain how they have re-imagined streaming ingestion to Apache Iceberg using a rust-based solution instead of Apache Flink, Spark Structure streaming, or a Kafka stream. Rust’s memory safety and concurrency features make it ideal for building efficient ingestion pipelines that can transform and write data directly into Iceberg’s table format. This ensures seamless integration, low-latency ingestion, and effective handling of schema evolution, enabling real-time analytics on fresh data.

Time: 11:45 - 12:30 PM IST

View All Posts

Lakehouse Days

27th July 2024 from 8:30 AM to 12:30 PM IST

Bengaluru

Lakehouse Views: July 2024

Vivek Bansal — Senior Software Engineer, Uber

Sudarsan Lakshmi Narasimhan — Engineering Team, e6data

Faiz Kothari — Senior Engineering Team, e6data

Sagar Prajapati — Founder, Geekcoders

Vishnu Vasanth — Founder & CEO, e6data

Lakehouse Days

17th August 2024 from 8:30 AM to 12:30 PM IST

Bengaluru

Lakehouse Days: August 2024

Sagar Sumit — Apache Hudi PMC & Senior Software Engineer, OneHouse

Ashutosh Kumar — Staff Engineer, PayPal

Sudarsan Lakshmi Narasimhan — Engineering Team, e6data

Kiran Nunna — Engineering Team, e6data

Vishnu Vasanth — Founder & CEO, e6data

Lakehouse Days

28th September 2024 from 9:00 AM to 2:00 PM IST

Bengaluru

Lakehouse Days: September Edition || Practice PySpark, SQL, and DSA problems with us

Padmapriya Uppala — Senior Data Engineer, Visa

Sai Vineel Thamishetty — Senior Data Engineer, Walmart

Ankur Ranjan — Senior Software Engineer, e6data

View All Posts

Subscribe to our newsletter - Data Engineering ACID

Lakehouse Days: January 2025

About the event

Meet the speakers

Related posts

Related posts

Lakehouse Views: July 2024

Lakehouse Days: August 2024

Lakehouse Days: September Edition || Practice PySpark, SQL, and DSA problems with us