Engineering

The Ultimate Resource Hub for Optimizing Iceberg Tables

Using links, guides, and resources to optimize lakehouse tables

Optimizing Lakehouse Tables

Want to see e6data in action?

Learn how data teams power their workloads.

Get Demo
Get Demo

At e6data, we are big admirers of Apache Iceberg. We're witnessing a steep increase in its adoption, with our customers running E6data's query engine for heavy workloads.

While we were scrambling for resources on the internet to optimize Iceberg, why not curate it for the rest of the community?

Here's a curated collection of links, guides, and insights to help you discover the best practices for optimizing your Iceberg tables.  

- Optimization Strategies for Iceberg Tables by Cloudera    
- Compaction in Apache Iceberg: Fine-Tuning Your Iceberg Table’s Data Files by Dremio    
- Improving performance with Iceberg sorted tables by Starburst    
- Partitioning and Indexing in Apache Iceberg by IOMETE    
- Optimizing read performance by AWS    
- Maintaining tables by using compaction by AWS    
- Iceberg 101: A Guide to Iceberg Partitioning by Upsolver    
- Iceberg Tables Optimization by Upsolver    
- How Z-Ordering in Apache Iceberg Helps Improve Performance by Dremio    
- Z-ORDER sorting during compaction by IOMETE    
- Iceberg 101: Ten Tips to Optimize Performance by Upsolver    
- Optimizing Iceberg tables by AWS    
- https://iceberg.apache.org/docs/1.6.0/performance/ Apache Iceberg official documentation    
- Manage and Optimize Iceberg tables for efficient data storage and querying by AWS    
- Best practices for optimizing Apache Iceberg workloads by AWS

Check out our GitHub repository for more resources on optimizing lakehouse tables.

Share on

Build future-proof data products

Try e6data for your heavy workloads!

Get Started for Free
Get Started for Free
Frequently asked questions (FAQs)
How do I integrate e6data with my existing data infrastructure?

We are universally interoperable and open-source friendly. We can integrate across any object store, table format, data catalog, governance tools, BI tools, and other data applications.

How does billing work?

We use a usage-based pricing model based on vCPU consumption. Your billing is determined by the number of vCPUs used, ensuring you only pay for the compute power you actually consume.

What kind of file formats does e6data support?

We support all types of file formats, like Parquet, ORC, JSON, CSV, AVRO, and others.

What kind of performance improvements can I expect with e6data?

e6data promises a 5 to 10 times faster querying speed across any concurrency at over 50% lower total cost of ownership across the workloads as compared to any compute engine in the market.

What kinds of deployment models are available at e6data ?

We support serverless and in-VPC deployment models. 

How does e6data handle data governance rules?

We can integrate with your existing governance tool, and also have an in-house offering for data governance, access control, and security.

Table of contents:
Listen to the full podcast
Apple Podcasts
Spotify
Share this article

The Ultimate Resource Hub for Optimizing Iceberg Tables

November 22, 2024
/
Karthic Rao
Fredson Lewis
Engineering
Optimizing Lakehouse Tables

At e6data, we are big admirers of Apache Iceberg. We're witnessing a steep increase in its adoption, with our customers running E6data's query engine for heavy workloads.

While we were scrambling for resources on the internet to optimize Iceberg, why not curate it for the rest of the community?

Here's a curated collection of links, guides, and insights to help you discover the best practices for optimizing your Iceberg tables.  

- Optimization Strategies for Iceberg Tables by Cloudera    
- Compaction in Apache Iceberg: Fine-Tuning Your Iceberg Table’s Data Files by Dremio    
- Improving performance with Iceberg sorted tables by Starburst    
- Partitioning and Indexing in Apache Iceberg by IOMETE    
- Optimizing read performance by AWS    
- Maintaining tables by using compaction by AWS    
- Iceberg 101: A Guide to Iceberg Partitioning by Upsolver    
- Iceberg Tables Optimization by Upsolver    
- How Z-Ordering in Apache Iceberg Helps Improve Performance by Dremio    
- Z-ORDER sorting during compaction by IOMETE    
- Iceberg 101: Ten Tips to Optimize Performance by Upsolver    
- Optimizing Iceberg tables by AWS    
- https://iceberg.apache.org/docs/1.6.0/performance/ Apache Iceberg official documentation    
- Manage and Optimize Iceberg tables for efficient data storage and querying by AWS    
- Best practices for optimizing Apache Iceberg workloads by AWS

Check out our GitHub repository for more resources on optimizing lakehouse tables.

Listen to the full podcast
Share this article

FAQs

How does e6data reduce Snowflake compute costs without slowing queries?
e6data is powered by the industry’s only atomic architecture. Rather than scaling in step jumps (L x 1 -> L x 2), e6data scales atomically, by as little as 1 vCPU. In production with widely varying loads, this translates to > 60% TCO savings.
Do I have to move out of Snowflake?
No, we fit right into your existing data architecture across cloud, on-prem, catalog, governance, table formats, BI tools, and more.

Does e6data speed up Iceberg on Snowflake?
Yes, depending on your workload, you can see anywhere up to 10x faster speeds through our native and advanced Iceberg support. 

Snowflake supports Iceberg. But how do you get data there in real time?
Our real-time streaming ingest streams Kafka or SDK data straight into Iceberg—no Flink. Landing within 60 seconds and auto-registering each snapshot for instant querying.

How long does it take to deploy e6data alongside Snowflake?
Sign up the form and get your instance started. You can deploy it to any cloud, region, deployment model, without copying or migrating any data from Snowflake.

Related posts

View All Posts

Related posts

View All
Engineering
This is some text inside of a div block.
June 11, 2025
/
Adishesh Kishore
Vector & Semantic Search in the Lakehouse: Faster Insight from Unstructured Data
Adishesh Kishore
June 11, 2025
View All
Engineering
This is some text inside of a div block.
June 6, 2025
/
Rajath Gowda
Solving Geospatial Analytics Performance Bottleneck: H3 vs Quadkey
Rajath Gowda
June 6, 2025
View All
Engineering
This is some text inside of a div block.
April 23, 2025
/
e6data Team
e6data’s Architectural Bets: our Head of Engineering’s conversation w/Pete at Zero Prime Podcast
e6data Team
April 23, 2025
View All Posts