Improved open-table analytics stack with Iceberg, Polaris, Hudi, Delta Lake
Product

TL;DR: Connect once, query everywhere. Our latest release provides advanced support for Apache Iceberg, Apache Polaris, Apache Hudi, and Delta Lake, enabling you to run fast SQL queries across every open table format without copying data or rewriting pipelines.
Why does this matter for your lakehouse?
Open table formats have become the backbone of modern lakehouses. Iceberg delivers petabyte-scale performance and versioned metadata. Polaris adds a cloud-native catalog with enterprise-grade governance, while Delta Lake brings ACID guarantees and time-travel to the data lake. Teams want the freedom to mix these technologies without the pain of managing multiple engines, drivers, and security models.
With this improved support, e6data’s query engine supports all four dialects natively, consolidating your existing catalogs into a single workspace.
What’s new in this release?
Format |
What e6data now does |
Biggest win |
Apache Polaris |
Create, edit, and delete Polaris catalog connections in the UI, with full Iceberg table exploration (schema discovery, partitioned queries) and improved metadata sync + catalog validation during setup. |
Centralized governance with cloud-native scale. |
Apache Iceberg |
Reads any Iceberg table exposed through Snowflake Open Catalog, AWS Glue, or Polaris with support for partitioned and non‑partitioned tables, enhanced schema evolution and metadata‑aware querying, plus an improved catalog management UI for editing and managing integrations. |
Query massive, partitioned Iceberg datasets in seconds—no extra Spark job needed. |
Apache Hudi |
Reads any Hudi table via AWS Glue or Apache Hive catalogs, with CoW/MoR support, read‑optimized / incremental / real‑time (MoR) query modes, time travel, schema evolution, upserts/inserts/deletes, and access to commit metadata for incremental processing. External metastore integration is improved for smoother table discovery. |
CDC‑style incremental queries in seconds using commit metadata — ideal for upsert‑heavy MoR workloads. |
Delta Lake |
Reads any Delta Lake table via AWS Glue or Unity Catalog, with improved schema/file‑stats visibility, a cleaner catalog selection & validation UI, and a browsing/querying experience aligned with other table formats. |
Keep your Delta pipelines as-is while unlocking sub-second SQL in e6data. |
How it works (architecture and performance path)
- Smart Catalog Layer – e6data now understands Iceberg and Delta layout metadata natively and communicates with Polaris through the same REST interface Iceberg uses.
- Zero-copy ingestion – Data stays in your object-storage (be it S3, GCS, or ADLS). We only read the files your query touches, reducing scan costs and time.
- Cross-catalog joins – Need to join a Delta customer table with an Iceberg events log? One SQL query across all.
Fine-grained security – We respect Polaris RBAC and Unity Catalog privileges automatically, so users see only what they’re allowed to see.
Quick start
- Create or select a workspace in the e6data console.
- Add catalog → Polaris / Iceberg / Hudi / Delta (choose one or all)
- Run SHOW TABLES FROM <catalog>.<namespace>; to confirm visibility.
- Point your BI tool (Tableau, Power BI, Looker) at the e6data endpoint and start exploring.
Reference links
- Apache Polaris documentation
- Apache Iceberg documentation
- Apache Hudi documentation
- Delta Lake documentation
Roadmap: write support and deeper governance
- Write-support for Iceberg and Delta—including MERGE, UPDATE, and DELETE—is on the roadmap.
- Table-level RBAC in Polaris and automatic metadata sync across engines are in active development.
Stay in the know - Data Engineering ACID
3 Curated stories each week on data engineering at scale—handpicked by the e6data team.
Subscribe Now
Listen to the full podcast