Kickstart your e6data journey: A quick deployment guide

A quick guide on how e6data deploys within the security of your cloud VPC in 4 steps

By Vishnu Vasanth on 25 May, 2024

e6data's cloudprem deployment model: secure, governed, and private.

Secure, Private, Governed. Your data is in your control plane.

In this post

Deployment Guide: a 4-step process
1. Deploy e6data within the security of your cloud VPC
2. Connect to your Data Sources
3. Spin up a Cluster
4. Run your First Query
Data Stacks / Integrations supported on e6data
A. Open File Formats
B. Open Table Formats
C. Object Store
D. Data Catalog
E. Data Governance
F. BI / Reporting Tools

Built from scratch to be disaggregated, decentralised, and Kubernetes-native, e6data follows a robust cloudprem deployment model in which all customer data stays within the customer VPC, with guardrails and customisable controls for a secure and governed experience. We are also interoperable and open-source friendly: we support the major data infrastructure components, such as catalogs, open table formats, and BI tools, and we accommodate custom requests. Because we are Kubernetes-native, setting up e6data in your environment and running your first query takes only 4 steps and less than 30 minutes. The steps are:

Deployment Guide: a 4-step process

1. Deploy e6data within the security of your cloud VPC

We are a Kubernetes-native engine and provide an automated workspace and cluster creation form that takes care of resource allocation and access control. The process completes in less than 20 minutes.

e6data's architecture diagram: where it fits in existing data stacks.

Step 1: Creating a workspace

2. Connect to your Data Sources

We are universally interoperable, and fit right into your existing data stack. You can connect e6data to your existing metastore and also easily integrate with AWS S3, GCS, or Azure Blob Storage, accessing necessary metadata within minutes.

Step 2: Creating a catalog connection

3. Spin up a Cluster

Set up scalable e6data clusters that integrate with your existing tools. Resources are provisioned automatically to handle varying use cases and loads.

Step 3: Creating a cluster

4. Run your First Query

Use the Query Editor to write and execute SQL queries efficiently. Access running clusters and catalogs with ease, and view query results immediately for quick insights.

Step 4: Running your first query in the e6data Query Editor
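
As an illustration of what a first query might look like, here is a minimal sketch. It runs against an in-memory SQLite database purely as a stand-in, since this post does not describe e6data's connection API; the `orders` table, its columns, and the sample rows are hypothetical. In practice you would paste a query like `FIRST_QUERY` into the Query Editor and point it at a table in your own catalog.

```python
import sqlite3

# Stand-in engine: in-memory SQLite, NOT e6data.
# The table, columns, and rows below are hypothetical sample data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (order_id INTEGER, region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "EU", 120.0), (2, "US", 80.5), (3, "EU", 42.0)],
)

# A typical first query: a small aggregate whose result is easy to sanity-check.
FIRST_QUERY = """
SELECT region, COUNT(*) AS order_count, SUM(amount) AS total_amount
FROM orders
GROUP BY region
ORDER BY region
"""

rows = conn.execute(FIRST_QUERY).fetchall()
for region, order_count, total_amount in rows:
    print(region, order_count, total_amount)
```

Starting with a small grouped aggregate rather than `SELECT *` keeps the result set tiny and makes it easy to confirm that the cluster and catalog are wired up correctly.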

Besides being super-easy to deploy, we also assign a dedicated onboarding manager and provide 24/7 customer support to all our lighthouse customers. Want to learn more about us? Read our documentation.

Data Stacks / Integrations supported on e6data

A. Open File Formats

As a next-generation lakehouse compute engine, e6data supports querying a wide range of file formats to meet diverse analytics needs. Currently, our platform handles commonly used file formats such as Parquet, ORC, Avro, JSON, and CSV, among others.

B. Open Table Formats

To deliver high performance and cost optimization across diverse workloads, e6data is compatible with the industry's most powerful and widely adopted open table formats. Our current support includes Apache Hudi, Apache Iceberg, Apache Hive, Delta Lake, XTable, and UniForm, with plans to expand compatibility to emerging formats in the near future.

C. Object Store

e6data seamlessly integrates with leading object storage solutions, including Amazon S3, Google Cloud Storage (GCS), and Azure Blob Storage. This native support ensures high availability, durability, and performance, enabling scalable and efficient data management.
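
When pointing a catalog at data in one of these stores, the location is usually written as a URI whose scheme identifies the store. The sketch below maps the Hadoop-style schemes commonly used for each store to its name. The function name `object_store_for` is hypothetical, and which schemes e6data itself accepts is an assumption to verify against the documentation.

```python
from urllib.parse import urlparse

# Hadoop-style URI schemes commonly used for each object store.
# Whether e6data accepts every scheme listed here is an assumption.
SCHEME_TO_STORE = {
    "s3": "Amazon S3",
    "s3a": "Amazon S3",
    "gs": "Google Cloud Storage",
    "wasb": "Azure Blob Storage",
    "wasbs": "Azure Blob Storage",
    # abfs/abfss address ADLS Gen2, which is built on top of Blob Storage.
    "abfs": "Azure Blob Storage",
    "abfss": "Azure Blob Storage",
}


def object_store_for(uri: str) -> str:
    """Return the object store name implied by the URI scheme."""
    scheme = urlparse(uri).scheme.lower()
    try:
        return SCHEME_TO_STORE[scheme]
    except KeyError:
        raise ValueError(f"unsupported or missing scheme in {uri!r}") from None


print(object_store_for("s3://my-bucket/warehouse/orders/"))
print(object_store_for("gs://my-bucket/warehouse/orders/"))
```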

D. Data Catalog

We seamlessly integrate with leading data catalog solutions, including AWS Glue, Apache Hive, Unity Catalog, and Dataproc, to ensure comprehensive metadata management, discoverability, and governance, without any migration.

E. Data Governance

e6data integrates with leading data governance solutions, including Immuta, Privacera, and Apache Ranger. In addition, e6data offers an in-house data governance product with capabilities such as attribute-based data management, data masking, and role-based access control (RBAC). These are available on a single unified platform with a user-friendly interface, making data governance efficient and accessible to data stewards and data teams alike.
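
To make the terms concrete, here is a minimal sketch of how role-based access control and data masking can combine at the column level. It is a generic illustration, not e6data's implementation: the `Policy` class, the role names, the column names, and the masking rule are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Policy:
    readable_by: frozenset   # roles allowed to see the column at all
    unmasked_for: frozenset  # roles allowed to see the raw value


# Hypothetical column policies; columns without a policy are denied by default.
POLICIES = {
    "email": Policy(frozenset({"analyst", "steward"}), frozenset({"steward"})),
    "amount": Policy(frozenset({"analyst", "steward"}), frozenset({"analyst", "steward"})),
}


def mask(value: str) -> str:
    """Keep the first character and replace the rest with asterisks."""
    return value[:1] + "*" * (len(value) - 1)


def apply_policies(row: dict, role: str) -> dict:
    """Return the view of `row` that `role` is allowed to see."""
    visible = {}
    for column, value in row.items():
        policy = POLICIES.get(column)
        if policy is None or role not in policy.readable_by:
            continue  # RBAC: drop columns the role may not read
        # Masking: readable but not unmasked roles get an obscured value.
        visible[column] = value if role in policy.unmasked_for else mask(str(value))
    return visible


row = {"email": "ana@example.com", "amount": 120.0, "ssn": "123-45-6789"}
print(apply_policies(row, "analyst"))
print(apply_policies(row, "steward"))
```

In this sketch an analyst sees the amount and a masked email, a steward sees both in the clear, and neither sees the `ssn` column because no policy grants access to it.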

F. BI / Reporting Tools

e6data integrates with leading BI and reporting tools, including Tableau, Power BI, Metabase, Superset, and Looker, so data teams can query and visualise data on the platform of their choice without any migration or retraining effort.

Through our Lighthouse Customer Program, we consistently engage with demanding customers on cutting-edge use cases, pushing the boundaries of Analytics, Data Engineering, and GenAI. You can read more about it here.

Join our Lighthouse Customer Program

  1. Pick 1-2 high impact use cases.
  2. The e6 team works closely with you to demonstrate success on your data, queries, and load patterns.
  3. We also support you as you put use cases into production.