Dell Data Lakehouse: Increased Performance and Connectivity Boosts

Discover new performance, connectivity, operations, and ease-of-use features that streamline data access and accelerate AI initiatives.

Since the launch of the Dell Data Lakehouse in March, we’ve been thrilled by the response from you, our customers and partners! The team has been hard at work, and today we’re excited to unveil the latest enhancements, which mark a significant leap forward in our mission to streamline data access and eliminate silos for the enterprise. These improvements are designed to help fast-track AI initiatives with high-quality data, so that your analytics and AI endeavors are more powerful and efficient than ever.

Key Highlights

Turbocharging Performance

We understand that speed is crucial when it comes to handling large volumes of data. That’s why we’ve focused on significantly enhancing performance:

  • Warp Speed. Warp Speed dramatically boosts query performance (3x to 5x faster) by automatically learning query patterns and optimizing indexes and caches accordingly. This feature is supported only on data lakes that reside on Dell S3-compatible storage.
  • New SSDs. Enhanced compute node configurations now integrate high-performance SSDs, rigorously tested and benchmarked to support Warp Speed.

These advancements mean that your data queries will be faster and more efficient, helping you to extract insights more quickly and more reliably.

Better Connectivity

Connecting to various data sources seamlessly is critical for maintaining a robust data infrastructure. We’ve made substantial improvements to ensure better connectivity:

  • Connecting to existing metastores. Added support for connecting securely to an existing Hive Metastore from within the Dell Data Lakehouse via Kerberos, enabling seamless metadata operations and enhanced data governance.
  • New connectors. Introducing a Neo4j graph database connector in public preview and an improved Snowflake connector for efficient querying. The parallel connector for Snowflake was launched earlier to replace the now-deprecated Snowflake distributed connector.
  • Improved connectors. We have upgraded connectors to popular sources like Iceberg, Delta Lake and Hive, as well as Db2, Netezza, Redshift, SAP HANA, Snowflake, SQL Server, Synapse and Teradata. These connectors are now faster and better at operations such as join pushdown and data type handling.
  • PowerScale and ObjectScale. Both storage solutions are now fully validated with Dell Data Lakehouse and provide the best end-to-end experience and reliability for a data lakehouse.

These enhancements ensure that your data lakehouse can integrate seamlessly with a wide array of data sources, enhancing overall data accessibility and governance.

Simplified Day 2 Operations

Managing a data lakehouse shouldn’t be a complex or cumbersome task. We’ve streamlined Day 2 operations to make maintenance and monitoring more straightforward:

  • Health checkup. Dell support teams can now work with you to easily assess the state of your cluster prior to or after an install or upgrade using an automated health check. In addition to ongoing cluster monitoring and alerting, the health check is crucial to ensuring zero downtime.
  • Critical telemetry. The Dell Data Lakehouse can now send critical hardware system failure alerts directly to Dell Support teams for proactive handling of failure states or pending failure conditions.
  • Encryption. Optional end-to-end encryption for internal components, including all the compute nodes, the cache service and the metastore, will further secure the lakehouse. Note, however, that this feature will impact performance, so it should be factored in when sizing the cluster to meet performance SLAs.

These features are designed to reduce the complexity of maintaining a data lakehouse, ensuring that your system remains robust, secure and operational with minimal effort.
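
The sizing note on encryption can be illustrated with a little arithmetic. The following is a minimal sketch, not Dell sizing guidance: the function name, the per-node throughput and the 25% overhead figure are all hypothetical placeholders, and real values should come from your own benchmarks or your Dell account team.

```python
import math


def nodes_needed(target_throughput: float,
                 per_node_throughput: float,
                 encryption_overhead: float = 0.0) -> int:
    """Estimate how many compute nodes are needed to meet a throughput target.

    All inputs are hypothetical illustration values:
      target_throughput    -- required queries per minute for the cluster
      per_node_throughput  -- measured queries per minute for one node
      encryption_overhead  -- assumed fractional slowdown with encryption
                              enabled (0.25 means 25% less throughput)
    """
    if not 0.0 <= encryption_overhead < 1.0:
        raise ValueError("encryption_overhead must be in [0.0, 1.0)")
    # Encryption reduces the usable throughput of each node.
    effective = per_node_throughput * (1.0 - encryption_overhead)
    # Round up: a fractional node still requires a whole node.
    return math.ceil(target_throughput / effective)


# Hypothetical example: 96 queries/min target, 16 queries/min per node.
baseline = nodes_needed(96, 16)                              # no encryption
encrypted = nodes_needed(96, 16, encryption_overhead=0.25)   # assumed 25% hit
print(f"nodes without encryption: {baseline}")
print(f"nodes with encryption:    {encrypted}")
```

With these placeholder numbers, the same SLA needs two more nodes once encryption is enabled, which is the kind of gap worth identifying before the cluster is ordered rather than after.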

Ease of Consumption

We’re introducing several new options to simplify consumption of the Dell Data Lakehouse:

  • Extended subscription option. Introducing a five-year software subscription option on Dell Data Lakehouse. This is in addition to the existing one-year and three-year subscriptions and will help align the lengths of hardware and software support terms to ease procurement.
  • Wider global availability. We’re now shipping to more countries across Europe, Africa and Asia to meet growing demand.
  • Hands-on experience. Access the Dell Data Lakehouse in the Dell Demo Center, and soon in the Customer Solution Center, for interactive exploration and solution validation. To get started, create an account in the Demo Center (available to customers and partners worldwide at no cost).

These updates make it easier for you to access, explore and validate the Dell Data Lakehouse, ensuring that it meets your specific needs and helping you to get the most out of your investment.

To get a full, hands-on experience, visit the Dell Demo Center to interactively explore the Dell Data Lakehouse with labs hand-picked for you by Dell Technologies’ experts. You can also contact your Dell account executive to explore the Dell Data Lakehouse for your data needs.

And check out this blog to find out more about Warp Speed!

About the Author: Vrashank Jain

Vrashank is a Product Manager in the Data Management team at Dell Technologies. He is focused on product management, strategic business development and commercialization in the data management space to help customers unlock value from their data. Prior to this role, he spent nearly 4 years leading strategy teams in Dell’s Corporate Strategy unit focused on setting up new business units like Automotive and Telco, partnership strategy in the data space, launching joint solutions with VMware (Project Monterey), org design, corporate social responsibility, and others. Before joining Dell, Vrashank worked for nearly 5 years at ZS Associates, a sales and marketing consulting firm based in Pune, India, where he focused on improving sales performance through analytics-backed incentive programs for the bestselling drug in the US pharmaceutical market. He also worked on designing new ways to deliver sales readiness through mobile apps to help sales reps spend less time managing their tools and more time selling. Vrashank holds a Bachelor of Engineering degree with Distinction in Computer Science from BIT Mesra, India, and a Master’s degree in Business Administration from the Tuck School of Business at Dartmouth.