StreamSets and Databricks Joint Solution Leverages the Power of DataOps to Accelerate ETL and Machine Learning

Partnership Enables Organizations to Achieve Full Power of Analytics and Data Science by Embracing DataOps

StreamSets®, provider of the industry’s first DataOps data integration platform, today announced a partnership with Databricks, the leader in unified data analytics, to empower data teams with rapid, no-code development using Apache Spark and Delta Lake for modern analytics and data science at scale.

“Delta Lake builds on our history of innovation by providing an open-source storage layer for data lakes with support for transactions and massive scale,” said Pankaj Dugar, vice president of ISV and Technology Partners, Databricks. “We are excited to partner with StreamSets and make Delta Lake and Apache Spark even more accessible through their DataOps platform.”

The joint solution of the StreamSets DataOps Platform and Delta Lake on Databricks helps organizations apply a DataOps mentality to their critical data science and advanced analytics projects. Specifically, the StreamSets DataOps Platform makes it easy to ingest, integrate and monitor data moving into Databricks by mitigating data drift and operationalizing the process through its full life cycle. As a result, the Databricks customer has more operational control, greater visibility and deeper insights into data — all within a single, highly integrated solution. Further, given StreamSets Transformer’s unique, Spark-native approach to data processing, even Spark non-experts can benefit from the power of Databricks. With the combined approach, the two companies are enabling more data to be delivered faster to drive analytics.

“This partnership with Databricks is key to expanding our solution to new sets of data and AI applications, as we’re forming an alliance with a company whose reputation as an innovator is unmatched,” said Jobi George, GM of Cloud Business at StreamSets. “Together, StreamSets and Databricks are bringing the full power of Spark and the Databricks platform to a new spectrum of users, easing the drive toward digital transformation.”

The joint solution is available immediately. For more information, including pricing, please visit www.streamsets.com.

About DataOps
Analytics has modernized in our always-on, always-changing world. How you deliver data to drive analytics has to modernize, too. DataOps is a set of practices and technologies that operationalizes data management and integration to ensure resiliency and agility despite ceaseless change. It combines the DevOps principles of continuous delivery with the ability to tame data drift (unexpected and undocumented changes to data). By embedding these principles, DataOps makes it possible to deliver the continuous data needed to drive modern analytics and digital transformation.

About StreamSets
StreamSets built the industry’s first multi-cloud DataOps platform for modern data integration, helping enterprises to continuously flow big, streaming and traditional data to their data science and data analytics applications. The platform uniquely handles data drift, those frequent and unexpected changes to upstream data that break pipelines and damage data integrity. The StreamSets DataOps Platform allows for execution of any-to-any pipelines, ETL processing and machine learning with a cloud-native operations portal for the continuous automation and monitoring of complex multi-pipeline topologies.

Founded in 2014, StreamSets is backed by top-tier Silicon Valley venture capital firms, including Battery Ventures, New Enterprise Associates (NEA), and Accel Partners. For more information, visit www.streamsets.com.