Cloudera Expands Data Lineage, Metadata Management Capabilities With Octopai Acquisition

Cloudera also debuts new AI assistant to help data scientists, data engineers and developers boost productivity and streamline data workflows.

Data platform giant Cloudera has struck a deal to acquire data lineage and data catalog technology developed by Israel-based Octopai in a move to expand Cloudera’s data catalog and metadata management capabilities for data analytics and AI tasks.

The acquisition comes as businesses and organizations are looking for ways to utilize their data for AI, machine learning and predictive analytics initiatives – a move that requires finding and managing all relevant, contextual and trusted data.

Automatically managing metadata to provide a unified view of data has become more complex as data is increasingly spread across distributed data architectures, including hybrid and multi-cloud environments. Data security and governance have also become more complex.

[Related: Cloudera Teams With Nvidia To Create New AI Inference Service]

“When using data to make business-critical decisions, enterprises can’t afford to have blind spots or inaccuracies, and they certainly shouldn’t let identifying trusted data slow down progress,” said Charles Sansbury, CEO of Cloudera, in a statement.

“Our customers need to auto-discover data across multiple repositories, show deep lineage of assets both within and outside the Cloudera estate, and leverage a robust data catalog to identify data assets that can be consumed. The acquisition of Octopai’s platform enhances Cloudera’s data, analytics, and AI platform, enabling customers to have greater visibility of their data regardless of their data management provider,” Sansbury said.

Cloudera said it has signed a definitive agreement for the deal and expects the transaction to be completed by the end of this month. Terms of the acquisition were not disclosed. While Octopai’s website has a banner stating that it has been acquired by Cloudera, the Cloudera statement says it is only acquiring the Octopai platform.

Octopai, founded in 2016, is based in Rosh Ha’ayin, Israel, with U.S. headquarters in Wilmington, Del.

The company’s platform leverages data mapping and knowledge graphs technology to power its automated data discovery, multi-dimensional data lineage, data catalog and impact analysis capabilities.

With the addition of the Octopai technology to the Cloudera platform, customers can expect improved data discoverability, data quality, data governance and data migration assistance capabilities, according to Cloudera.

"Cloudera and Octopai represent a perfect symbiosis by bringing together centralized data and metadata management,” said Octopai CEO Yael Ben Arie, in the statement. “By combining Octopai's metadata capabilities with Cloudera's comprehensive data, analytics and AI platform, we're addressing the critical challenge of understanding and governing data across multi-cloud and on-premises environments.”

In additional news, Cloudera, part of the CRN 2024 Big Data 100, also has launched Cloudera Copilot for Cloudera AI, which the company said provides “secure and intelligent” assistance capabilities to help data scientists, data engineers and developers boost productivity and streamline data workflows.

Cloudera Copilot improves reproducibility across projects, ultimately helping to enable enterprises to get trusted data, analytics and AI applications into production faster, according to the company.

Specifically, the new offering automates code generation, data transformation and troubleshooting tasks; provides consistent coding assistance; and includes on-demand guidance, optimal solutions and insights for maintaining high coding standards.