EMC Corporation and Cloudera have announced the formation of an alliance so that Hadoop-based services that Cloudera offers will be integrated with EMC’s Greenplum technology. This move will help businesses better manage and analyze the challenge of ever-growing big data – including log files, sensor data, emails, images, receipts, research data and so on.

The integration between Cloudera’s Distribution for Hadoop and Greenplum’s parallel processing database will provide an architecture for collaborative analysis of large amounts of structured and unstructured data. Cloudera will be building a connector between its and Greenplum’s technologies. The connector will enable high-speed bi-directional data transfer between the systems and will be jointly supported by both Cloudera and Greenplum.
According to Cloudera, “The alliance between EMC Greenplum and Cloudera will change the way customers collect, process and store data.” Hadoop is becoming increasingly popular as the solution to big data management, and the two companies believe that their alliance will provide a single place to store and process big data, rather than the combination of database and storage processes that are in use today.
In addition to technical integration, the Cloudera and EMA will also be working together on joint sales activities.
“Together EMC and Cloudera have a real opportunity to change the way companies collect, process and analyze data,” says Michael Olson, CEO of Cloudera. “Customers can use Cloudera’s Distribution for Hadoop to inexpensively stage complex and structured data, while Greenplum Chorus utilizes its cloud-based platform to extract data from a variety of sources and enables collaborative analysis for many users.”
As more organizations begin to work out their strategies for dealing with big data, both Cloudera and EMC seem well poised to offer their services and solutions.