Comprehensive software reviews to make better IT decisions
Main Hadoop Developers – Hortonworks and Cloudera – Under One Roof
Hadoop is listed as an open-source Apache project, however, the main commercial distribution packages were coming from Hortonworks and Cloudera. It happened so that most of Hadoop original developers went to Hortonworks, while one of the main co-creators – Doug Cutting – joined Cloudera.
For almost eight years the two companies rivaled with each other for a reason that does not seem to be a valid cause for a rivalry:
- Hortonworks was trying to prove that the purity of the open-source idea is of utmost importance and all development efforts should focus on code improvement to increase processing speed, scalability, efficiency, etc. (from a meeting with co-founder Alan Gates in 2014).
- Cloudera focused on packaging Hadoop into an “enterprise wrapper” with features that facilitate adoption of any software – administrative controls, metadata manager, data catalog, etc.
Now, the two teams are one company.
This merger had been long overdue. A viable software platform has to evolve along both its performance metrics and its usability index. Hadoop users will definitely benefit from the merger as the product should be improving faster without any waste of resources on rivalry. It could be just a little too late – too many Hadoop competitors have sprung onto the market, but there’s still hope as the market for big data software is still growing fast.
PHEMI is a data privacy solution focused on keeping data-processing activities secure by redacting information based on the role of the accessor. Thus, allowing such data to be used for multiple use cases without compromising privacy.
Joining the ranks of giants such as Snap (Snapchat’s parent company), Microsoft and Tesla, Immuta the automated Data Governance company has been named to Fast Company’s 2020 list of the World’s 50 Most Innovative Companies.
Databricks has launched a new Data Ingestion Network, made up of partners whose integrations to Data Ingest provide hundreds of connectors and enable automation to move disparate data into Databricks’ new storage layer, eliminating the need to maintain siloed data in a data lake and data warehouse.
The EU plans to invest €6 billion to build a single European data space, reports EURACTIV. The envisioned space will house personal, business, and “high-quality industrial data” and create the infrastructure for data sharing and use across businesses and nations.
Microsoft claims its newly announced Azure Synapse Analytics service is four times faster than Amazon Redshift and 75 times faster than Google BigQuery. This announcement positions Microsoft as a leader in this market, but it is also likely to generate counterclaims from its competitors.
AWS Lake Formation makes it easier for users to set up and manage data lakes. But organizations will face challenges in determining how to derive value from their data lakes.
Tableau and AWS Expand Strategic Relationship to Bring Analytics in the Cloud Closer to Their Customers
Leading analytics player Tableau recently announced its new initiative – Modern Cloud Analytics (MCA) – which sees it partnering with Amazon Web Services Inc. (AWS) to make cloud-based analytics more achievable for their customers.
Cambridge Semantics enhanced its Anzo platform to enable data management and analytics over both structured and unstructured data, the firm announced in an August 22 press release.
Several discussion threads on LinkedIn and other social media have been dedicated to the status of Apache Hadoop and the merged Cloudera/Hortonworks. Many predict their demise is not far off. How substantiated are those predictions?