Comprehensive software reviews to make better IT decisions
Building a Data Catalog That Meets Your Needs
Most of the well-established data catalog tools – such as IBM IGC, Informatica EDC, and Collibra – were created over a decade ago and, not surprisingly, are a bit out of synch with current data cataloging requirements. For example, most of them are focused on describing (critical) data elements in great detail, which was great at early maturity stages of data governance.
Today’s requirements are aligned with enabling data self-service, self-serve business intelligence (BI) & analytics – and include (but are not limited to) the following capabilities:
- Describe a dataset (i.e. a set of data elements forming an informational unit)
- Track its provenance (i.e. show where it comes from)
- Control authenticity of data values (i.e. whether any values borrowed from a preceding dataset are changed at the time of the new dataset publishing)
“Legacy” vendors are enhancing their offerings, e.g. IBM has added integration with Watson Knowledge Catalog to its Information Governance Catalog, Collibra has added Machine Learning capabilities to its Data Governance Catalog, Informatica has added Axon to enhance its Enterprise Information Catalog. However, all these enhancements seem to share the typical symptoms of “catching up” – integration problems, complicated product offering, unclear product strategy.
Since product integration problems exist even in the “original” vendor offerings, third-party software vendors are jumping in with their add-ons to the existing suites, e.g. Compact Solutions, which is compatible with Collibra DGC, IBM IGC, and Informatica EDC.
Note: Some master data management and ETL platforms have fairly rich data cataloging capabilities built into their platforms – e.g. Ataccama ONE, TIBCO EBX (a.k.a. Orchestra), Talend Data Fabric, Anzo (by Cambridge Semantics).
If you are looking for a suit of data cataloging functionalities in a seamlessly packaged suite, then you should look for the new generation products – such as TopBraid Enterprise Data Governance, Alation Data Catalog, Waterline Data Catalog.
Whether you already have a “legacy” data catalog or are looking to establish a new one, do not compromise on the required functionalities – there are lots of technology options.
Put your requirements first: user-friendly comprehensive catalog of your organizational data assets with business descriptions, rich metadata, and complete provenance information.
“Search & browse” are today’s “staple” requirements – “select & connect” are the essentials to enable true data and analytics self-service.
Want to Know More?
Informatica World 2022 Highlights
On May 24-25, Informatica held its annual conference in Las Vegas – the first time “in-person” since the beginning of the COVID-19 pandemic.
Alation Launches Active Data Governance
Data intelligence software vendor Alation has made the move to emphasize data governance amongst its solution offerings to make the data catalog a dynamic platform for “a broad range of data intelligence solutions.”
HelpSystems Brings Top-Tier Data Classification Vendors to Its Product Mix
IT software company HelpSystems has acquired leading data classification software vendors Titus and Boldon James to enhance data security capabilities within its current suite of IT systems.
TIBCO Acquires Orchestra Networks: Potential Centerpiece of a New Super Smart Data Management Platform
Orchestra Networks was earning attention even before TIBCO’s acquisition. Now that it is part of the TIBCO family of software products, it can become the centerpiece of a very powerful data management, governance, integration, and analytics platform.
DevOps for Financial Enterprise Systems: We Have the Way if They Have the Will
A prevalent urban legend in enterprise tech is that DevOps and Agile are not ready for tackling transformation at scale. At Info-Tech Research Group, we believe it’s the other way around. DevOps practices like CI/CD are being used by digital banking startups for fintech products. They are leveraging cloud services for demand management and capacity planning but what about the “too big to fail” banks, with global outreach and massive investments in legacy tech?
Databricks and Immuta Partner to Provide End-to-End Data Governance for Machine Learning
Databricks, a data processing and analytics platform with a strong focus on AI and ML, has partnered with Immuta to deliver automated end-to-end data governance for AI, data science, and ML projects.
Netwrix Report: Growing Security Concerns Bringing Data Out of the Cloud – But Not for Good Reason
According to the latest Netwrix, concerns around data security have been encouraging 46% of organizations to move their personally identifiable information (PII) back on premises, from the cloud.
Microsoft’s Ignite Announcements Take Direct Aim at the Competition
Microsoft’s Ignite conference speaks to enterprise developers and IT teams, and this year’s event is making waves with a slew of new product announcements aimed squarely at a host of its competitors. Microsoft continues its multi-year evolution from a company full of disparate products and services to a true platform-oriented provider, seeking to liberate data silos across service lines and vendors and to enable Microsoft solutions to be deployed in non-Microsoft environments.