Comprehensive software reviews to make better IT decisions
Building a Data Catalog That Meets Your Needs
Most of the well-established data catalog tools – such as IBM IGC, Informatica EDC, and Collibra – were created over a decade ago and, not surprisingly, are a bit out of synch with current data cataloging requirements. For example, most of them are focused on describing (critical) data elements in great detail, which was great at early maturity stages of data governance.
Today’s requirements are aligned with enabling data self-service, self-serve business intelligence (BI) & analytics – and include (but are not limited to) the following capabilities:
- Describe a dataset (i.e. a set of data elements forming an informational unit)
- Track its provenance (i.e. show where it comes from)
- Control authenticity of data values (i.e. whether any values borrowed from a preceding dataset are changed at the time of the new dataset publishing)
“Legacy” vendors are enhancing their offerings, e.g. IBM has added integration with Watson Knowledge Catalog to its Information Governance Catalog, Collibra has added Machine Learning capabilities to its Data Governance Catalog, Informatica has added Axon to enhance its Enterprise Information Catalog. However, all these enhancements seem to share the typical symptoms of “catching up” – integration problems, complicated product offering, unclear product strategy.
Since product integration problems exist even in the “original” vendor offerings, third-party software vendors are jumping in with their add-ons to the existing suites, e.g. Compact Solutions, which is compatible with Collibra DGC, IBM IGC, and Informatica EDC.
Note: Some master data management and ETL platforms have fairly rich data cataloging capabilities built into their platforms – e.g. Ataccama ONE, TIBCO EBX (a.k.a. Orchestra), Talend Data Fabric, Anzo (by Cambridge Semantics).
If you are looking for a suit of data cataloging functionalities in a seamlessly packaged suite, then you should look for the new generation products – such as TopBraid Enterprise Data Governance, Alation Data Catalog, Waterline Data Catalog.
Whether you already have a “legacy” data catalog or are looking to establish a new one, do not compromise on the required functionalities – there are lots of technology options.
Put your requirements first: user-friendly comprehensive catalog of your organizational data assets with business descriptions, rich metadata, and complete provenance information.
“Search & browse” are today’s “staple” requirements – “select & connect” are the essentials to enable true data and analytics self-service.
Want to Know More?
On May 24-25, Informatica held its annual conference in Las Vegas – the first time “in-person” since the beginning of the COVID-19 pandemic.
Data intelligence software vendor Alation has made the move to emphasize data governance amongst its solution offerings to make the data catalog a dynamic platform for “a broad range of data intelligence solutions.”
IT software company HelpSystems has acquired leading data classification software vendors Titus and Boldon James to enhance data security capabilities within its current suite of IT systems.
TIBCO Acquires Orchestra Networks: Potential Centerpiece of a New Super Smart Data Management Platform
Orchestra Networks was earning attention even before TIBCO’s acquisition. Now that it is part of the TIBCO family of software products, it can become the centerpiece of a very powerful data management, governance, integration, and analytics platform.
A prevalent urban legend in enterprise tech is that DevOps and Agile are not ready for tackling transformation at scale. At Info-Tech Research Group, we believe it’s the other way around. DevOps practices like CI/CD are being used by digital banking startups for fintech products. They are leveraging cloud services for demand management and capacity planning but what about the “too big to fail” banks, with global outreach and massive investments in legacy tech?
Databricks, a data processing and analytics platform with a strong focus on AI and ML, has partnered with Immuta to deliver automated end-to-end data governance for AI, data science, and ML projects.
According to the latest Netwrix, concerns around data security have been encouraging 46% of organizations to move their personally identifiable information (PII) back on premises, from the cloud.
Microsoft’s Ignite conference speaks to enterprise developers and IT teams, and this year’s event is making waves with a slew of new product announcements aimed squarely at a host of its competitors. Microsoft continues its multi-year evolution from a company full of disparate products and services to a true platform-oriented provider, seeking to liberate data silos across service lines and vendors and to enable Microsoft solutions to be deployed in non-Microsoft environments.