Announcing DataOps Data Quality TestGen 3.0: Open-Source, Generative Data Quality Software. You don't have to imagine; start using it today: [link] Introducing Data Quality Scoring in Open Source DataOps Data Quality TestGen 3.0! DataOps just got more intelligent.
1) What Is Data Quality Management? 4) Data Quality Best Practices. 5) How Do You Measure Data Quality? 6) Data Quality Metrics Examples. 7) Data Quality Control: Use Case. 8) The Consequences Of Bad Data Quality. 9) 3 Sources Of Low-Quality Data.
By adding the Octopai platform, Cloudera customers will benefit from: Enhanced Data Discovery: Octopai's automated data discovery enables instantaneous search and location of desired data across multiple systems. This guarantees data quality and automates the laborious, manual processes required to maintain data reliability.
When an organization's data governance and metadata management programs work in harmony, everything is easier. Data governance is a complex but critical practice. There's always more data to handle, much of it unstructured; more data sources, like IoT; more points of integration; and more regulatory compliance requirements.
Metadata management is key to wringing all the value possible from data assets. However, most organizations don’t use all the data at their disposal to reach deeper conclusions about how to drive revenue, achieve regulatory compliance or accomplish other strategic objectives. What Is Metadata? Harvest data.
What Is Metadata? Metadata is information about data. A clothing catalog and a dictionary are both examples of metadata repositories. Indeed, a popular online catalog, like Amazon, offers rich metadata around products to guide shoppers: ratings, reviews, and product details are all examples of metadata.
It addresses many of the shortcomings of traditional data lakes by providing features such as ACID transactions, schema evolution, row-level updates and deletes, and time travel. In this blog post, we’ll discuss how the metadata layer of Apache Iceberg can be used to make data lakes more efficient.
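The snippet above describes Iceberg's metadata layer at a high level. As a purely illustrative sketch (this is a toy model in plain Python, not the Apache Iceberg API), the core idea can be shown as a table whose every commit appends an immutable snapshot, so any historical state stays readable ("time travel"):

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class Snapshot:
    """One immutable table state: a snapshot id and the data files it covers."""
    snapshot_id: int
    data_files: List[str]

@dataclass
class IcebergStyleTable:
    """Toy metadata layer: each commit adds a snapshot; old snapshots remain readable."""
    snapshots: List[Snapshot] = field(default_factory=list)

    def commit(self, data_files: List[str]) -> int:
        # A commit never mutates history; it appends a new snapshot.
        sid = len(self.snapshots) + 1
        self.snapshots.append(Snapshot(sid, list(data_files)))
        return sid

    def read(self, snapshot_id: Optional[int] = None) -> List[str]:
        # Time travel: read any historical snapshot by id; default is the latest.
        snap = self.snapshots[-1] if snapshot_id is None else self.snapshots[snapshot_id - 1]
        return snap.data_files

table = IcebergStyleTable()
table.commit(["part-0001.parquet"])
table.commit(["part-0001.parquet", "part-0002.parquet"])
```

In the real format, snapshots, manifests, and file statistics live in metadata files alongside the data, which is what enables query planning and time travel without scanning the lake.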
Not Every Graph is a Knowledge Graph: Schemas and Semantic Metadata Matter. To automate these operations and maintain sufficient data quality, enterprises have started implementing so-called data fabrics, which employ diverse metadata sourced from different systems. Such examples are provenance (e.g.
Untapped data, if mined, represents tremendous potential for your organization. While there has been a lot of talk about big data over the years, the real hero in unlocking the value of enterprise data is metadata, or the data about the data. Metadata Is the Heart of Data Intelligence.
Metadata is an important part of data governance, and as a result, most nascent data governance programs are rife with project plans for assessing and documenting metadata. But in many scenarios, it seems that the underlying driver of metadata collection projects is that it’s just something you do for data governance.
Data teams struggle to find a unified approach that enables effortless discovery, understanding, and assurance of data quality and security across various sources. Having confidence in your data is key. Automate data profiling and data quality recommendations, monitor data quality rules, and receive alerts.
Aptly named, metadata management is the process in which BI and Analytics teams manage metadata, which is the data that describes other data. In other words, data is the content and metadata is the context. Without metadata, BI teams are unable to understand the data's full story.
If the data is not easily gathered, managed and analyzed, it can overwhelm decision-makers and complicate decision-making. Data insight techniques provide a comprehensive set of tools, data analysis and quality assurance features that allow users to identify errors, enhance data quality, and boost productivity.
Some customers build custom in-house data parity frameworks to validate data during migration. Others use open-source data quality products for data parity use cases. This diverts important person-hours from the actual migration effort into building and maintaining a data parity framework.
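The data parity frameworks mentioned above boil down to comparing a source and a target dataset after migration. A minimal sketch of the idea, under the assumption that rows can be reduced to content fingerprints (real frameworks hash typed, normalized values and work at scale):

```python
import hashlib

def row_fingerprint(row: tuple) -> str:
    # Toy content hash for a row; real tools normalize types and encodings first.
    return hashlib.sha256("|".join(map(str, row)).encode()).hexdigest()

def parity_report(source_rows, target_rows):
    """Compare two datasets by row fingerprints and summarize the differences."""
    src = {row_fingerprint(r) for r in source_rows}
    tgt = {row_fingerprint(r) for r in target_rows}
    return {
        "source_count": len(source_rows),
        "target_count": len(target_rows),
        "missing_in_target": len(src - tgt),
        "unexpected_in_target": len(tgt - src),
    }

report = parity_report(
    [(1, "alice"), (2, "bob")],
    [(1, "alice"), (2, "bobb")],  # one row drifted during migration
)
```

A non-empty `missing_in_target` or `unexpected_in_target` count is exactly the kind of signal a parity check raises before a migration is signed off.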
These formats, exemplified by Apache Iceberg, Apache Hudi, and Delta Lake, address persistent challenges in traditional data lake structures by offering an advanced combination of flexibility, performance, and governance capabilities. These are useful for flexible data lifecycle management.
We have identified the top ten sites, videos, or podcasts online that deal with data lineage. Our list of Top 10 Data Lineage Podcasts, Blogs, and Websites To Follow in 2021. Data Engineering Podcast. This podcast centers around data management and investigates a different aspect of this field each week.
I have developed a few rules to help drive quick wins and facilitate success in data-intensive and AI (e.g., Generative AI and ChatGPT) deployments. Love thy data: data are never perfect, but all the data may produce value, though not immediately. The latter is essential for Generative AI implementations.
Discoverable – users have access to a catalog or metadata management tool which renders the domain discoverable and accessible. Secure and permissioned – data is protected from unauthorized users. Governed – designed with data quality and management workflows that empower data usage. The post What is a Data Mesh?
If you are not observing and reacting to the data, the model will accept every variant. It may then end up among the more than 50% of models that, according to Gartner, never make it to production, because there are no clear insights and the results have nothing to do with the original intent of the model.
To marry the epidemiological data to the population data will require a tremendous amount of data intelligence about the: source of the data; currency of the data; quality of the data; and data lineage to support impact analysis. Unraveling Data Complexities with Metadata Management.
Metadata management performs a critical role within the modern data management stack. It helps break down data silos and empowers data and analytics teams to better understand the context and quality of data. This, in turn, builds trust in data and the decision-making to follow. Improve data discovery.
In this article, we will walk you through the process of implementing fine-grained access control for the data governance framework within the Cloudera platform. In a good data governance strategy, it is important to define roles that allow the business to limit the level of access that users can have to their strategic data assets.
When we talk about data integrity, we're referring to the overarching completeness, accuracy, consistency, accessibility, and security of an organization's data. Together, these factors determine the reliability of the organization's data. Data quality: Data quality is essentially the measure of data integrity.
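The data integrity dimensions listed above (completeness, accuracy, consistency, and so on) are each measurable. As a hedged illustration of just one dimension, a completeness score can be computed as the fraction of records whose required fields are populated (the field names here are hypothetical):

```python
def completeness(records, required_fields):
    """Fraction of records with every required field present and non-empty."""
    def is_complete(rec):
        return all(rec.get(f) not in (None, "") for f in required_fields)
    return sum(is_complete(r) for r in records) / len(records)

# Hypothetical customer records; one is missing an email.
customers = [
    {"id": 1, "email": "a@example.com"},
    {"id": 2, "email": ""},
    {"id": 3, "email": "c@example.com"},
]
score = completeness(customers, ["id", "email"])
```

Tracking such per-dimension scores over time is one common way data quality is reported as a measure of integrity.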
Like the proverbial man looking for his keys under the streetlight , when it comes to enterprise data, if you only look at where the light is already shining, you can end up missing a lot. The data you’ve collected and saved over the years isn’t free. Analyze your metadata. Real-time, cloud-based data ingestion and storage.
Data intelligence software is continuously evolving to enable organizations to efficiently and effectively advance new data initiatives. With a variety of providers and offerings addressing data intelligence and governance needs, it can be easy to feel overwhelmed in selecting the right solution for your enterprise.
Alation and Soda are excited to announce a new partnership, which will bring powerful data-quality capabilities into the data catalog. Soda’s data observability platform empowers data teams to discover and collaboratively resolve data issues quickly. Do we have end-to-end data pipeline control?
Poor data quality is one of the top barriers faced by organizations aspiring to be more data-driven. Ill-timed business decisions and misinformed business processes, missed revenue opportunities, failed business initiatives and complex data systems can all stem from data quality issues.
Data is the new oil, and organizations of all stripes are tapping this resource to fuel growth. However, data quality and consistency are among the top barriers faced by organizations in their quest to become more data-driven. Unlock quality data with IBM. and its leading data observability offerings.
In a sea of questionable data, how do you know what to trust? Data quality tells you the answer. It signals what data is trustworthy, reliable, and safe to use. It empowers engineers to oversee data pipelines that deliver trusted data to the wider organization. Today, as part of its 2022.2
Metadata enrichment is about scaling the onboarding of new data into a governed data landscape by taking data and applying the appropriate business terms, data classes and quality assessments so it can be discovered, governed and utilized effectively. Scalability and elasticity. Public API.
This happens through the process of semantic annotation, where documents are tagged with relevant concepts and enriched with metadata, i.e., references that link the content to concepts described in a knowledge graph. When "reading" unstructured text, AI systems first need to transform it into machine-readable sets of facts.
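The semantic annotation process described above can be sketched in miniature: match surface labels in a document against a concept index and attach the corresponding knowledge-graph identifiers as metadata. This is a toy label-matching sketch (real systems use NLP-based entity linking and disambiguation), and the IRIs below are hypothetical:

```python
def annotate(text, concept_index):
    """Tag a document with knowledge-graph concept IRIs whose labels it mentions."""
    lowered = text.lower()
    annotations = [
        {"label": label, "concept": iri}
        for label, iri in concept_index.items()
        if label in lowered
    ]
    return {"text": text, "annotations": annotations}

# Hypothetical mini knowledge-graph index: surface label -> concept IRI.
index = {
    "berlin": "http://example.org/kg/City/Berlin",
    "germany": "http://example.org/kg/Country/Germany",
}
doc = annotate("Berlin is the capital of Germany.", index)
```

The output pairs the raw text with machine-readable references, which is the metadata enrichment the snippet refers to.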
Part Two of the Digital Transformation Journey … In our last blog on driving digital transformation , we explored how enterprise architecture (EA) and business process (BP) modeling are pivotal factors in a viable digital transformation strategy. With automation, data quality is systemically assured.
The results of our new research show that organizations are still trying to master data governance, including adjusting their strategies to address changing priorities and overcoming challenges related to data discovery, preparation, quality and traceability. Most have only data governance operations.
These layers help teams delineate different stages of data processing, storage, and access, offering a structured approach to data management. In the context of Data in Place, validating data quality automatically with Business Domain Tests is imperative for ensuring the trustworthiness of your data assets.
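A Business Domain Test, as mentioned above, encodes a business rule against data where it lives. As a hedged sketch (the rule, field names, and region codes here are hypothetical, not from any specific product), such a test might assert that every order has positive revenue and a known region before downstream use:

```python
def revenue_domain_test(orders):
    """Business Domain Test sketch: flag orders violating basic business rules."""
    known_regions = {"EMEA", "AMER", "APAC"}
    failures = [
        o["order_id"]
        for o in orders
        if o["revenue"] <= 0 or o["region"] not in known_regions
    ]
    return {"passed": not failures, "failing_orders": failures}

# Hypothetical orders: one with negative revenue, one with an unknown region.
orders = [
    {"order_id": "A1", "revenue": 120.0, "region": "EMEA"},
    {"order_id": "A2", "revenue": -5.0, "region": "EMEA"},
    {"order_id": "A3", "revenue": 40.0, "region": "MOON"},
]
result = revenue_domain_test(orders)
```

Running such tests automatically against Data in Place is what turns a business rule into a continuous trust signal.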
It’s the preferred choice when customers need more control and customization over the data integration process or require complex transformations. This flexibility makes Glue ETL suitable for scenarios where data must be transformed or enriched before analysis. In the navigation pane, under Data catalog , choose Zero-ETL integrations.
Analysis, however, requires enterprises to find and collect metadata. This data about data is valuable. In fact, Gartner's "Market Guide for Active Metadata Management" points to "active metadata management" as the key to continuous data analysis, which supports smarter human usage and more valuable insights.
An understanding of the data's origins and history helps answer questions about the origin of data in Key Performance Indicator (KPI) reports, including: How are the report tables and columns defined in the metadata? Who are the data owners? Data lineage offers proof that the data provided is reflected accurately.
At DataKitchen, we think of this as a 'meta-orchestration' of the code and tools acting upon the data. Data Pipeline Observability: Optimizes pipelines by monitoring data quality, detecting issues, tracing data lineage, and identifying anomalies using live and historical metadata.
Added data quality capability ready for an AI era. Data quality has never been more important than as we head into this next AI-focused era. erwin Data Quality is the data quality heart of erwin Data Intelligence.
Based on business rules, additional data quality tests check the dimensional model after the ETL job completes. While implementing a DataOps solution, we make sure that the pipeline has enough automated tests to ensure data quality and reduce the fear of failure. Monitoring Job Metadata.
Automated data enrichment: To create the knowledge catalog, you need automated data stewardship services. These services include the ability to auto-discover and classify data, to detect sensitive information, to analyze data quality, to link business terms to technical metadata and to publish data to the knowledge catalog.
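The auto-classification and sensitive-data detection mentioned above are often bootstrapped with pattern matching over sampled column values. This is a deliberately tiny sketch (real stewardship services combine patterns with column names, dictionaries, and ML signals; the 80% threshold and data classes here are assumptions):

```python
import re

# Toy classifiers: one pattern per data class.
PATTERNS = {
    "email": re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$"),
    "us_ssn": re.compile(r"^\d{3}-\d{2}-\d{4}$"),
}

def classify_column(values):
    """Assign a data class if most sampled values match its pattern."""
    for data_class, pattern in PATTERNS.items():
        hits = sum(bool(pattern.match(str(v))) for v in values)
        if hits / len(values) >= 0.8:  # assumed confidence threshold
            return data_class
    return "unclassified"
```

A column classified as `us_ssn`, for example, would then be flagged as sensitive and handled under the appropriate governance policy.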
In 2017, Anthem reported a data breach that exposed thousands of its Medicare members. The medical insurance company wasn't hacked, but its customers' data was compromised through a third-party vendor's employee. 86% of Experian survey respondents, for instance, are prioritizing moving their data to the cloud in 2022.
Observability for your most secure data For your most sensitive, protected data, we understand even the metadata and telemetry about your workloads must be kept under close watch, and it must stay within your secured environment. The post Introducing Cloudera Observability Premium appeared first on Cloudera Blog.