Announcing DataOps Data Quality TestGen 3.0: Open-Source, Generative Data Quality Software. It assesses your data, deploys production testing, monitors progress, and helps you build a constituency within your company for lasting change. New Quality Dashboard & Score Explorer.
We suspected that data quality was a topic brimming with interest. The responses show a surfeit of concerns around data quality and some uncertainty about how best to address those concerns. Key survey results: The C-suite is engaged with data quality. Data quality might get worse before it gets better.
1) What Is Data Quality Management? 4) Data Quality Best Practices. 5) How Do You Measure Data Quality? 6) Data Quality Metrics Examples. 7) Data Quality Control: Use Case. 8) The Consequences Of Bad Data Quality. 9) 3 Sources Of Low-Quality Data.
Cloudera’s mission since its inception has been to empower organizations to transform all their data to deliver trusted, valuable, and predictive insights. This acquisition delivers access to trusted data so organizations can build reliable AI models and applications by combining data from anywhere in their environment.
As technology and business leaders, your strategic initiatives, from AI-powered decision-making to predictive insights and personalized experiences, are all fueled by data. Yet, despite growing investments in advanced analytics and AI, organizations continue to grapple with a persistent and often underestimated challenge: poor data quality.
Once the province of the data warehouse team, data management has increasingly become a C-suite priority, with data quality seen as key for both customer experience and business performance. But along with siloed data and compliance concerns, poor data quality is holding back enterprise AI projects.
In fact, a data framework is a critical first step for AI success. There is, however, another barrier standing in the way of their ambitions: data readiness. "If you're not keeping up the fundamentals of data and data management, your ability to adopt AI, at whatever stage you are in your AI journey, will be impacted," Kulkarni points out.
Concurrent UPDATE/DELETE on overlapping partitions: When multiple processes attempt to modify the same partition simultaneously, data conflicts can arise. For example, imagine a data quality process updating customer records with corrected addresses while another process is deleting outdated customer records.
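A minimal sketch of the optimistic-concurrency pattern commonly used to resolve exactly this kind of conflict: each writer reads a versioned snapshot, stages changes privately, and validates the version at commit time, retrying on conflict. The `PartitionStore` class and every name in it are hypothetical, for illustration only.

```python
import time

class ConflictError(Exception):
    """Raised when another writer committed to the partition first."""

class PartitionStore:
    """Hypothetical partition with snapshot-based optimistic concurrency."""
    def __init__(self):
        self.version = 0
        self.rows = {}  # customer_id -> record dict

    def read_snapshot(self):
        # Copy records so writers stage their changes privately.
        return self.version, {k: dict(v) for k, v in self.rows.items()}

    def commit(self, base_version, new_rows):
        # Reject the write if someone else committed since our read.
        if base_version != self.version:
            raise ConflictError("partition changed since snapshot was taken")
        self.rows = new_rows
        self.version += 1

def update_addresses_with_retry(store, corrections, max_retries=3):
    """Data quality writer: retries on conflict instead of clobbering deletes."""
    for attempt in range(max_retries):
        base_version, rows = store.read_snapshot()
        for cust_id, address in corrections.items():
            if cust_id in rows:  # skip records a concurrent delete removed
                rows[cust_id]["address"] = address
        try:
            store.commit(base_version, rows)
            return
        except ConflictError:
            time.sleep(0.1 * (attempt + 1))  # back off, then re-read
    raise RuntimeError("gave up after repeated partition conflicts")
```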
This is where we dispel an old “big data” notion (heard a decade ago) that was expressed like this: “we need our data to run at the speed of business.” Instead, what we really need is for our business to run at the speed of data. Datasphere is not just for data managers.
Metadata management is key to wringing all the value possible from data assets. However, most organizations don’t use all the data at their disposal to reach deeper conclusions about how to drive revenue, achieve regulatory compliance or accomplish other strategic objectives. What Is Metadata? Analyze metadata.
When an organization’s data governance and metadata management programs work in harmony, then everything is easier. Data governance is a complex but critical practice. There’s always more data to handle, much of it unstructured; more data sources, like IoT, more points of integration, and more regulatory compliance requirements.
First, what active metadata management isn't: "Okay, you metadata! Quit lounging around!" Now, what active metadata management is (well, kind of): "Okay, you metadata!" Data assets are tools. Metadata are the details on those tools: what they are, what to use them for, what to use them with.
What Is Metadata? Metadata is information about data. A clothing catalog and a dictionary are both examples of metadata repositories. Indeed, a popular online catalog, like Amazon, offers rich metadata around products to guide shoppers: ratings, reviews, and product details are all examples of metadata.
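To make the "data about data" idea concrete, here is a toy product-metadata record in the spirit of the catalog analogy; every field and value is invented for illustration.

```python
from dataclasses import dataclass, field

@dataclass
class ProductMetadata:
    """Metadata describing a product, not the product itself."""
    sku: str
    title: str
    category: str
    rating: float          # aggregate shopper rating
    review_count: int
    details: dict = field(default_factory=dict)  # size, material, etc.

catalog_entry = ProductMetadata(
    sku="SKU-1042",
    title="Wool Crew-Neck Sweater",
    category="Apparel",
    rating=4.6,
    review_count=312,
    details={"material": "merino wool", "sizes": ["S", "M", "L"]},
)
```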
With graph databases, the representation of relationships as data makes it possible to better represent data in real time, addressing newly discovered types of data and relationships. This is a graph of millions of edges and vertices; in enterprise data management terms, it is a giant piece of master/reference data.
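A toy sketch of relationships as first-class data: vertices and edges held in a plain adjacency index, so a newly discovered relationship type needs no schema change. A production system would use a graph database; the identifiers below are invented.

```python
from collections import defaultdict

# Edges are data: (source, relationship, target) triples.
edges = [
    ("cust:42", "PLACED", "order:9001"),
    ("order:9001", "CONTAINS", "sku:1042"),
    ("cust:42", "REFERRED_BY", "cust:7"),
]

# Build an adjacency index; adding a new edge type is just appending a triple.
adjacency = defaultdict(list)
for src, rel, dst in edges:
    adjacency[src].append((rel, dst))

# Query: everything directly related to customer 42.
for rel, dst in adjacency["cust:42"]:
    print(f"cust:42 -[{rel}]-> {dst}")
```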
While there has been a lot of talk about big data over the years, the real hero in unlocking the value of enterprise data is metadata, or the data about the data. Many organizations don't know exactly what data they have or even where some of it is. Metadata Is the Heart of Data Intelligence.
In the era of big data, data lakes have emerged as a cornerstone for storing vast amounts of raw data in its native format. They support structured, semi-structured, and unstructured data, offering a flexible and scalable environment for data ingestion from multiple sources.
Data quality is crucial in data pipelines because it directly impacts the validity of the business insights derived from the data. Today, many organizations use AWS Glue Data Quality to define and enforce data quality rules on their data at rest and in transit.
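For illustration, a hedged sketch of defining and evaluating a DQDL rule set with the boto3 Glue client. The rules, database, table, and IAM role below are placeholders, and the parameter shapes should be verified against the current AWS documentation.

```python
import boto3

glue = boto3.client("glue", region_name="us-east-1")

# DQDL rule set: completeness, domain, and volume checks (illustrative rules).
ruleset = """
Rules = [
    IsComplete "order_id",
    ColumnValues "status" in ["PENDING", "SHIPPED", "DELIVERED"],
    RowCount > 0
]
"""

# Register the rule set against a Glue Data Catalog table (names are placeholders).
glue.create_data_quality_ruleset(
    Name="orders_basic_checks",
    Ruleset=ruleset,
    TargetTable={"DatabaseName": "sales_db", "TableName": "orders"},
)

# Evaluate the rules against the table at rest.
run = glue.start_data_quality_ruleset_evaluation_run(
    DataSource={"GlueTable": {"DatabaseName": "sales_db", "TableName": "orders"}},
    Role="arn:aws:iam::123456789012:role/GlueDataQualityRole",  # placeholder role
    RulesetNames=["orders_basic_checks"],
)
print("evaluation run:", run["RunId"])
```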
A data asset is only an asset if you can use it to help your organization. What enables you to use all those gigabytes and terabytes of data you’ve collected? Metadata is the pertinent, practical details about data assets: what they are, what to use them for, what to use them with. Where does metadata come from?
In today’s rapidly evolving financial landscape, data is the bedrock of innovation, enhancing customer and employee experiences and securing a competitive edge. Like many large financial institutions, ANZ Institutional Division operated with siloed data practices and centralized data management teams.
Today, we are pleased to announce that Amazon DataZone is now able to present data quality information for data assets. Other organizations monitor the quality of their data through third-party solutions. Additionally, Amazon DataZone now offers APIs for importing data quality scores from external systems.
Metadata is an important part of data governance, and as a result, most nascent data governance programs are rife with project plans for assessing and documenting metadata. But in many scenarios, it seems that the underlying driver of metadata collection projects is that it’s just something you do for data governance.
Shirley Ann Jackson, president of RPI, states that "because this is a global pandemic, there is a huge amount of epidemiological data that has to be folded in to understand the disease globally, to be able to understand populations and how it can affect populations differently." Answering these questions is at the heart of data intelligence.
Generally available on May 24, Alation introduces the Open Data Quality Initiative for the modern data stack, giving customers the freedom to choose the data quality vendor that's best for them, with the added confidence that those tools will integrate seamlessly with Alation's Data Catalog and Data Governance application.
In order to figure out why the numbers in the two reports didn’t match, Steve needed to understand everything about the data that made up those reports – when the report was created, who created it, any changes made to it, which system it was created in, etc. Enterprise data governance. Metadata in data governance.
Like the proverbial man looking for his keys under the streetlight , when it comes to enterprise data, if you only look at where the light is already shining, you can end up missing a lot. Remember that dark data is the data you have but don’t understand. So how do you find your dark data? Analyze your metadata.
A data catalog serves the same purpose. By using metadata (or short descriptions), data catalogs help companies gather, organize, retrieve, and manage information. You can think of a data catalog as an enhanced Access database or library card catalog system. A data discovery tool moves beyond simple searches.
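A toy sketch of the catalog idea: searching over short metadata descriptions and tags rather than over the data itself. The entries and fields are invented for illustration.

```python
catalog = [
    {"name": "customers",  "description": "CRM master customer records",      "tags": ["pii", "gold"]},
    {"name": "web_events", "description": "raw clickstream from the website", "tags": ["raw"]},
    {"name": "orders",     "description": "curated order facts for reporting","tags": ["gold"]},
]

def search(query):
    """Match datasets whose metadata mentions the query term."""
    q = query.lower()
    return [
        d["name"] for d in catalog
        if q in d["description"].lower() or q in d["tags"]
    ]

print(search("gold"))   # ['customers', 'orders']
print(search("click"))  # ['web_events']
```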
Whether it’s controlling for common risk factors—bias in model development, missing or poorly conditioned data, the tendency of models to degrade in production—or instantiating formal processes to promote data governance, adopters will have their work cut out for them as they work to establish reliable AI production lines.
But too much data can also create issues. If the data is not easily gathered, managed, and analyzed, it can overwhelm decision-makers and complicate decisions. By some estimates, bad data costs global organizations more than five trillion USD annually. Tools that support data insight include numerous data quality management techniques.
They’re taking data they’ve historically used for analytics or business reporting and putting it to work in machine learning (ML) models and AI-powered applications. Data teams struggle to find a unified approach that enables effortless discovery, understanding, and assurance of data quality and security across various sources.
Their terminal operations rely heavily on seamless data flows and the management of vast volumes of data. Recently, EUROGATE has developed a digital twin for its container terminal Hamburg (CTH), generating millions of data points every second from Internet of Things (IoT) devices attached to its container handling equipment (CHE).
These formats, exemplified by Apache Iceberg, Apache Hudi, and Delta Lake, address persistent challenges in traditional data lake structures by offering an advanced combination of flexibility, performance, and governance capabilities. These are useful for flexible data lifecycle management. Apache Iceberg highlights AWS Glue 5.0.
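As a small illustration of what such a table format provides, a sketch assuming a Spark session already configured with an Iceberg catalog; the catalog, database, and table names are placeholders.

```python
from pyspark.sql import SparkSession

# Assumes Iceberg jars and a catalog named "glue_catalog" are configured.
spark = SparkSession.builder.appName("iceberg-demo").getOrCreate()

spark.sql("""
    CREATE TABLE IF NOT EXISTS glue_catalog.sales_db.orders (
        order_id BIGINT,
        status   STRING,
        ts       TIMESTAMP
    )
    USING iceberg
    PARTITIONED BY (days(ts))
""")

# Table-format features in action: list snapshots for time travel / rollback.
spark.sql(
    "SELECT snapshot_id, committed_at "
    "FROM glue_catalog.sales_db.orders.snapshots"
).show()
```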
Data parity can help build confidence and trust with business users on the quality of migrated data. Some customers build custom in-house data parity frameworks to validate data during migration. Others use open source data quality products for data parity use cases.
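One minimal way to sketch such a parity check: compare row counts plus an order-insensitive fingerprint of row contents on both sides of the migration. All names here are invented for illustration.

```python
import hashlib

def table_fingerprint(rows):
    """Order-insensitive fingerprint: XOR of per-row digests plus a row count."""
    acc = 0
    count = 0
    for row in rows:
        # Sort keys so field order never affects the digest.
        digest = hashlib.sha256(repr(sorted(row.items())).encode()).digest()
        acc ^= int.from_bytes(digest[:16], "big")
        count += 1
    return count, acc

source = [{"id": 1, "amt": 10.0}, {"id": 2, "amt": 12.5}]
target = [{"id": 2, "amt": 12.5}, {"id": 1, "amt": 10.0}]  # same data, new order

assert table_fingerprint(source) == table_fingerprint(target), "parity check failed"
print("row counts and content match")
```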
are only starting to exist; one big task over the next two years is developing the IDEs for machine learning, plus other tools for data management, pipeline management, data cleaning, data provenance, and data lineage. GitHub is an excellent tool for managing code, but we need to think about [code+data].
Data is everywhere! But can you find the data you need? What can be done to ensure the quality of the data? How can you show the value of investing in data? Can you trust it when you get it? These are not new questions, but many people still do not know how to practically […].
While Sinclair’s famous 1906 novel was intended to expose the plight of employees with horrific working conditions, the American public heard and reacted to a message about food safety and quality. Dirty Meat… and Dirty Data. What caused the atrocious quality issues in the meatpacking industry pre-1906? Cleaning Up Dirty Data.
This would be a straightforward task were it not for the fact that, during the digital era, there has been an explosion of data, collected and stored everywhere, much of it poorly governed, ill-understood, and irrelevant. Data Centricity. There is evidence to suggest that there is a blind spot when it comes to data in the AI context.
Know thy data: understand what it is (formats, types, sampling, who, what, when, where, why), encourage the use of data across the enterprise, and enrich your datasets with searchable (semantic and content-based) metadata (labels, annotations, tags). Think strategically, but act tactically: think big, start small, learn fast.
When it comes to using AI and machine learning across your organization, there are many good reasons to provide your data and analytics community with an intelligent data foundation. For instance, Large Language Models (LLMs) are known to ultimately perform better when data is structured.
Every day, organizations of every description are deluged with data from a variety of sources, and attempting to make sense of it all can be overwhelming. By 2025, it’s estimated we’ll have 463 million terabytes of data created every day,” says Lisa Thee, data for good sector lead at Launch Consulting Group in Seattle.
Data intelligence platform vendor Alation has partnered with Salesforce to deliver trusted, governed data across the enterprise. It will do this, it said, with bidirectional integration between its platform and Salesforce’s to seamlessly deliver data governance and end-to-end lineage within Salesforce Data Cloud.
Metadata management plays a critical role within the modern data management stack. It helps break down data silos and empowers data and analytics teams to better understand the context and quality of data. This, in turn, builds trust in data and the decision-making that follows. Improve data discovery.
When we talk about data integrity, we’re referring to the overarching completeness, accuracy, consistency, accessibility, and security of an organization’s data. Together, these factors determine the reliability of the organization’s data. Data quality: Data quality is essentially the measure of data integrity.
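To make that "measure" concrete, a toy sketch scoring a dataset on two integrity dimensions, completeness and validity, as simple ratios; the fields, records, and thresholds are invented.

```python
records = [
    {"email": "a@example.com", "age": 34},
    {"email": None,            "age": 29},
    {"email": "c@example.com", "age": -5},   # invalid age
]

def completeness(records, field):
    """Share of records where the field is populated."""
    return sum(r[field] is not None for r in records) / len(records)

def validity(records, field, predicate):
    """Share of populated values that pass a domain rule."""
    present = [r[field] for r in records if r[field] is not None]
    return sum(predicate(v) for v in present) / len(present)

print(f"email completeness: {completeness(records, 'email'):.0%}")                     # 67%
print(f"age validity: {validity(records, 'age', lambda a: 0 <= a <= 120):.0%}")        # 67%
```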
Versioning (of models, feature vectors, data) and the ability to roll out, roll back, or have multiple live versions. Metadata and artifacts needed for a full audit trail. A dashboard that provides custom views for all principals (operations, ML engineers, data scientists, business owners).
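A small sketch of the bookkeeping this implies: each deployment pins model, feature, and data versions and appends to an immutable trail, so rollback is just redeploying a prior release. The structure and names are invented for illustration.

```python
from dataclasses import dataclass
import datetime

@dataclass(frozen=True)
class Release:
    model_version: str
    feature_set_version: str
    data_snapshot: str
    deployed_by: str
    deployed_at: datetime.datetime

history: list[Release] = []  # append-only audit trail; never mutated

def deploy(model_v, features_v, snapshot, user):
    history.append(Release(model_v, features_v, snapshot, user,
                           datetime.datetime.now(datetime.timezone.utc)))

def current():
    return history[-1]

def rollback():
    """Roll back by redeploying the previous release (keeps the trail intact)."""
    prev = history[-2]
    deploy(prev.model_version, prev.feature_set_version,
           prev.data_snapshot, "rollback-bot")

deploy("model:1.4.0", "features:2.1", "s3://snapshots/2024-06-01", "alice")
deploy("model:1.5.0", "features:2.2", "s3://snapshots/2024-06-08", "bob")
rollback()
print(current().model_version)  # model:1.4.0
```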
Data governance definition Data governance is a system for defining who within an organization has authority and control over data assets and how those data assets may be used. It encompasses the people, processes, and technologies required to manage and protect data assets.