The Race for Data Quality in a Medallion Architecture. The medallion architecture pattern is gaining traction among data teams. It is a layered approach to managing and transforming data. It sounds great, but how do you prove the data is correct at each layer? How do you ensure data quality in every layer?
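One concrete answer is to gate each promotion between layers on explicit checks. Below is a minimal sketch in pandas, with hypothetical column names and rules standing in for whatever your bronze/silver contract actually requires:

```python
import pandas as pd

def validate_silver(df: pd.DataFrame) -> list[str]:
    """Return human-readable failures for the silver-layer contract."""
    failures = []
    if df["order_id"].isna().any():
        failures.append("order_id contains nulls")
    if df["order_id"].duplicated().any():
        failures.append("order_id is not unique")
    if (df["amount"] < 0).any():
        failures.append("amount contains negative values")
    return failures

# Bronze: raw, as-landed data (synthetic example records).
bronze = pd.DataFrame(
    {"order_id": [1, 2, 2, None], "amount": [10.0, -5.0, 7.5, 3.0]}
)

# Silver: cleaned/conformed. Promotion is gated on the checks passing.
silver = bronze.dropna(subset=["order_id"]).drop_duplicates("order_id")
silver = silver[silver["amount"] >= 0]

problems = validate_silver(silver)
if problems:
    raise ValueError(f"Refusing to promote to silver: {problems}")
print("Silver layer validated; safe to build gold aggregates.")
```

The same pattern repeats at the gold layer, where the checks shift from structural rules to business expectations.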
Data Observability and Data Quality Testing Certification Series. We are excited to invite you to a free four-part webinar series that will elevate your understanding and skills in data observability and data quality testing. Don’t miss this opportunity to transform your data practices.
1) What Is Data Quality Management? 4) Data Quality Best Practices. 5) How Do You Measure Data Quality? 6) Data Quality Metrics Examples. 7) Data Quality Control: Use Case. 8) The Consequences of Bad Data Quality. 9) 3 Sources of Low-Quality Data.
We’ve identified two distinct types of data teams: process-centric and data-centric. Understanding this framework offers valuable insights into team efficiency, operational excellence, and data quality. Process-centric data teams focus their energies predominantly on orchestrating and automating workflows.
Testing and Data Observability. We have also included vendors for the specific use cases of ModelOps, MLOps, DataGovOps, and DataSecOps, which apply DataOps principles to machine learning, AI, data governance, and data security operations. Genie: a distributed big data orchestration service by Netflix.
AWS Glue Data Quality allows you to measure and monitor the quality of data in your data repositories. It’s important for business users to be able to see quality scores and metrics to make confident business decisions and debug data quality issues.
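The service evaluates rules written in Glue’s Data Quality Definition Language (DQDL). A rough sketch of driving an evaluation from boto3 follows, based on my reading of the Glue data quality APIs; the database, table, role ARN, and rules are placeholders, so verify the calls against the current boto3 documentation:

```python
import boto3

glue = boto3.client("glue", region_name="us-east-1")

# Hypothetical DQDL ruleset; adapt the rules to your table's contract.
ruleset = """
Rules = [
    IsComplete "order_id",
    IsUnique "order_id",
    ColumnValues "amount" >= 0
]
"""

glue.create_data_quality_ruleset(Name="orders-ruleset", Ruleset=ruleset)

# Kick off an evaluation run against a catalog table (placeholder names).
run = glue.start_data_quality_ruleset_evaluation_run(
    DataSource={"GlueTable": {"DatabaseName": "sales", "TableName": "orders"}},
    Role="arn:aws:iam::123456789012:role/GlueDataQualityRole",
    RulesetNames=["orders-ruleset"],
)
print("Started evaluation run:", run["RunId"])
```

The run produces per-rule pass/fail results and an overall score, which is what surfaces to business users.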
As technology and business leaders, your strategic initiatives, from AI-powered decision-making to predictive insights and personalized experiences, are all fueled by data. Yet, despite growing investments in advanced analytics and AI, organizations continue to grapple with a persistent and often underestimated challenge: poor data quality.
Today, we are pleased to announce that Amazon DataZone is now able to present data quality information for data assets. Some organizations monitor the quality of their data through third-party solutions. Additionally, Amazon DataZone now offers APIs for importing data quality scores from external systems.
Once the province of the data warehouse team, data management has increasingly become a C-suite priority, with data quality seen as key for both customer experience and business performance. But along with siloed data and compliance concerns, poor data quality is holding back enterprise AI projects.
Data debt that undermines decision-making. In Digital Trailblazer, I share a story of a private company that reported a profitable year to the board, only to return after the holiday to find that data quality issues and calculation mistakes turned it into an unprofitable one.
In a previous post, we noted some key attributes that distinguish a machine learning project: unlike traditional software, where the goal is to meet a functional specification, in ML the goal is to optimize a metric. Quality depends not just on code, but also on data, tuning, regular updates, and retraining.
In recent years, data lakes have become a mainstream architecture, and data quality validation is a critical factor to improve the reusability and consistency of the data. In this post, we provide benchmark results of running increasingly complex data quality rulesets over a predefined test dataset.
Navigating the Storm: How Data Engineering Teams Can Overcome a Data Quality Crisis. Ah, the data quality crisis. It’s that moment when your carefully crafted data pipelines start spewing out numbers that make as much sense as a cat trying to bark. You’ve got yourself a recipe for data disaster.
Key Success Metrics, Benefits, and Results for Data Observability Using DataKitchen Software. A key benefit is lowering serious production errors: errors in production can come from many sources, such as poor data, problems in the production process, late delivery, or infrastructure problems. Data errors can cause compliance risks.
Metrics should include system downtime and reliability, security incidents, incident response times, data quality issues, and system performance. Set goals and report metrics to determine whether you are achieving the goals set out by the organization or the AI governance committee.
In Bringing an AI Product to Market, we distinguished the debugging phase of product development from pre-deployment evaluation and testing. During testing and evaluation, application performance is important, but not critical to success. … require not only disclosure, but also monitored testing. Debugging AI Products.
Some customers build custom in-house data parity frameworks to validate data during migration. Others use open source data quality products for data parity use cases. Building and maintaining such a framework, however, diverts significant person-hours from the actual migration effort.
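The core of a parity check is usually small: compare row counts, key coverage, and a content fingerprint between source and target. A minimal pandas sketch, with made-up frames standing in for the two systems:

```python
import pandas as pd

def parity_report(source: pd.DataFrame, target: pd.DataFrame, key: str) -> dict:
    """Compare two extracts on row count, key coverage, and content hash."""
    src_hash = pd.util.hash_pandas_object(
        source.sort_values(key).reset_index(drop=True), index=False
    ).sum()
    tgt_hash = pd.util.hash_pandas_object(
        target.sort_values(key).reset_index(drop=True), index=False
    ).sum()
    return {
        "row_count_match": len(source) == len(target),
        "missing_keys": set(source[key]) - set(target[key]),
        "content_match": src_hash == tgt_hash,
    }

source = pd.DataFrame({"id": [1, 2, 3], "value": [10, 20, 30]})
target = pd.DataFrame({"id": [1, 2], "value": [10, 20]})
print(parity_report(source, target, key="id"))
# {'row_count_match': False, 'missing_keys': {3}, 'content_match': False}
```

Hashing sorted frames makes the comparison order-independent, which matters when the two systems return rows in different orders.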
DataKitchen Training and Certification Offerings, for individual contributors with a background in data analytics, science, or engineering, covering the overall ideas and principles of DataOps: the DataOps Cookbook (200-page book, over 30,000 readers, free); DataOps Certification (3 hours, online, free, signup online); the DataOps Manifesto (over 30,000 signatures)…
So it’s Monday, and you lead a data analytics team of perhaps 30 people. But wait, she asks you for your team metrics. Like most leaders of data analytics teams, you have been doing very little to quantify your team’s success. Where is your metrics report? What should be in that report about your data team?
In the context of Data in Place, validating data quality automatically with Business Domain Tests is imperative for ensuring the trustworthiness of your data assets. Moreover, advanced metrics like Percentage Regional Sales Growth can provide nuanced insights into business performance. What is Data in Use?
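A business domain test encodes an expectation a domain expert would recognize, not just a structural check. Here is a sketch of what a Percentage Regional Sales Growth guardrail might look like; the column names, periods, and the ±30% plausibility bound are all invented for illustration:

```python
import pandas as pd

def regional_sales_growth(df: pd.DataFrame) -> pd.Series:
    """Percent growth in sales per region between two periods."""
    pivot = df.pivot_table(index="region", columns="period",
                           values="sales", aggfunc="sum")
    return (pivot["Q2"] - pivot["Q1"]) / pivot["Q1"] * 100

sales = pd.DataFrame({
    "region": ["EMEA", "EMEA", "APAC", "APAC"],
    "period": ["Q1", "Q2", "Q1", "Q2"],
    "sales":  [100.0, 112.0, 80.0, 40.0],
})

growth = regional_sales_growth(sales)
# Domain rule (invented): quarter-over-quarter swings beyond +/-30% per
# region are implausible for this business and should block publication.
suspect = growth[growth.abs() > 30]
if suspect.empty:
    print("Regional growth within plausible bounds; publish.")
else:
    print("Blocking publication; implausible growth:", suspect.to_dict())
```

On this synthetic data the APAC region shows a -50% swing, so the guardrail fires, which is exactly the kind of catch a purely structural null/uniqueness test would miss.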
Some will argue that observability is nothing more than testing and monitoring applications using tests, metrics, logs, and other artifacts. That’s a fair point, and it places emphasis on what is most important – what best practices should data teams employ to apply observability to data analytics.
Companies are no longer wondering if data visualizations improve analyses, but rather what the best way is to tell each data story. 2020 will be the year of data quality management and data discovery: clean and secure data combined with a simple and powerful presentation. 1) Data Quality Management (DQM).
GSK had been pursuing DataOps capabilities such as automation, containerization, automated testing and monitoring, and reusability for several years. At Workiva, they recognized that they are only as good as their data, so they centered their initial DataOps efforts around lowering errors. Multiple Metrics for Success.
Due to the volume, velocity, and variety of data being ingested in data lakes, it can get challenging to develop and maintain policies and procedures to ensure data governance at scale for your data lake. Data confidentiality and data quality are the two essential themes for data governance.
And when business users don’t complain, but you know the data isn’t good enough to make these types of calls wisely, that’s an even bigger problem. How are you, as a data quality evangelist (if you’re reading this post, that must describe you at least somewhat, right?)… Tie data quality directly to business objectives.
“Managers tend to incentivize activity metrics and measure inputs versus outputs,” she adds. Woolley recommends that companies consolidate around the minimum number of tools they need to get things done, and have a sandbox process to test and evaluate new tools that don’t get in the way of people doing actual work.
The Third of Five Use Cases in Data Observability. Data Evaluation: this involves evaluating and cleansing new datasets before they are added to production. This process is critical, as it ensures data quality from the onset. Examples include regular loading of CRM data and anomaly detection.
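For a recurring load like CRM data, even a simple statistical guardrail catches a lot: compare today’s batch against recent history before promoting it. A sketch with synthetic volumes; the three-sigma threshold is an assumption you would tune:

```python
from statistics import mean, stdev

# Row counts from the last ten CRM loads (synthetic history).
history = [10_120, 10_340, 9_980, 10_210, 10_075,
           10_400, 10_150, 9_890, 10_300, 10_050]
todays_batch = 6_450  # suspiciously small load

mu, sigma = mean(history), stdev(history)
z = (todays_batch - mu) / sigma

# Assumption: |z| > 3 routes the batch to review instead of auto-promotion.
if abs(z) > 3:
    print(f"Quarantine batch: row-count z-score {z:.1f} vs history")
else:
    print("Batch volume looks normal; promote to production")
```

The same pattern extends to null rates, distinct counts, and distribution shifts per column.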
In addition to newer innovations, the practice borrows from model risk management, traditional model diagnostics, and software testing. It’s a very simple and powerful idea: simulate data that you find interesting and see what a model predicts for that data. [6] Debugging may focus on a variety of failure modes (i.e.,
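In code, that idea reduces to constructing inputs you care about and inspecting the model’s outputs across them. A sketch with scikit-learn, where the toy model and the probed feature are stand-ins:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Train a toy credit-approval model (synthetic data).
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 2))            # features: [income_z, debt_z]
y = (X[:, 0] - X[:, 1] > 0).astype(int)  # approve when income outweighs debt
model = LogisticRegression().fit(X, y)

# Simulate "interesting" applicants: sweep debt while holding income fixed.
debt = np.linspace(-3, 3, 7)
probe = np.column_stack([np.full_like(debt, 1.0), debt])
probs = model.predict_proba(probe)[:, 1]

for d, p in zip(debt, probs):
    print(f"debt_z={d:+.1f} -> P(approve)={p:.2f}")
# Expect approval probability to fall monotonically as debt rises;
# a non-monotonic curve here would be a debugging lead.
```

The simulated sweep is the cheap, controllable counterpart to waiting for such cases to show up in production.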
Your Chance: Want to test an agile business intelligence solution? Business intelligence is moving away from the traditional engineering model: analysis, design, construction, testing, and implementation. Test BI in a small group and deploy the software internally. Finalize testing. Test throughout the lifecycle.
The model outputs produced by the same code will vary with changes to things like the size of the training data (number of labeled examples), network training parameters, and training run time. This has serious implications for software testing, versioning, deployment, and other core development processes.
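One practical consequence: tests for ML code usually assert on metric tolerances rather than exact outputs, since retraining the same code rarely reproduces bit-identical models. A sketch of that pattern with scikit-learn; the dataset, model, and thresholds are illustrative:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Two training runs differing only in seed: outputs will differ slightly.
acc = []
for seed in (1, 2):
    model = RandomForestClassifier(n_estimators=50, random_state=seed)
    acc.append(model.fit(X_tr, y_tr).score(X_te, y_te))

# Exact-match assertions would be flaky; assert a tolerance band instead.
assert all(a > 0.85 for a in acc), f"Accuracy regression: {acc}"
assert abs(acc[0] - acc[1]) < 0.05, f"Unstable across seeds: {acc}"
print("Accuracy within tolerance:", [round(a, 3) for a in acc])
```

Versioning follows the same logic: pin the data snapshot and training configuration alongside the code, because the code alone no longer determines the artifact.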
The data engineer then emails the BI team, who refreshes a Tableau dashboard. (Figure 1: Example data pipeline with manual processes.) There are no automated tests, so errors frequently pass through the pipeline. By contrast, an automated pipeline has tests at each step, making sure that each step completes successfully.
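Automating that handoff can be as small as wrapping each step so its test runs before the next step starts, replacing the email chain. A minimal sketch with invented step and check names:

```python
def run_step(name, step_fn, check_fn, payload):
    """Run one pipeline step, then its test; stop the pipeline on failure."""
    result = step_fn(payload)
    errors = check_fn(result)
    if errors:
        raise RuntimeError(f"Step '{name}' failed checks: {errors}")
    print(f"Step '{name}' passed; handing off automatically")
    return result

def extract(_):
    return [{"id": 1, "amount": 10.0}, {"id": 2, "amount": 12.5}]

def check_extract(rows):
    errs = []
    if not rows:
        errs.append("no rows extracted")
    if any(r["amount"] is None for r in rows):
        errs.append("null amounts")
    return errs

def load(rows):
    return {"loaded": len(rows)}  # stand-in for the warehouse write

data = run_step("extract", extract, check_extract, None)
run_step("load", load, lambda r: [] if r["loaded"] else ["empty load"], data)
```

Because a failed check raises before the next step runs, bad data stops at the point of failure instead of reaching the dashboard.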
Extrinsic Control Deficit: Many of these changes stem from tools and processes beyond the immediate control of the data team. Unregulated ETL/ELT Processes: The absence of stringent data quality tests in ETL (Extract, Transform, Load) or ELT (Extract, Load, Transform) processes further exacerbates the problem.
Now, with support for dbt Cloud, you can access a managed, cloud-based environment that automates and enhances your data transformation workflows. This upgrade allows you to build, test, and deploy data models in dbt with greater ease and efficiency, using all the features that dbt Cloud provides.
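Part of what “managed” buys you is an API surface: dbt Cloud jobs can be triggered and monitored from an orchestrator rather than run by hand. A sketch using dbt Cloud’s v2 job-trigger endpoint as I recall it; the account ID, job ID, and token are placeholders, so check the current API docs before relying on the exact path:

```python
import requests

ACCOUNT_ID = 12345             # placeholder dbt Cloud account
JOB_ID = 67890                 # placeholder job that runs `dbt build`
TOKEN = "dbt-cloud-api-token"  # placeholder service token

resp = requests.post(
    f"https://cloud.getdbt.com/api/v2/accounts/{ACCOUNT_ID}/jobs/{JOB_ID}/run/",
    headers={"Authorization": f"Token {TOKEN}"},
    json={"cause": "Triggered from orchestration script"},
)
resp.raise_for_status()
run_id = resp.json()["data"]["id"]
print(f"Started dbt Cloud run {run_id}; poll its status before downstream steps")
```

Since `dbt build` runs tests alongside models, gating downstream steps on the run’s final status gives you quality enforcement for free.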
Many data pipeline failures and quality issues that are detected by data observability tools in production could have been prevented earlier in the pipeline lifecycle with better pre-production testing strategies. Crucial for time-sensitive analytics and reporting processes.
Hopefully, with metrics in place, you can show measured improvements in productivity and quality that will win converts. Improve Collaboration, both Inter- and Intra-team: If the individuals in your data analytics team don’t work together, it can impact analytics cycle time, data quality, governance, security, and more.
How Artificial Intelligence Is Impacting Data Quality. Artificial intelligence has the potential to combat human error by taking on the taxing responsibilities associated with the analysis, drilling, and dissection of large volumes of data. Data quality is crucial in the age of artificial intelligence.
Webinar: Beyond Data Observability: Personalization. DataKitchen DataOps Observability: problem statement white paper (“Taming Chaos”), technical product overview, four-minute online demo, and detailed product documentation. Webinar: Data Observability Demo Day. DataKitchen DataOps TestGen: problem statement white paper (“Mystery Box Full Of Data Errors”)…
GE formed its Digital League to create a data culture. “One of the keys for our success was really focusing that effort on what our key business initiatives were and what sorts of metrics mattered most to our customers.” Chapin also mentioned that measuring cycle time and benchmarking metrics upfront was absolutely critical.
While sometimes it’s okay to follow your instincts, the vast majority of your business-based decisions should be backed by metrics, facts, or figures related to your aims, goals, or initiatives that can ensure a stable backbone to your management reports and business operations. In most cases, deciding on gut feeling alone can prove detrimental to the business.
While this may seem like an odd question, understanding organizational change levers can help uncover opportunities for improving your data supply-chain processes. A key challenge with organizational alignment is communicating across all departments the goals, metrics, and benefits of a process or function.
As he thinks through the various journeys that data take in his company, Jason sees that his dashboard idea would require extracting or testing for events along the way. So, the only way to truly observe what’s happening along a data journey is to get his tools and pipelines to auto-report events. Data and tool tests.
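Getting tools and pipelines to auto-report is mostly a matter of emitting a structured event at every start, test, and finish, then shipping those records somewhere central. A minimal sketch that logs JSON events; the event schema here is invented:

```python
import json
import logging
import time

logging.basicConfig(level=logging.INFO, format="%(message)s")

def emit(run_id, step, status, **extra):
    """Emit one structured journey event (stdout here; a collector in practice)."""
    event = {"run_id": run_id, "step": step, "status": status,
             "ts": time.time(), **extra}
    logging.info(json.dumps(event))

run_id = "orders-2024-06-01"
emit(run_id, "extract", "started")
rows = 10_200  # pretend result of the extract
emit(run_id, "extract", "finished", rows=rows)

# Data and tool tests report through the same channel,
# so the whole journey reads as one event stream.
emit(run_id, "row_count_test", "passed" if rows > 0 else "failed", rows=rows)
```

Once every tool emits into the same stream, the dashboard becomes a query over events rather than a web of manual status checks.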
The testing phase, particularly user acceptance testing (UAT), can become a labor-intensive bottleneck — and a budget breaker. According to a 2023 Capgemini report, companies spend about 35% of their IT budget on testing — a figure that has remained stubbornly high despite advancements in automation.
Another way to look at the five pillars is to see them in the context of a typical complex data estate. Monitoring is another pillar of Data Journeys, extending down the stack. One example is a ‘data at rest’ test result; such a test can check, say, the top fifty customers or suppliers.
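A ‘data at rest’ test of that kind can be phrased as: recompute the top-N set and compare it with the last snapshot, flagging churn beyond an expected bound. A sketch with synthetic customer revenue; the 20% churn bound is an invented threshold:

```python
import pandas as pd

def top_n_churn(today: pd.DataFrame, snapshot: set, n: int = 50) -> float:
    """Fraction of the previous top-N customers that dropped out today."""
    current = set(today.nlargest(n, "revenue")["customer_id"])
    return len(snapshot - current) / len(snapshot)

# Synthetic data: 200 customers, yesterday's top-50 saved as a snapshot.
today = pd.DataFrame({"customer_id": range(200),
                      "revenue": [1000 - i for i in range(200)]})
yesterday_top50 = set(range(5, 55))  # pretend snapshot from the last run

churn = top_n_churn(today, yesterday_top50)
# Invented rule: more than 20% turnover in the top 50 suggests a load problem.
print(f"Top-50 churn: {churn:.0%} -> " +
      ("investigate upstream loads" if churn > 0.20 else "within expectation"))
```

The snapshot comparison makes the test about business stability rather than schema validity, which is what makes it useful at rest.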
The questions reveal a bunch of things we used to worry about, and continue to, like data quality and creating data-driven cultures. That means all of these metrics are off. This is exactly why the Page Value metric (in the past called $index value) was created. “Was the data correct?” EU cookies!