1) What Is Data Quality Management? 4) Data Quality Best Practices. 5) How Do You Measure Data Quality? 6) Data Quality Metrics Examples. 7) Data Quality Control: Use Case. 8) The Consequences Of Bad Data Quality. 9) 3 Sources Of Low-Quality Data.
Data is the foundation of innovation, agility and competitive advantage in today's digital economy. For technology and business leaders, strategic initiatives, from AI-powered decision-making to predictive insights and personalized experiences, are all fueled by data. Data quality is no longer a back-office concern.
At AWS, we are committed to empowering organizations with tools that streamline data analytics and transformation processes. Integrating Amazon Athena with dbt Cloud enables data teams to efficiently transform and manage data using dbt Cloud's robust features, enhancing the overall data workflow experience.
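As a rough illustration of the Athena side of such a workflow, here is a minimal sketch that submits a query to Athena with boto3; the database name, query, and S3 results location are hypothetical placeholders, and the dbt Cloud side is not shown.

```python
import boto3

# Hypothetical region, database, and results bucket -- substitute your own.
athena = boto3.client("athena", region_name="us-east-1")

response = athena.start_query_execution(
    QueryString="SELECT order_id, amount FROM orders LIMIT 10",
    QueryExecutionContext={"Database": "analytics_db"},
    ResultConfiguration={"OutputLocation": "s3://my-athena-results/queries/"},
)
print("Started Athena query:", response["QueryExecutionId"])
```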
Alerts and notifications play a crucial role in maintaining data quality because they facilitate prompt and efficient responses to any data quality issues that may arise within a dataset. They also simplify monitoring and evaluating the quality of your data.
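A minimal sketch of what such an alert might look like, assuming a simple null-rate threshold on one column; in practice the warning would be routed to a notification channel rather than a local logger.

```python
import logging

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("dq_alerts")

def check_null_rate(rows: list[dict], column: str, threshold: float = 0.05) -> None:
    """Raise an alert when the share of null values in a column exceeds a threshold."""
    nulls = sum(1 for row in rows if row.get(column) is None)
    null_rate = nulls / len(rows) if rows else 0.0
    if null_rate > threshold:
        # Could post to Slack, PagerDuty, or email instead of logging.
        logger.warning("ALERT: %.1f%% of '%s' values are null (limit %.1f%%)",
                       100 * null_rate, column, 100 * threshold)
    else:
        logger.info("OK: '%s' null rate %.1f%% within limit", column, 100 * null_rate)

check_null_rate([{"email": "a@x.com"}, {"email": None}, {"email": None}], "email")
```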
We live in a data-rich, insights-rich, and content-rich world. Data collections are the ones and zeroes that encode the actionable insights (patterns, trends, relationships) that we seek to extract from our data through machine learning and data science. Plus, AI can also help find key insights encoded in data.
In today’s rapidly evolving financial landscape, data is the bedrock of innovation, enhancing customer and employee experiences and securing a competitive edge. Like many large financial institutions, ANZ Institutional Division operated with siloed data practices and centralized data management teams.
For container terminal operators, data-driven decision-making and efficient data sharing are vital to optimizing operations and boosting supply chain efficiency. Together, these capabilities enable terminal operators to enhance efficiency and competitiveness in an industry that is increasingly data-driven.
In early April 2021, DataKitchen sat down with Jonathan Hodges, VP Data Management & Analytics at Workiva; Chuck Smith, VP of R&D Data Strategy at GlaxoSmithKline (GSK); and Chris Bergh, CEO and Head Chef at DataKitchen, to find out about their enterprise DataOps transformation journey, including key successes and lessons learned.
These strategies can prevent delayed discovery of quality issues during data observability monitoring in production. Below is a summary of recommendations for proactively identifying and fixing flaws before they impact production data. Doing so helps maintain business rule consistency and avoid regressions in data quality over time.
Alation and Bigeye have partnered to bring data observability and data quality monitoring into the data catalog. Read to learn how our newly combined capabilities put more trustworthy, quality data into the hands of those who are best equipped to leverage it. Poor data quality costs organizations trillions of dollars each year.
AI is transforming how senior data engineers and data scientists validate data transformations and conversions. Artificial intelligence-based verification approaches aid in the detection of anomalies, the enforcement of data integrity, and the optimization of pipelines for improved efficiency.
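As a very small sketch of the anomaly-detection idea, here is a hypothetical statistical check that flags a transformation run whose output row count deviates sharply from recent history; production systems would typically use richer learned baselines.

```python
from statistics import mean, stdev

def detect_volume_anomaly(history: list[int], today: int, z_threshold: float = 3.0) -> bool:
    """Flag today's row count if it deviates too far from the historical distribution."""
    mu, sigma = mean(history), stdev(history)
    if sigma == 0:
        return today != mu
    return abs(today - mu) / sigma > z_threshold

# Hypothetical daily row counts emitted by a transformation job.
history = [10_120, 9_980, 10_340, 10_050, 10_210, 9_890, 10_160]
print(detect_volume_anomaly(history, today=4_300))   # True: likely a broken upstream load
print(detect_volume_anomaly(history, today=10_080))  # False: within normal variation
```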
How dbt Core helps data teams test, validate, and monitor complex data transformations and conversions. dbt Core, an open-source framework for developing, testing, and documenting SQL-based data transformations, has become a must-have tool for modern data teams as the complexity of data pipelines grows.
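dbt Core expresses tests such as `not_null` and `unique` declaratively against model columns; as a rough, hypothetical approximation of what those checks assert, here is a pandas sketch over a made-up model output (not dbt's actual implementation).

```python
import pandas as pd

# Hypothetical output of a transformation model.
orders = pd.DataFrame({
    "order_id": [1, 2, 2, 4],
    "customer_id": [10, 11, None, 12],
})

# Roughly what dbt's built-in `unique` and `not_null` tests check for a column.
duplicates = int(orders["order_id"].duplicated().sum())
missing = int(orders["customer_id"].isna().sum())

print(f"duplicate order_id values: {duplicates}")   # a `unique` test would fail here
print(f"null customer_id values: {missing}")        # a `not_null` test would fail here
```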
Selecting the strategies and tools for validating data transformations and data conversions in your data pipelines. Data transformations and data conversions are crucial to ensure that raw data is organized, processed, and ready for useful analysis.
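One common validation strategy is source-to-target reconciliation; the sketch below, using hypothetical source and target tables, checks that row counts, key sets, and totals survive a string-to-numeric conversion.

```python
import pandas as pd

# Hypothetical source extract and transformed target table.
source = pd.DataFrame({"id": [1, 2, 3], "amount": ["10.5", "20.0", "7.25"]})
target = pd.DataFrame({"id": [1, 2, 3], "amount": [10.5, 20.0, 7.25]})

checks = {
    # Row counts should survive the transformation unchanged (no silent drops).
    "row_count_matches": len(source) == len(target),
    # The string-to-float conversion should preserve the total amount.
    "sum_matches": abs(source["amount"].astype(float).sum() - target["amount"].sum()) < 1e-9,
    # Every source key should still be present after the transformation.
    "keys_preserved": set(source["id"]) == set(target["id"]),
}
print(checks)
```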
Data lineage is the journey data takes from its creation through its transformations over time. It describes a certain dataset’s origin, movement, characteristics and quality. Tracing the source of data is an arduous task. Data Lineage Use Case: From Tracing COVID-19’s Origins to Data-Driven Business.
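To make the idea concrete, here is a tiny, hypothetical representation of lineage as a list of hops, each recording where a dataset came from, where it went, and how it was changed; real lineage tools capture this automatically and at much finer grain.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

@dataclass
class LineageRecord:
    """One hop in a dataset's journey: where it came from and how it was changed."""
    source: str
    destination: str
    transformation: str
    recorded_at: datetime = field(default_factory=lambda: datetime.now(timezone.utc))

lineage = [
    LineageRecord("s3://raw/orders/", "warehouse.staging_orders", "load + deduplicate"),
    LineageRecord("warehouse.staging_orders", "warehouse.fct_orders", "join customers, aggregate daily"),
]
for hop in lineage:
    print(f"{hop.source} -> {hop.destination} ({hop.transformation})")
```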
In the final part of this three-part series, we’ll explore how data mesh bolsters performance and helps organizations and data teams work more effectively. Usually, organizations will combine different domain topologies, depending on the trade-offs, and choose to focus on specific aspects of data mesh.
Common challenges and practical mitigation strategies for reliable data transformations. Data transformations are important processes in data engineering, enabling organizations to structure, enrich, and integrate data for analytics, reporting, and operational decision-making.
In addition to using native managed AWS services that BMS didn’t need to worry about upgrading, BMS was looking to offer an ETL service to non-technical business users that could visually compose data transformation workflows and seamlessly run them on the AWS Glue Apache Spark-based serverless data integration engine.
DataOps (data operations) is an agile, process-oriented methodology for developing and delivering analytics. It brings together DevOps teams with data engineers and data scientists to provide the tools, processes, and organizational structures to support the data-focused enterprise. What is DataOps?
In today’s data-driven world, businesses are drowning in a sea of information. Traditional data integration methods struggle to bridge the gaps between these sources, hampered by high costs, data quality concerns, and inconsistencies. The result is a huge productivity loss.
For years, IT and business leaders have been talking about breaking down the data silos that exist within their organizations. Given the importance of sharing information among diverse disciplines in the era of digital transformation, this concept is arguably as important as ever. What are the challenges and potential rewards?
ChatGPT> DataOps, or data operations, is a set of practices and technologies that organizations use to improve the speed, quality, and reliability of their data analytics processes. The goal of DataOps is to help organizations make better use of their data to drive business decisions and improve outcomes.
We live in a world of data: There’s more of it than ever before, in a ceaselessly expanding array of forms and locations. Dealing with Data is your window into the ways data teams are tackling the challenges of this new world to help their companies and their customers thrive. What is data integrity? Data integrity risks.
Organizations can’t afford to mess up their data strategies, because too much is at stake in the digital economy. How enterprises gather, store, cleanse, access, and secure their data can be a major factor in their ability to meet corporate goals. Here are some data strategy mistakes IT leaders would be wise to avoid.
Modern data governance is a strategic, ongoing and collaborative practice that enables organizations to discover and track their data, understand what it means within a business context, and maximize its security, quality and value. The What: Data Governance Defined. Data governance has no standard definition.
Replace manual and recurring tasks for fast, reliable data lineage and overall data governance. It’s paramount that organizations understand the benefits of automating end-to-end data lineage. The importance of end-to-end data lineage is widely understood and ignoring it is risky business. Doing Data Lineage Right.
As the world is gradually becoming more dependent on data, the services, tools and infrastructure are all the more important for businesses in every sector. Data management has become a fundamental business concern, and especially for businesses that are going through a digital transformation. What is data management?
Data mesh is a new approach to data management. Companies across industries are using a data mesh to decentralize data management to improve data agility and get value from data. This is especially true in a large enterprise with thousands of data products.
With data becoming the driving force behind many industries today, having a modern data architecture is pivotal for organizations to be successful. In this post, we describe Orca’s journey building a transactional data lake using Amazon Simple Storage Service (Amazon S3), Apache Iceberg, and AWS Analytics.
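As a minimal sketch of the transactional-lake idea, here is a hypothetical PySpark session configured with an Iceberg Hadoop catalog backed by an S3 warehouse path; it assumes the Iceberg Spark runtime jar is on the classpath, and Orca's actual architecture described in the post may differ (for example, by using an AWS Glue catalog).

```python
from pyspark.sql import SparkSession

# Hypothetical catalog name and S3 warehouse path.
spark = (
    SparkSession.builder.appName("iceberg-lake")
    .config("spark.sql.extensions",
            "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
    .config("spark.sql.catalog.lake", "org.apache.iceberg.spark.SparkCatalog")
    .config("spark.sql.catalog.lake.type", "hadoop")
    .config("spark.sql.catalog.lake.warehouse", "s3://my-data-lake/warehouse/")
    .getOrCreate()
)

# Iceberg tables add ACID inserts, updates, and time travel on top of S3 objects.
spark.sql("CREATE TABLE IF NOT EXISTS lake.db.events (id BIGINT, ts TIMESTAMP, payload STRING) USING iceberg")
spark.sql("INSERT INTO lake.db.events VALUES (1, current_timestamp(), 'created')")
spark.sql("SELECT * FROM lake.db.events").show()
```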
In our last blog, we delved into the seven most prevalent data challenges that can be addressed with effective data governance. Today we will share our approach to developing a data governance program to drive data transformation and fuel a data-driven culture. Don’t try to do everything at once!
In the era of big data, organizations are grappling with the challenge of effectively managing and leveraging vast amounts of data. Two prominent approaches have emerged: self-service data management and centralized data management. This approach offers greater control and reduces the risk of data inconsistencies.
In today’s data-centric world, organizations often tout data as their most valuable asset. However, many struggle to maintain reliable, trustworthy data amidst complex, evolving environments. This challenge is especially critical for executives responsible for data strategy and operations.
Organizations have spent a lot of time and money trying to harmonize data across diverse platforms, including cleansing, uploading metadata, converting code, defining business glossaries, tracking data transformations and so on. So questions linger about whether transformed data can be trusted.
What Is Data Governance In The Public Sector? Effective data governance for the public sector enables entities to ensure dataquality, enhance security, protect privacy, and meet compliance requirements. With so much focus on compliance, democratizing data for self-service analytics can present a challenge.
For example, GPS, social media, and cell phone handoffs are modeled as graphs, while data catalogs, data lineage and MDM tools leverage knowledge graphs for linking metadata with semantics. Both nodes and edges have associated properties modeled as key-values with primitive data types and are single-valued.
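A tiny property-graph sketch of that idea, using networkx and hypothetical metadata nodes: nodes and edges carry key-value attributes with primitive types, and traversals answer lineage-style questions.

```python
import networkx as nx

# A tiny property graph: nodes and edges carry key-value attributes with primitive types.
g = nx.DiGraph()
g.add_node("dataset:orders", kind="table", owner="sales", row_count=120000)
g.add_node("dashboard:revenue", kind="dashboard", owner="finance")
g.add_edge("dataset:orders", "dashboard:revenue", relation="feeds", refreshed_daily=True)

# Traversing the graph answers questions such as "what does this table feed?"
for _, downstream, attrs in g.out_edges("dataset:orders", data=True):
    print(f"orders -> {downstream} ({attrs['relation']})")
```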
Data people face a challenge. They must put high-quality data into the hands of users as efficiently as possible. As the latest iteration in this pursuit of high-quality data sharing, DataOps combines a range of disciplines. It synthesizes all we’ve learned about agile, data quality, and ETL/ELT.
Data flow lineage is crucial for anyone handling data within organizations. In essence, data flow lineage is indispensable for ensuring transparency, maintaining data quality, achieving compliance, enabling efficient troubleshooting, conducting impact analysis, and enhancing collaboration within organizations.
Teams may struggle to have a unified view of software quality due to siloed testing across many disparate tools. For users that require a unified view of software quality, this is unacceptable. Although Tricentis has amassed such data over a decade, the data remains untapped for valuable insights.
In 2021, Showpad set forth the vision to use the power of data to unlock innovations and drive business decisions across its organization. In this post, we share how Showpad used QuickSight to streamline data and insights access across teams and customers. Showpad migrated over 70 dashboards with over 1,000 visuals.
These initiatives utilize interconnected devices and automated machines that create a hyperbolic increase in data volumes. This type of growth has stressed legacy data management systems and makes it nearly impossible to implement a profitable data-centered solution. High-level example of a common machine learning lifecycle.
Healthcare is changing, and it all comes down to data. Data & analytics represents a major opportunity to tackle these challenges. Indeed, many healthcare organizations today are embracing digital transformation and using data to enhance operations. How can data help change how care is delivered?
Prelude… I recently came across an article in Marketing Week with the clickbait-worthy headline of Why the rise of the chief data officer will be short-lived (their choice of capitalisation). Where organisations have appointed one, I have normally found the people holding these roles to be better informed about data matters than their peers.
A data pipeline is a series of processes that move raw data from one or more sources to one or more destinations, often transforming and processing the data along the way. This can include tasks such as data ingestion, cleansing, filtering, aggregation, or standardization.
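A compact, hypothetical pandas sketch of those stages over a made-up extract: cleansing and standardization, filtering out bad rows, and aggregating into the shape a destination typically expects.

```python
import pandas as pd

# Hypothetical raw source: one extract with messy values.
raw = pd.DataFrame({
    "customer": ["  Alice ", "BOB", None, "carol"],
    "amount": ["10.50", "20", "5.00", "not_a_number"],
})

# Cleansing and standardization: trim/normalize names, coerce amounts to numbers.
clean = raw.assign(
    customer=raw["customer"].str.strip().str.title(),
    amount=pd.to_numeric(raw["amount"], errors="coerce"),
)

# Filtering: drop rows that failed cleansing.
valid = clean.dropna(subset=["customer", "amount"])

# Aggregation: the shape most destinations (warehouse tables, dashboards) expect.
summary = valid.groupby("customer", as_index=False)["amount"].sum()
print(summary)
```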
The quick and dirty definition of data mapping is the process of connecting different types of data from various data sources. Data mapping is a crucial step in data modeling and can help organizations achieve their business goals by enabling data integration, migration, transformation, and quality.
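In its simplest form, a data mapping is just a lookup from source field names to target schema fields; the sketch below uses a hypothetical CRM-export mapping to rename a record into the target shape.

```python
# A hypothetical mapping from source field names (a CRM export) to target schema fields.
FIELD_MAP = {
    "cust_nm": "customer_name",
    "cust_email_addr": "email",
    "ord_amt": "order_amount",
}

def map_record(source_record: dict) -> dict:
    """Rename source fields to the target schema, dropping anything unmapped."""
    return {target: source_record[source]
            for source, target in FIELD_MAP.items()
            if source in source_record}

print(map_record({"cust_nm": "Alice", "cust_email_addr": "alice@example.com", "ord_amt": 42.0}))
```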
Unleashing GenAI: Ensuring Data Quality at Scale (Part 2). Transitioning from individual repository source systems to consolidated AI LLM pipelines requires automated checks, end-to-end observability, and compliance with enterprise business rules. First: it is critical to set up a thorough data inventory and assessment procedure.