1) What Is Data Quality Management? 4) Data Quality Best Practices. 5) How Do You Measure Data Quality? 6) Data Quality Metrics Examples. 7) Data Quality Control: Use Case. 8) The Consequences Of Bad Data Quality. 9) 3 Sources Of Low-Quality Data.
As technology and business leaders, your strategic initiatives, from AI-powered decision-making to predictive insights and personalized experiences, are all fueled by data. Yet, despite growing investments in advanced analytics and AI, organizations continue to grapple with a persistent and often underestimated challenge: poor data quality.
Data collections are the ones and zeroes that encode the actionable insights (patterns, trends, relationships) that we seek to extract from our data through machine learning and data science. Datasphere is a data discovery tool with essential functionalities: recommendations, a data marketplace, and business content.
However, Great Expectations (GX) sets itself apart as a robust, open-source framework that helps data teams maintain consistent and transparent data quality standards. Instead of relying on ad hoc scripts or manual checks, Great Expectations codifies data quality rules into structured Expectation Suites.
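To make this concrete, here is a minimal sketch of an Expectation Suite, assuming the GX Core 1.x style of API; the suite name and columns ("orders_quality", "order_id", "amount") are invented for illustration, and exact method names vary across Great Expectations versions:

```python
# A minimal sketch of codifying data quality rules as an Expectation Suite.
# Assumes the GX Core 1.x API; the suite and column names are hypothetical.
import great_expectations as gx
import great_expectations.expectations as gxe

context = gx.get_context()  # ephemeral context by default

suite = context.suites.add(gx.ExpectationSuite(name="orders_quality"))
suite.add_expectation(gxe.ExpectColumnValuesToNotBeNull(column="order_id"))
suite.add_expectation(
    gxe.ExpectColumnValuesToBeBetween(column="amount", min_value=0)
)
```

Because the rules live in a named, versionable suite rather than in ad hoc scripts, the same checks can be run against every new batch of data.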
By implementing automated validation, AI-driven regression testing, real-time canary pipelines, synthetic data generation, freshness enforcement, KPI tracking, and CI/CD automation, organizations can shift from reactive data observability to proactive data quality assurance. Summary: Why this order?
It's Essential: Verifying Data Transformations (Part 4). Uncovering the leading problems in data transformation workflows and practical ways to detect and prevent them. In Parts 1-3 of this series of blogs, categories of data transformations were identified as among the top causes of data quality defects in data pipeline workflows.
Complex Data Transformations: Test Planning Best Practices. Ensuring data accuracy with structured testing and best practices. Introduction: Data transformations and conversions are crucial for data pipelines, enabling organizations to process, integrate, and refine raw data into meaningful insights.
How dbt Core helps data teams test, validate, and monitor complex data transformations and conversions. Introduction: dbt Core, an open-source framework for developing, testing, and documenting SQL-based data transformations, has become a must-have tool for modern data teams as the complexity of data pipelines grows.
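As an illustration, the tests defined in a dbt project can also be invoked programmatically; a minimal sketch, assuming dbt-core 1.5 or later and a hypothetical stg_orders model in the current working directory:

```python
# A minimal sketch of running dbt Core tests from Python (dbt-core >= 1.5).
# "stg_orders" is a hypothetical model name used only for illustration.
from dbt.cli.main import dbtRunner

runner = dbtRunner()
result = runner.invoke(["test", "--select", "stg_orders"])
print("tests passed" if result.success else "tests failed")
```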
In the following section, two use cases demonstrate how the data mesh is established with Amazon DataZone to better facilitate machine learning for an IoT-based digital twin and BI dashboards and reporting using Tableau. In the past, one-to-one connections were established between Tableau and respective applications.
These tools include enterprise service bus (ESB) products; data integration tools; extract, transform, and load (ETL) tools; procedural code; application programming interfaces (APIs); file transfer protocol (FTP) processes; and even business intelligence (BI) reports that further aggregate and transform data.
AI is transforming how senior data engineers and data scientists validate data transformations and conversions. Artificial intelligence-based verification approaches aid in the detection of anomalies, the enforcement of data integrity, and the optimization of pipelines for improved efficiency.
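As a simple illustration of the idea (not any particular vendor's method), an off-the-shelf outlier detector can flag suspicious pipeline metrics; a sketch using scikit-learn's IsolationForest, with invented daily row counts:

```python
# A minimal sketch of ML-assisted anomaly detection on pipeline metrics.
# The row counts below are hypothetical values for illustration.
import numpy as np
from sklearn.ensemble import IsolationForest

# Daily row counts emitted by a transformation job; the last day looks wrong.
row_counts = np.array([[10_230], [10_410], [10_180], [10_520], [2_075]])

detector = IsolationForest(contamination=0.2, random_state=42)
labels = detector.fit_predict(row_counts)  # -1 flags an outlier
for count, label in zip(row_counts.ravel(), labels):
    print(count, "ANOMALY" if label == -1 else "ok")
```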
Alation and Bigeye have partnered to bring data observability and data quality monitoring into the data catalog. Read to learn how our newly combined capabilities put more trustworthy, quality data into the hands of those who are best equipped to leverage it. trillion each year due to poor data quality.
Yet as companies fight for skilled analysts to utilize data to make better decisions, they often fall short in improving the data supply chain and the resulting data quality. Without solid data supply-chain management practices in place, data quality often suffers. First mile/last mile impacts.
In this post, we'll see the fundamental procedures, tools, and techniques that data engineers, data scientists, and QA/testing teams use to ensure high-quality data as soon as it's deployed. First, we look at how unit and integration tests uncover transformation errors at an early stage.
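For example, a unit test can pin down the expected behavior of a single transformation before it ever reaches production; a minimal pytest sketch, where normalize_amounts is a hypothetical transform invented for illustration:

```python
# A minimal sketch of a unit test for a transformation function (pytest).
# normalize_amounts and its columns are hypothetical examples.
import pandas as pd

def normalize_amounts(df: pd.DataFrame) -> pd.DataFrame:
    """Convert cents to dollars and drop rows with negative amounts."""
    out = df.copy()
    out["amount"] = out["amount_cents"] / 100
    return out[out["amount"] >= 0].drop(columns=["amount_cents"])

def test_normalize_amounts_drops_negatives():
    raw = pd.DataFrame({"amount_cents": [1250, -300, 0]})
    result = normalize_amounts(raw)
    assert list(result["amount"]) == [12.5, 0.0]
```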
Common challenges and practical mitigation strategies for reliable data transformations. Introduction: Data transformations are important processes in data engineering, enabling organizations to structure, enrich, and integrate data for analytics, reporting, and operational decision-making.
Given the importance of sharing information among diverse disciplines in the era of digital transformation, this concept is arguably as important as ever. The aim is to normalize, aggregate, and ultimately make data that originates in various pockets of the enterprise available to analysts across the organization.
In this post, we delve into a case study for a retail use case, exploring how the Data Build Tool (dbt) was used effectively within an AWS environment to build a high-performing, efficient, and modern data platform. It does this by helping teams handle the T in ETL (extract, transform, and load) processes.
According to erwin’s “2020 State of Data Governance and Automation” report , close to 70 percent of data professional respondents say they spend an average of 10 or more hours per week on data-related activities, and most of that time is spent searching for and preparing data.
In addition to using native managed AWS services that BMS didn't need to worry about upgrading, BMS was looking to offer an ETL service that non-technical business users could use to visually compose data transformation workflows and seamlessly run them on the AWS Glue Apache Spark-based serverless data integration engine.
However, you might face significant challenges when planning a large-scale data warehouse migration. Trace the flow of data from its origins in the source systems, through the data warehouse, and ultimately to its consumption by reporting, analytics, and other downstream processes.
“Establishing data governance rules helps organizations comply with these regulations, reducing the risk of legal and financial penalties. Clear governance rules can also help ensure dataquality by defining standards for data collection, storage, and formatting, which can improve the accuracy and reliability of your analysis.”
And when you talk about that question at a high level, he says, you get a very simple answer: "the only thing we want to have is the right data with the right quality to the right person at the right time at the right cost." The Why: Data Governance Drivers. Why should companies care about data governance?
There are countless examples of big data transforming many different industries. There is no disputing the fact that the collection and analysis of massive amounts of unstructured data has been a huge breakthrough. Data virtualization is ideal in any situation where the following is necessary: information coming from diverse data sources.
Uncomfortable truth incoming: Most people in your organization don’t think about the quality of their data from intake to production of insights. However, as a data team member, you know how important data integrity (and a whole host of other aspects of data management) is.
Before we dive in, let's define strands of AI, Machine Learning, and Data Science: Business intelligence (BI) leverages software and services to transform data into actionable insights that inform an organization's strategic and tactical business decisions.
An IDC report estimated the global IT developer shortage will reach four million by 2025, leaving businesses struggling to accelerate digital transformation without the needed workforce.
The success of any business into the next year and beyond will depend entirely on the volume, accuracy, and reportability of the data they collect—and how well the business can analyze, extract insight from, and take action on that data. Enter the Warehouse.
OntoRefine is a data transformation tool that lets you unite plenty of data formats and get them into your triplestore. That way, we can simplify our lives in the future, so when we seek reports for one building (say Building123), we also get information about the "other" building (BuildingABC), which is at the same address.
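To sketch the underlying idea (this is not OntoRefine's own API), two building records sharing an address can be linked and queried in a triplestore; a minimal example using Python's rdflib, with invented URIs and an invented address:

```python
# A minimal sketch of linking two building records by a shared address in RDF.
# All URIs and the address below are hypothetical, for illustration only.
from rdflib import Graph, Literal, Namespace

EX = Namespace("http://example.org/")
g = Graph()
g.add((EX.Building123, EX.hasAddress, Literal("1 Main St")))
g.add((EX.BuildingABC, EX.hasAddress, Literal("1 Main St")))

# Find every building that shares an address with Building123.
query = """
SELECT ?other WHERE {
  <http://example.org/Building123> <http://example.org/hasAddress> ?addr .
  ?other <http://example.org/hasAddress> ?addr .
  FILTER (?other != <http://example.org/Building123>)
}
"""
for row in g.query(query):
    print(row.other)  # prints http://example.org/BuildingABC
```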
In our last blog, we delved into the seven most prevalent data challenges that can be addressed with effective data governance. Today we will share our approach to developing a data governance program to drive data transformation and fuel a data-driven culture.
Prior to the creation of the data lake, Orca’s data was distributed among various data silos, each owned by a different team with its own data pipelines and technology stack. Moreover, running advanced analytics and ML on disparate data sources proved challenging.
With Octopai’s support and analysis of Azure Data Factory, enterprises can now view complete end-to-end data lineage from Azure Data Factory all the way through to reporting for the first time ever.
Organizations have spent a lot of time and money trying to harmonize data across diverse platforms, including cleansing, uploading metadata, converting code, defining business glossaries, tracking data transformations and so on. So questions linger about whether transformed data can be trusted.
Some of the key benefits of DataOps include: Improved speed and reliability: By automating and streamlining data-related tasks and processes, DataOps helps organizations accelerate the development and deployment of data-driven solutions and improve the reliability of their data analytics and machine learning initiatives.
Traditional data integration methods struggle to bridge these gaps, hampered by high costs, data quality concerns, and inconsistencies. Studies reveal that businesses lose significant time and opportunities due to missing integrations and poor data quality and accessibility.
The company decided to use AWS to unify its business intelligence (BI) and reporting strategy for both internal organization-wide use cases and in-product embedded analytics targeted at its customers. In this post, we share how Showpad used QuickSight to streamline data and insights access across teams and customers.
Although Tricentis has amassed such data over a decade, the data remains untapped for valuable insights. Each of these tools has its own reporting capabilities that make it difficult to combine the data for integrated and actionable business insights. Finally, data integrity is of paramount importance.
Businesses face significant hurdles when preparing data for artificial intelligence (AI) applications. The existence of data silos and duplication, alongside apprehensions regarding data quality, presents a multifaceted environment for organizations to manage. With Netezza support for 1.2
Most of the time, the article does nothing more than reflect the continuing confusion about whether or not organisations need CDOs and, assuming that they do, what their remit should be and who they should report to [4]. It may well be that one thing a CDO needs to get going is a data transformation programme.
To make good on this potential, healthcare organizations need to understand their data and how they can use it. These systems should collectively maintain data quality, integrity, and security, so the organization can use data effectively and efficiently. Why Is Data Governance in Healthcare Important?
Extract, Transform and Load (ETL) refers to the process of connecting to data sources, integrating data from various sources, improving data quality, aggregating it, and then storing it in a staging data store, data marts, or data warehouses for consumption by various business applications, including BI, analytics, and reporting.
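A minimal sketch of the pattern in pandas, with invented file names and columns:

```python
# A minimal sketch of an ETL step; all paths and column names are hypothetical.
import pandas as pd

# Extract: read from two source files.
orders = pd.read_csv("orders.csv")
customers = pd.read_csv("customers.csv")

# Transform: clean, join, and aggregate.
orders = orders.dropna(subset=["customer_id"])
merged = orders.merge(customers, on="customer_id", how="inner")
daily = merged.groupby("order_date", as_index=False)["amount"].sum()

# Load: write to a staging area for BI and reporting tools to consume.
daily.to_parquet("staging/daily_revenue.parquet", index=False)
```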
A data pipeline is a series of processes that move raw data from one or more sources to one or more destinations, often transforming and processing the data along the way. This can include tasks such as data ingestion, cleansing, filtering, aggregation, or standardization.
The quick and dirty definition of data mapping is the process of connecting different types of data from various data sources. Data mapping is a crucial step in data modeling and can help organizations achieve their business goals by enabling data integration, migration, transformation, and quality.
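At its simplest, a field-level mapping is just a lookup from source names to target names; a small sketch with invented field names:

```python
# A minimal sketch of a source-to-target field mapping; names are invented.
FIELD_MAP = {
    "cust_nm": "customer_name",
    "addr_1": "street_address",
    "zip": "postal_code",
}

def apply_mapping(record: dict) -> dict:
    """Rename source fields to the target schema, keeping unmapped fields."""
    return {FIELD_MAP.get(key, key): value for key, value in record.items()}

print(apply_mapping({"cust_nm": "Ada", "zip": "02139"}))
# {'customer_name': 'Ada', 'postal_code': '02139'}
```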
Between complex data structures, data security questions, and error-prone manual processes, merging data from disparate sources into a single system can quickly turn your routine reporting processes into a stressful and time-consuming ordeal.
Given your organization's focus on productivity, you know your team will soon be working in a divided reporting environment. While the cloud infrastructure promises to bring positive changes, your company's data will exist in both worlds: on-prem and the cloud.