1) What Is Data Quality Management? 4) Data Quality Best Practices. 5) How Do You Measure Data Quality? 6) Data Quality Metrics Examples. 7) Data Quality Control: Use Case. 8) The Consequences Of Bad Data Quality. 9) 3 Sources Of Low-Quality Data.
In a recent presentation at the SAPSA Impuls event in Stockholm, George Sandu, IKEA’s Master Data Leader, shared the company’s data transformation story, offering valuable lessons for organizations navigating similar challenges. “Every flow in our supply chain represents a data flow,” Sandu explained.
As technology and business leaders, your strategic initiatives, from AI-powered decision-making to predictive insights and personalized experiences, are all fueled by data. Yet, despite growing investments in advanced analytics and AI, organizations continue to grapple with a persistent and often underestimated challenge: poor data quality.
The need for streamlined data transformations: as organizations increasingly adopt cloud-based data lakes and warehouses, the demand for efficient data transformation tools has grown. Such tools save time and effort, especially for teams looking to minimize infrastructure management and focus solely on data modeling.
Datasphere goes beyond the “big three” data usage end-user requirements (ease of discovery, access, and delivery) to include data orchestration (data ops and data transformations) and business data contextualization (semantics, metadata, catalog services).
Domain ownership recognizes that the teams generating the data have the deepest understanding of it and are therefore best suited to manage, govern, and share it effectively. This principle keeps data accountability close to the source, fostering higher data quality and relevance.
In addition to real-time analytics and visualization, the data needs to be shared for long-term data analytics and machine learning applications. To achieve this, EUROGATE designed an architecture that uses Amazon DataZone to publish specific digital twin data sets, enabling access to them with SageMaker in a separate AWS account.
How dbt Core helps data teams test, validate, and monitor complex data transformations and conversions. dbt Core, an open-source framework for developing, testing, and documenting SQL-based data transformations, has become a must-have tool for modern data teams as the complexity of data pipelines grows.
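To make the idea concrete, here is a minimal Python sketch of what dbt's built-in not_null and unique schema tests check under the hood. dbt itself expresses these tests in YAML and compiled SQL, not Python, and the table and column names below are invented for the example:

```python
# Illustrative sketch of the checks dbt schema tests encode, run here
# against an in-memory SQLite database. Table/column names are assumptions.
import sqlite3

def assert_not_null(conn, table, column):
    # A dbt `not_null` test compiles to roughly this query:
    # any rows returned mean the test fails.
    failures = conn.execute(
        f"SELECT COUNT(*) FROM {table} WHERE {column} IS NULL"
    ).fetchone()[0]
    assert failures == 0, f"{failures} NULL values in {table}.{column}"

def assert_unique(conn, table, column):
    # A dbt `unique` test: flag values that appear more than once.
    failures = conn.execute(
        f"SELECT COUNT(*) FROM (SELECT {column} FROM {table} "
        f"GROUP BY {column} HAVING COUNT(*) > 1)"
    ).fetchone()[0]
    assert failures == 0, f"{failures} duplicate values in {table}.{column}"

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (order_id INTEGER, amount REAL)")
conn.execute("INSERT INTO orders VALUES (1, 9.99), (2, 24.50)")
assert_not_null(conn, "orders", "order_id")
assert_unique(conn, "orders", "order_id")
```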
Furthermore, the introduction of AI and ML models hastened the need to be more efficient and effective in deploying new technologies. Similarly, Workiva was driven to DataOps due to an increased need for analytics agility to meet a range of organizational needs, such as real-time dashboard updates or ML model training and monitoring.
AI is transforming how senior data engineers and data scientists validate data transformations and conversions. Artificial intelligence-based verification approaches aid in the detection of anomalies, the enforcement of data integrity, and the optimization of pipelines for improved efficiency.
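One simple instance of that idea, sketched below with assumed thresholds and illustrative row counts, is flagging a pipeline load whose volume is a statistical outlier against recent history:

```python
# Minimal anomaly check for pipeline validation: flag a daily row count
# that deviates sharply from recent loads. Threshold and counts are
# illustrative assumptions.
from statistics import mean, stdev

def is_anomalous(history, todays_count, z_threshold=3.0):
    """Return True if today's row count is a statistical outlier."""
    mu, sigma = mean(history), stdev(history)
    if sigma == 0:
        return todays_count != mu
    return abs(todays_count - mu) / sigma > z_threshold

row_counts = [10_120, 10_340, 9_980, 10_205, 10_410]  # recent loads
print(is_anomalous(row_counts, 10_300))  # False: within normal range
print(is_anomalous(row_counts, 2_500))   # True: likely a broken extract
```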
Yet as companies compete for skilled analysts who can use data to make better decisions, they often fall short in improving the data supply chain and the resulting data quality. Without solid data supply-chain management practices in place, data quality often suffers. First-mile/last-mile impacts are a case in point.
It seeks to improve the way data are managed and products are created, and to coordinate these improvements with the goals of the business. According to Gartner, DataOps also aims “to deliver value faster by creating predictable delivery and change management of data, data models, and related artifacts.”
They may also learn from evidence, but the data and the modelling fundamentally come from humans in some way. Data Science – Data science is the field of study that combines domain expertise, programming skills, and knowledge of mathematics and statistics to extract meaningful insights from data.
However, this partnership model cannot keep pace with an always-changing technology landscape in which the skill gaps and lack of resources are increasing. The new models recognise this, drawing tech vendors to shift toward innovation-focused roles and become partners in the client’s success.
Given the importance of sharing information among diverse disciplines in the era of digital transformation, this concept is arguably as important as ever. The aim is to normalize, aggregate, and eventually make available to analysts across the organization data that originates in various pockets of the enterprise.
“Establishing data governance rules helps organizations comply with these regulations, reducing the risk of legal and financial penalties. Clear governance rules can also help ensure data quality by defining standards for data collection, storage, and formatting, which can improve the accuracy and reliability of your analysis.”
Companies still often accept the risk of using internal data when exploring large language models (LLMs) because this contextual data is what enables LLMs to change from general-purpose to domain-specific knowledge. In the generative AI or traditional AI development cycle, data ingestion serves as the entry point.
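As a minimal sketch of that ingestion entry point, the snippet below splits internal documents into overlapping chunks that could later be embedded and indexed for a domain-specific LLM workflow; the chunk sizes and file name are illustrative assumptions:

```python
# Split internal documents into overlapping text chunks, the usual first
# step before embedding and indexing. Sizes and the source file are
# hypothetical placeholders.
def chunk_text(text: str, size: int = 500, overlap: int = 50) -> list[str]:
    chunks = []
    step = size - overlap  # advance less than `size` so chunks overlap
    for start in range(0, len(text), step):
        chunk = text[start:start + size]
        if chunk.strip():
            chunks.append(chunk)
    return chunks

document = open("internal_policy.txt").read()  # hypothetical source file
for i, chunk in enumerate(chunk_text(document)):
    print(i, len(chunk))
```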
“All they would have to do is just build their model and run with it,” he says. But to augment its various businesses with ML and AI, Iyengar’s team first had to break down data silos within the organization and transform the company’s data operations. For now, it operates under a centralized “hub and spokes” model.
There are countless examples of big data transforming many different industries. There is no disputing the fact that the collection and analysis of massive amounts of unstructured data has been a huge breakthrough. How does Data Virtualization manage data quality requirements?
For example, GPS, social media, cell phone handoffs are modeled as graphs while data catalogs, data lineage and MDM tools leverage knowledge graphs for linking metadata with semantics. Knowledge graphs model knowledge of a domain as a graph with a network of entities and relationships.
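A toy version of that idea, with invented entities and relation names, might look like the following networkx sketch:

```python
# Model domain knowledge as a graph of entities with typed relationships.
# Entities and relations below are invented for illustration.
import networkx as nx

kg = nx.MultiDiGraph()
kg.add_edge("CustomerTable", "SalesDataset", relation="part_of")
kg.add_edge("SalesDataset", "RevenueDashboard", relation="feeds")
kg.add_edge("CustomerTable", "GDPR", relation="governed_by")

# Ask: which assets point directly at GDPR, and how?
for node, _, data in kg.in_edges("GDPR", data=True):
    print(node, data["relation"], "GDPR")  # CustomerTable governed_by GDPR
```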
So companies will be forced to classify their data and to find mechanisms to share it with such platforms.” GDPR is also proving to be the de facto model for data privacy across the United States. You can request a demo of erwin Data Intelligence here.
In this post, we delve into a case study for a retail use case, exploring how the Data Build Tool (dbt) was used effectively within an AWS environment to build a high-performing, efficient, and modern data platform. It does this by helping teams handle the T in ETL (extract, transform, and load) processes.
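As a hedged illustration of that “T” step, here is what a small transform looks like in pandas; dbt models express the same logic in SQL, and the column names below are assumptions for the example:

```python
# An illustrative pandas version of the "T" in ETL: clean raw values,
# derive consistent fields, and aggregate. Columns are hypothetical.
import pandas as pd

raw = pd.DataFrame({
    "order_id": [1, 2, 3],
    "amount": ["10.5", "20.0", "7.25"],   # arrives as strings
    "country": ["se", "SE", "de"],        # inconsistent casing
})

transformed = (
    raw.assign(
        amount=lambda d: d["amount"].astype(float),
        country=lambda d: d["country"].str.upper(),
    )
    .groupby("country", as_index=False)["amount"].sum()
)
print(transformed)
```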
DataOps involves close collaboration between data scientists, IT professionals, and business stakeholders, and it often involves the use of automation and other technologies to streamline data-related tasks. One of the key benefits of DataOps is the ability to accelerate the development and deployment of data-driven solutions.
Every data professional knows that ensuring data quality is vital to producing usable query results. Streaming data can be extra challenging in this regard, as it tends to be “dirty,” with new fields that are added without warning and frequent mistakes in the data collection process. Now, let’s connect to Redshift.
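As a hedged sketch of that connection step, the snippet below uses the redshift_connector package; the host, database, credentials, and table queried are placeholders, not real values:

```python
# Connect to Redshift with the redshift_connector package and run a
# simple count. All connection details and the table are placeholders.
import redshift_connector

conn = redshift_connector.connect(
    host="example-cluster.abc123.eu-west-1.redshift.amazonaws.com",
    database="analytics",
    user="analyst",
    password="***",
)
cursor = conn.cursor()
cursor.execute("SELECT COUNT(*) FROM events")  # hypothetical table
print(cursor.fetchone())
```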
However, you might face significant challenges when planning for a large-scale data warehouse migration. Trace the flow of data from its origins in the source systems, through the data warehouse, and ultimately to its consumption by reporting, analytics, and other downstream processes.
Prior to the creation of the data lake, Orca’s data was distributed among various data silos, each owned by a different team with its own data pipelines and technology stack. Moreover, running advanced analytics and ML on disparate data sources proved challenging.
In this blog, we’ll delve into the critical role of governance and data modeling tools in supporting a seamless data mesh implementation and explore how erwin tools can be used in that role. erwin also provides data governance, metadata management and data lineage software called erwin Data Intelligence by Quest.
This is especially beneficial when teams need to increase data product velocity with trust and data quality, reduce communication costs, and help data solutions align with business objectives. In most enterprises, data is needed and produced by many business units but owned and trusted by no one.
In our last blog, we delved into the seven most prevalent data challenges that can be addressed with effective data governance. Today we will share our approach to developing a data governance program to drive data transformation and fuel a data-driven culture.
Dealing with spreadsheets via OntoRefine: surveyors already use spreadsheets, and OntoRefine is a data transformation tool that lets you unify many data formats and get them into your triplestore. Ensuring data quality with SHACL: the previous step covers data format differences, but everyone makes mistakes.
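As a hedged sketch of the SHACL idea, the snippet below uses the pyshacl package to validate a tiny invented RDF graph against a shape that requires a numeric area value:

```python
# Validate an RDF data graph against a SHACL shapes graph with pyshacl.
# Both graphs are invented for the example.
from rdflib import Graph
from pyshacl import validate

data = Graph().parse(data="""
@prefix ex: <http://example.org/> .
ex:plot1 a ex:Plot ; ex:area "not-a-number" .
""", format="turtle")

shapes = Graph().parse(data="""
@prefix sh: <http://www.w3.org/ns/shacl#> .
@prefix ex: <http://example.org/> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
ex:PlotShape a sh:NodeShape ;
    sh:targetClass ex:Plot ;
    sh:property [ sh:path ex:area ; sh:datatype xsd:decimal ] .
""", format="turtle")

conforms, _, report_text = validate(data, shacl_graph=shapes)
print(conforms)      # False: ex:area is not a valid xsd:decimal
print(report_text)   # human-readable violation report
```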
Yet every dbt transformation contains vital metadata that is not captured – until now. Making this data visible in the data catalog will let data teams share their work, support re-use, and empower everyone to better understand and trust data. Data Transformation in the Modern Data Stack.
OpenSearch Service also has vector database capabilities that let you implement semantic search and Retrieval Augmented Generation (RAG) with large language models (LLMs) to build recommendation and media search engines. AWS Glue provides both visual and code-based interfaces to make data integration effortless.
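To illustrate the query shape involved, here is a hedged sketch using the opensearch-py client; the index name, vector field, and toy four-dimensional vector are assumptions (real embeddings from an LLM or embedding model have hundreds of dimensions):

```python
# Run a k-NN (vector similarity) query against an OpenSearch index,
# the building block of semantic search and RAG retrieval.
# Index name, field name, and the query vector are placeholders.
from opensearchpy import OpenSearch

client = OpenSearch(hosts=[{"host": "localhost", "port": 9200}])

query = {
    "size": 3,
    "query": {
        "knn": {
            "embedding": {                      # hypothetical vector field
                "vector": [0.1, 0.2, 0.3, 0.4],  # toy query embedding
                "k": 3,
            }
        }
    },
}
response = client.search(index="documents", body=query)
for hit in response["hits"]["hits"]:
    print(hit["_score"], hit["_id"])
```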
They invested heavily in data infrastructure and hired a talented team of data scientists and analysts. The goal was to develop sophisticated data products, such as predictive analytics models to forecast patient needs, patient care optimization tools, and operational efficiency dashboards.
Organizations have spent a lot of time and money trying to harmonize data across diverse platforms, including cleansing, uploading metadata, converting code, defining business glossaries, tracking data transformations and so on. So questions linger about whether transformed data can be trusted.
A data warehouse stores transactional level details and serves the broader reporting and analytical needs of an organization – creating one source of truth for building semantic models or serving structured, simplified and harmonized data to tools like Power BI, Excel or even SSRS.
Background: a data-driven organization succeeds by recognizing data as a key enabler of increased and sustained innovation. The goal of a data product is to solve the long-standing issues of data silos and data quality. It follows what is called a distributed system architecture.
What Is Data Governance In The Public Sector? Effective data governance for the public sector enables entities to ensure dataquality, enhance security, protect privacy, and meet compliance requirements. With so much focus on compliance, democratizing data for self-service analytics can present a challenge.
Traditional data integration methods struggle to bridge these gaps, hampered by high costs, data quality concerns, and inconsistencies. Studies reveal that businesses lose significant time and opportunities due to missing integrations and poor data quality and accessibility.
By enabling data scientists to rapidly iterate through model development, validation, and deployment, DataRobot provides the tools to blitz through steps four and five of the machine learning lifecycle with AutoML and Auto Time-Series capabilities. Train, Compare, Rank, Validate, and Select Models for Production.
Just as a navigation app provides a detailed map of roads, guiding you from your starting point to your destination while highlighting every turn and intersection, data flow lineage offers a comprehensive view of data movement and transformations throughout its lifecycle.
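A minimal sketch of that view, with invented asset names, is an upstream graph walk over an edge list of "source -> target" hops:

```python
# Represent data flow lineage as edges and trace every upstream origin
# of one downstream asset. Asset names are invented for the example.
from collections import defaultdict

edges = [
    ("crm_db", "staging.customers"),
    ("erp_db", "staging.orders"),
    ("staging.customers", "marts.revenue"),
    ("staging.orders", "marts.revenue"),
]

upstream = defaultdict(set)
for src, dst in edges:
    upstream[dst].add(src)

def trace(asset, seen=None):
    """Recursively collect every upstream source of `asset`."""
    seen = seen or set()
    for src in upstream.get(asset, ()):
        if src not in seen:
            seen.add(src)
            trace(src, seen)
    return seen

print(trace("marts.revenue"))
# {'crm_db', 'erp_db', 'staging.customers', 'staging.orders'}
```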
Businesses face significant hurdles when preparing data for artificial intelligence (AI) applications. Data silos and duplication, alongside concerns about data quality, create a multifaceted environment for organizations to manage.
Healthcare is changing, and it all comes down to data. Leaders in healthcare seek to improve patient outcomes, meet changing business models (including value-based care ), and ensure compliance while creating better experiences. Data & analytics represents a major opportunity to tackle these challenges.
Finally, data integrity is of paramount importance. Every event in the data source can be relevant, and our customers don’t tolerate data loss, poor dataquality, or discrepancies between the source and Tricentis Analytics. Fixed-size data files avoid further latency due to unbound file sizes.
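One basic form of that integrity guarantee, sketched below with placeholder count functions standing in for real source and target queries, is row-count reconciliation between the source and the analytics target:

```python
# Reconcile row counts between a source system and an analytics target
# so data loss or duplication is caught early. The two functions are
# placeholders for real SELECT COUNT(*) queries on each side.
def source_count() -> int:
    return 1_042_317   # stand-in for the count on the source system

def target_count() -> int:
    return 1_042_315   # stand-in for the same count on the target

diff = source_count() - target_count()
if diff != 0:
    print(f"Integrity check failed: target is off by {diff} rows")
else:
    print("Source and target row counts match")
```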
It may well be that one thing a CDO needs to get going is a data transformation programme. This may purely be focused on cultural aspects of how an organisation records, shares and otherwise uses data. It may be to build a new (or a first) Data Architecture.