The Race For Data Quality In A Medallion Architecture The Medallion architecture pattern is gaining traction among data teams. It is a layered approach to managing and transforming data. It sounds great, but how do you prove the data is correct at each layer? How do you ensure data quality in every layer?
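To make the layered idea concrete, here is a minimal sketch of per-layer quality gates, assuming pandas DataFrames and hypothetical column names (order_id, daily_revenue); the rules per layer are illustrative, not a prescribed standard:

```python
import pandas as pd

# Hypothetical checks for a medallion (bronze/silver/gold) pipeline.
# Column names and rules are illustrative assumptions.

def check_bronze(df: pd.DataFrame) -> list[str]:
    """Raw landing layer: only verify the data arrived at all."""
    issues = []
    if df.empty:
        issues.append("bronze: no rows ingested")
    return issues

def check_silver(df: pd.DataFrame) -> list[str]:
    """Cleaned layer: enforce keys and basic validity."""
    issues = []
    if df["order_id"].isna().any():
        issues.append("silver: null order_id values")
    if df["order_id"].duplicated().any():
        issues.append("silver: duplicate order_id values")
    return issues

def check_gold(df: pd.DataFrame) -> list[str]:
    """Business layer: validate aggregates against expectations."""
    issues = []
    if (df["daily_revenue"] < 0).any():
        issues.append("gold: negative daily_revenue")
    return issues
```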
With the growing emphasis on data, organizations are constantly seeking more efficient and agile ways to integrate their data, especially from a wide variety of applications. Zero-ETL is a set of fully managed integrations by AWS that minimizes the need to build ETL data pipelines.
So it's not surprising that 70% of developers say they're having problems integrating AI agents with their existing systems. The problem is that, before AI agents can be integrated into a company's infrastructure, that infrastructure must be brought up to modern standards. Not all of that is gen AI, though.
As technology and business leaders, your strategic initiatives, from AI-powered decision-making to predictive insights and personalized experiences, are all fueled by data. Yet, despite growing investments in advanced analytics and AI, organizations continue to grapple with a persistent and often underestimated challenge: poor data quality.
Maintaining quality and trust is a perennial data management challenge, the importance of which has come into sharper focus in recent years thanks to the rise of artificial intelligence (AI). With the aim of rectifying that situation, Bigeye’s founders set out to build a business around data observability.
Talend is a data integration and management software company that offers applications for cloud computing, big data integration, application integration, data quality and master data management. Its code generation architecture uses a visual interface to create Java or SQL code.
Originally applied to manufacturing, this principle holds profound relevance in today's data-driven world. The idea is simple yet powerful: investing in quality upfront yields significant savings because it eliminates the need to fix problems after they occur. How about data quality?
Question: What is the difference between Data Quality and Observability in DataOps? Data Quality is static. It is the measure of data sets at any point in time. A financial analogy: Data Quality is your Balance Sheet, Data Observability is your Cash Flow Statement.
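A minimal sketch of the distinction, assuming a pandas DataFrame and a hypothetical metrics history: the quality score is a point-in-time snapshot, while observability tracks that score run after run:

```python
import pandas as pd

df = pd.DataFrame({"amount": [10.0, 12.5, None, 9.9]})

# Data quality: a point-in-time measurement, like a balance sheet.
completeness_now = df["amount"].notna().mean()

# Data observability: the same metric tracked across runs, like a cash
# flow statement. (In practice this history would live in a metrics store.)
history = []  # one entry per pipeline run
history.append({"run": "2024-01-01", "completeness": completeness_now})
# Later runs append here, and alerts fire when the trend degrades.
```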
In the age of big data, where information is generated at an unprecedented rate, the ability to integrate and manage diverse data sources has become a critical business imperative. Traditional data integration methods are often cumbersome, time-consuming, and unable to keep up with the rapidly evolving data landscape.
Thousands of organizations build data integration pipelines to extract and transform data. They establish data quality rules to ensure the extracted data is of high quality for accurate business decisions. But hard-coded rules age: after a few months, daily sales surpassed 2 million dollars, rendering the threshold obsolete.
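One way to avoid that failure mode is to derive the threshold from recent history rather than hard-coding it; the sales figures and the three-sigma rule in this sketch are illustrative assumptions:

```python
import statistics

# Replace a fixed sales threshold with one derived from recent history,
# so business growth doesn't silently invalidate the rule.
daily_sales = [1.1e6, 1.3e6, 1.2e6, 1.9e6, 2.1e6]  # hypothetical figures

window = daily_sales[-30:]          # most recent observations
mean = statistics.mean(window)
stdev = statistics.stdev(window)

def is_anomalous(value: float, sigmas: float = 3.0) -> bool:
    """Flag values more than `sigmas` standard deviations from the mean."""
    return abs(value - mean) > sigmas * stdev

print(is_anomalous(5.0e6))  # True: far outside recent behaviour
print(is_anomalous(2.0e6))  # False: consistent with recent growth
```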
Machine learning solutions for data integration, cleaning, and data generation are beginning to emerge. “AI starts with ‘good’ data” is a statement that receives wide agreement from data scientists, analysts, and business owners. Data integration and cleaning. Data unification and integration.
On top of that, they are storing data in IT environments that are increasingly complex, including in the cloud and on mainframes, sometimes simultaneously, all while needing to ensure proper security and compliance. All of this complexity creates a challenge: how do companies ensure their data landscape is ready for the future?
We live in a world of data: There’s more of it than ever before, in a ceaselessly expanding array of forms and locations. Dealing with Data is your window into the ways data teams are tackling the challenges of this new world to help their companies and their customers thrive. What is data integrity?
However, your data integrity practices are just as vital. But what exactly is data integrity? How can data integrity be damaged? And why does data integrity matter? What is data integrity? Indeed, without data integrity, decision-making can be as good as guesswork.
Companies are no longer wondering if data visualizations improve analyses but what is the best way to tell each data-story. 2020 will be the year of data quality management and data discovery: clean and secure data combined with a simple and powerful presentation. 1) Data Quality Management (DQM).
When we talk about data integrity, we’re referring to the overarching completeness, accuracy, consistency, accessibility, and security of an organization’s data. Together, these factors determine the reliability of the organization’s data.
We are excited to announce the General Availability of AWS Glue Data Quality. Our journey started by working backward from our customers who create, manage, and operate data lakes and data warehouses for analytics and machine learning. It takes days for data engineers to identify and implement data quality rules.
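As a rough sketch of what such rules can look like, here is a small DQDL ruleset registered through the boto3 Glue client; the table and database names are hypothetical, and the exact rules and call parameters would need to be checked against the current AWS documentation:

```python
import boto3

glue = boto3.client("glue")

# A small DQDL ruleset; "orders" and "sales_db" are hypothetical names.
ruleset = """
Rules = [
    IsComplete "order_id",
    Uniqueness "order_id" > 0.99,
    ColumnValues "amount" > 0
]
"""

glue.create_data_quality_ruleset(
    Name="orders-basic-checks",
    Description="Baseline quality rules for the orders table",
    Ruleset=ruleset,
    TargetTable={"TableName": "orders", "DatabaseName": "sales_db"},
)
```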
DataOps needs a directed graph-based workflow that contains all the data access, integration, model and visualization steps in the data analytic production process. It orchestrates complex pipelines, toolchains, and tests across teams, locations, and data centers. OwlDQ: Predictive data quality.
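A directed-graph workflow can be sketched in a few lines with Python's standard-library graphlib; the step names and bodies below are placeholders for real extract, integration, test, model, and visualization tasks:

```python
from graphlib import TopologicalSorter

# Minimal sketch of a directed-graph workflow: each step runs only after
# its upstream dependencies have completed.

def extract():   print("extract: pull source data")
def integrate(): print("integrate: join sources")
def test():      print("test: run data quality checks")
def model():     print("model: build features and models")
def visualize(): print("visualize: publish dashboards")

# Each node maps to the set of steps it depends on.
dag = {
    "integrate": {"extract"},
    "test": {"integrate"},
    "model": {"test"},
    "visualize": {"model"},
}
steps = {"extract": extract, "integrate": integrate, "test": test,
         "model": model, "visualize": visualize}

for name in TopologicalSorter(dag).static_order():
    steps[name]()
```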
Hundreds of thousands of organizations build data integration pipelines to extract and transform data. They establish data quality rules to ensure the extracted data is of high quality for accurate business decisions.
When internal resources fall short, companies outsource data engineering and analytics. Large enterprises integrate hundreds or thousands of asynchronous data sources into a web of pipelines that flow into visualizations and purpose-built databases that support self-service analysis. Here is where the loss of control begins.
They’re taking data they’ve historically used for analytics or business reporting and putting it to work in machine learning (ML) models and AI-powered applications. Data teams struggle to find a unified approach that enables effortless discovery, understanding, and assurance of data quality and security across various sources.
It’s also a critical trait for the data assets of your dreams. What is data with integrity? Data integrity is the extent to which you can rely on a given set of data for use in decision-making. Where can data integrity fall short? Too much or too little access to data systems.
Several weeks ago (prior to the Omicron wave), I got to attend my first conference in roughly two years: Dataversity’s Data Quality and Information Quality Conference. Ryan Doupe, Chief Data Officer of American Fidelity, held a thought-provoking session that resonated with me. Step 2: Data Definitions.
AWS Glue is a serverless data integration service that makes it simple to discover, prepare, and combine data for analytics, machine learning (ML), and application development. Hundreds of thousands of customers use data lakes for analytics and ML to make data-driven business decisions.
This would be a straightforward task were it not for the fact that, during the digital era, there has been an explosion of data – collected and stored everywhere – much of it poorly governed, ill-understood, and irrelevant. Data Centricity. There is evidence to suggest that there is a blind spot when it comes to data in the AI context.
A data catalog serves the same purpose. By using metadata (or short descriptions), data catalogs help companies gather, organize, retrieve, and manage information. You can think of a data catalog as an enhanced Access database or library card catalog system. It helps you locate and discover data that fit your search criteria.
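A toy sketch of that card-catalog idea, with illustrative dataset names, owners, and tags, might look like this:

```python
from dataclasses import dataclass, field

# Toy catalog entry: the metadata that makes a dataset findable.
@dataclass
class CatalogEntry:
    name: str
    description: str
    owner: str
    tags: set[str] = field(default_factory=set)

catalog = [
    CatalogEntry("orders", "Daily order transactions", "sales-team", {"sales", "pii"}),
    CatalogEntry("shipments", "Carrier shipment events", "logistics", {"sales"}),
]

def search(tag: str) -> list[CatalogEntry]:
    """Locate datasets whose metadata matches the search criterion."""
    return [e for e in catalog if tag in e.tags]

print([e.name for e in search("sales")])  # ['orders', 'shipments']
```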
Have you ever experienced that sinking feeling, where you sense if you don’t find data quality, then data quality will find you? I hope that you enjoy reading this blog post, but most important, I hope you always remember: “Data are friends, not food.” Data Silos. “Good feeling’s gone—AHH!”
“[Our firm’s leaders] wanted to make sure there were guidelines in place to protect the company, its data, and its people.” “The CIO is at the nexus of those conversations,” says Tim Crawford, CIO strategic adviser at Los Angeles-based IT advisory firm AVOA. Which business cases actually need AI?
Today, data is more valuable to a company than it’s ever been. It can help you make data-driven decisions which can improve business performance, boost revenue, and improve efficiencies. But in the four years since it came into force, have companies reached their full potential for data integrity? What is data integrity?
This is where the true power of complete data observability comes into play, and it’s time to get acquainted with its two critical parts: ‘Data in Place’ and ‘Data in Use.’ What is Data in Place? There are multiple locations where problems can happen in a data and analytic system.
What is Data Quality? Data quality is defined as the degree to which data meets a company’s expectations of accuracy, validity, completeness, and consistency. By tracking data quality, a business can pinpoint potential issues harming quality, and ensure that shared data is fit to be used for a given purpose.
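As an illustration of those dimensions, here is a minimal sketch that scores completeness, validity, and consistency on a pandas DataFrame; the column names, regex, and reference set are assumptions:

```python
import re
import pandas as pd

df = pd.DataFrame({
    "email": ["a@x.com", None, "not-an-email"],
    "country": ["US", "US", "usa"],
})

# Completeness: share of non-null values.
completeness = df["email"].notna().mean()

# Validity: share of values matching an expected pattern.
pattern = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")
validity = df["email"].dropna().map(lambda v: bool(pattern.match(v))).mean()

# Consistency: share of values drawn from an agreed reference set.
consistency = df["country"].isin({"US", "CA", "MX"}).mean()

print(completeness, validity, consistency)  # ~0.667, 0.5, ~0.667
```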
Extrinsic Control Deficit: Many of these changes stem from tools and processes beyond the immediate control of the data team. Unregulated ETL/ELT Processes: The absence of stringent data quality tests in ETL (Extract, Transform, Load) or ELT (Extract, Load, Transform) processes further exacerbates the problem.
Collibra was founded in 2008 by Chief Executive Officer Felix Van de Maele and Chief Data Citizen Stijn Christiaens. Removing barriers that prevent or delay users from gaining access to data enables it to be treated as a product that is generated and consumed internally by workers or externally by partners and customers.
Data errors also affect careers. If you have been in the data profession for any length of time, you probably know what it means to face a mob of stakeholders who are angry about inaccurate or late analytics. Some of the DataOps best practices and industry discussion around errors have coalesced around the term “data observability.”
Poor data quality is one of the top barriers faced by organizations aspiring to be more data-driven. Ill-timed business decisions and misinformed business processes, missed revenue opportunities, failed business initiatives and complex data systems can all stem from data quality issues.
Their terminal operations rely heavily on seamless data flows and the management of vast volumes of data. Recently, EUROGATE has developed a digital twin for its container terminal Hamburg (CTH), generating millions of data points every second from Internet of Things (IoT) devices attached to its container handling equipment (CHE).
Data contracts are a new idea for data and analytics teams: they ensure that data is transmitted accurately and consistently between different systems or teams. One of the primary benefits of using data contracts is that they help to ensure data integrity and compatibility.
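One common way to express such a contract (an assumption here, not a method prescribed by the article) is a JSON Schema enforced at the producer boundary, for example with the jsonschema library:

```python
from jsonschema import validate, ValidationError

# A data contract expressed as JSON Schema; field names are illustrative.
ORDER_CONTRACT = {
    "type": "object",
    "required": ["order_id", "amount", "currency"],
    "properties": {
        "order_id": {"type": "string"},
        "amount": {"type": "number", "minimum": 0},
        "currency": {"type": "string", "enum": ["USD", "EUR"]},
    },
}

def accept(record: dict) -> bool:
    """Producer-side gate: only records honouring the contract are emitted."""
    try:
        validate(instance=record, schema=ORDER_CONTRACT)
        return True
    except ValidationError:
        return False

print(accept({"order_id": "A1", "amount": 9.5, "currency": "USD"}))  # True
print(accept({"order_id": "A2", "amount": -3, "currency": "USD"}))   # False
```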
The Second of Five Use Cases in Data Observability: Data Evaluation. This involves evaluating and cleansing new datasets before they are added to production. This process is critical as it ensures data quality from the onset. Examples include regular loading of CRM data and anomaly detection.
By implementing automated validation, AI-driven regression testing, real-time canary pipelines, synthetic data generation, freshness enforcement, KPI tracking, and CI/CD automation, organizations can shift from reactive data observability to proactive data quality assurance.
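As a small example of one of those practices, freshness enforcement, the sketch below fails a pipeline step when data exceeds an assumed staleness budget; the six-hour limit is an illustrative value, not a standard:

```python
from datetime import datetime, timedelta, timezone

# Fail the pipeline when a table has not been updated within its agreed
# service level. The staleness budget here is an assumption.
MAX_STALENESS = timedelta(hours=6)

def assert_fresh(last_loaded_at: datetime) -> None:
    age = datetime.now(timezone.utc) - last_loaded_at
    if age > MAX_STALENESS:
        raise RuntimeError(f"data is stale: last load {age} ago")

# In a CI/CD gate or canary pipeline, this runs before downstream steps:
assert_fresh(datetime.now(timezone.utc) - timedelta(hours=1))  # passes
```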
Now, picture doing that with a mountain of data. Infused with the magic of artificial intelligence (AI), DataLark revolutionizes data migration, making it faster, more efficient, and surprisingly painless. It involves shifting massive amounts of data from outdated legacy systems to a sleek, modern ERP platform.
Salesforce closes acquisition of MuleSoft – May 2018 (business app vendor acquires data integration). But the recent spate of acquisitions just proves the point: the work of data and analytics governance remains an afterthought, even for these large vendors (just as it does for many of their prospects).
In a sea of questionable data, how do you know what to trust? Data quality tells you the answer. It signals what data is trustworthy, reliable, and safe to use. It empowers engineers to oversee data pipelines that deliver trusted data to the wider organization. Today, as part of its 2022.2
are only starting to exist; one big task over the next two years is developing the IDEs for machine learning, plus other tools for data management, pipeline management, data cleaning, data provenance, and data lineage. GitHub is an excellent tool for managing code, but we need to think about [code+data].
As the pioneer in the DataOps category, we are proud to have laid the groundwork for what has become an essential approach to managing data operations in today’s fast-paced business environment. At DataKitchen, we think of this as a ‘meta-orchestration’ of the code and tools acting upon the data.