The Race For Data Quality In A Medallion Architecture. The Medallion architecture pattern is gaining traction among data teams. It is a layered approach to managing and transforming data. It sounds great, but how do you prove the data is correct at each layer? How do you ensure data quality in every layer?
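For illustration, here is a minimal sketch of what layer-by-layer verification could look like, with checks run before data is promoted to the next layer; the columns and rules are assumptions, not taken from the post:

```python
import pandas as pd

def bronze_checks(df: pd.DataFrame) -> list[str]:
    """Structural checks on raw, ingested data."""
    errors = []
    if df.empty:
        errors.append("bronze: no rows ingested")
    if df["order_id"].isna().any():
        errors.append("bronze: null order_id values")
    return errors

def silver_checks(df: pd.DataFrame) -> list[str]:
    """Business-rule checks on cleaned, conformed data."""
    errors = []
    if df["order_id"].duplicated().any():
        errors.append("silver: duplicate order_id values")
    if (df["amount"] < 0).any():
        errors.append("silver: negative order amounts")
    return errors

def promote(df: pd.DataFrame, checks) -> pd.DataFrame:
    """Promote data to the next layer only if its checks pass."""
    errors = checks(df)
    if errors:
        raise ValueError("; ".join(errors))
    return df
```

Each layer gets its own contract, so a bad record is caught before it propagates into the gold layer.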
Data Observability and Data Quality Testing Certification Series. We are excited to invite you to a free four-part webinar series that will elevate your understanding and skills in Data Observability and Data Quality Testing. Slides and recordings will be provided.
With the growing emphasis on data, organizations are constantly seeking more efficient and agile ways to integrate their data, especially from a wide variety of applications. We take care of the ETL for you by automating the creation and management of data replication. What’s the difference between zero-ETL and Glue ETL?
They made us realise that building systems, processes, and procedures to ensure quality is built in at the outset is far more cost-effective than correcting mistakes once made. How about data quality? Redman and David Sammon propose an interesting (and simple) exercise to measure data quality.
Thousands of organizations build data integration pipelines to extract and transform data. They establish data quality rules to ensure the extracted data is of high quality for accurate business decisions. After a few months, daily sales surpassed 2 million dollars, rendering the threshold obsolete.
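That obsolescence is the weakness of hard-coded rules. One hedged alternative, sketched below with assumed column names and a 30-day window, is to derive the threshold from recent history so it moves with the business:

```python
import pandas as pd

def sales_anomalies(daily_sales: pd.Series, window: int = 30, sigmas: float = 3.0) -> pd.Series:
    """Flag days that deviate from a rolling baseline instead of a fixed dollar threshold."""
    baseline = daily_sales.rolling(window).mean()
    spread = daily_sales.rolling(window).std()
    return (daily_sales - baseline).abs() > sigmas * spread
```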
Question: What is the difference between Data Quality and Observability in DataOps? Data Quality is static. It is the measure of data sets at any point in time. A financial analogy: Data Quality is your Balance Sheet, Data Observability is your Cash Flow Statement.
DataOps automation typically involves the use of tools and technologies to automate the various steps of the data analytics and machine learning process, from data preparation and cleaning to model training and deployment. By using DataOps, organizations can improve. Question: When do DataOps?
We live in a world of data: There’s more of it than ever before, in a ceaselessly expanding array of forms and locations. Dealing with Data is your window into the ways data teams are tackling the challenges of this new world to help their companies and their customers thrive. What is data integrity?
Data teams struggle to find a unified approach that enables effortless discovery, understanding, and assurance of data quality and security across various sources. Having confidence in your data is key. Automate data profiling and data quality recommendations, monitor data quality rules, and receive alerts.
When we talk about data integrity, we’re referring to the overarching completeness, accuracy, consistency, accessibility, and security of an organization’s data. Together, these factors determine the reliability of the organization’s data. In short, yes.
We are excited to announce the General Availability of AWS Glue Data Quality. Our journey started by working backward from our customers who create, manage, and operate data lakes and data warehouses for analytics and machine learning. It takes days for data engineers to identify and implement data quality rules.
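As a sketch of how such rules are expressed, AWS Glue Data Quality uses the Data Quality Definition Language (DQDL), and rulesets can be registered through the Glue API; the database, table, and rules below are illustrative assumptions:

```python
import boto3

glue = boto3.client("glue")

# An illustrative DQDL ruleset for a hypothetical sales table.
ruleset = """
Rules = [
    IsComplete "order_id",
    ColumnValues "amount" > 0,
    RowCount > 1000
]
"""

glue.create_data_quality_ruleset(
    Name="sales-basic-checks",
    Ruleset=ruleset,
    TargetTable={"DatabaseName": "sales_db", "TableName": "daily_orders"},
)
```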
Companies are no longer wondering if data visualizations improve analyses but what is the best way to tell each data-story. 2020 will be the year of data quality management and data discovery: clean and secure data combined with a simple and powerful presentation. 1) Data Quality Management (DQM).
Read the complete blog below for a more detailed description of the vendors and their capabilities. This is not surprising given that DataOps enables enterprise data teams to generate significant business value from their data. QuerySurge – Continuously detect data issues in your delivery pipelines. Data breaks.
Several weeks ago (prior to the Omicron wave), I got to attend my first conference in roughly two years: Dataversity’s Data Quality and Information Quality Conference. Ryan Doupe, Chief Data Officer of American Fidelity, held a thought-provoking session that resonated with me. Step 2: Data Definitions.
Ensuring that data is available, secure, correct, and fit for purpose is neither simple nor cheap. Companies end up paying outside consultants enormous fees while still having to suffer the effects of poor data quality and lengthy cycle time. For example, DataOps can be used to automate data integration.
We have identified the top ten sites, videos, or podcasts online that deal with data lineage. Our list of Top 10 Data Lineage Podcasts, Blogs, and Websites To Follow in 2021. Data Engineering Podcast. This podcast centers around data management and investigates a different aspect of this field each week.
Have you ever experienced that sinking feeling, where you sense if you don’t find data quality, then data quality will find you? I hope that you enjoy reading this blog post, but most important, I hope you always remember: “Data are friends, not food.” Data Silos. Data Cleansing.
But in the four years since it came into force, have companies reached their full potential for data integrity? But firstly, we need to look at how we define data integrity. What is data integrity? Many confuse data integrity with data quality. Is integrity a universal truth?
Data ingestion monitoring, a critical aspect of Data Observability, plays a pivotal role by providing continuous updates and ensuring high-quality data feeds into your systems. This process is critical as it ensures data quality from the onset. Ensuring all data arrives on time and is of the right quality.
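A minimal sketch of the two checks ingestion monitoring usually starts with, freshness and volume, with the thresholds assumed for illustration:

```python
from datetime import datetime, timedelta, timezone

def check_feed(last_arrival: datetime, row_count: int,
               max_lag: timedelta = timedelta(hours=1),
               min_rows: int = 1000) -> list[str]:
    """Return alerts if an ingestion feed is late or suspiciously small."""
    alerts = []
    lag = datetime.now(timezone.utc) - last_arrival  # last_arrival must be timezone-aware
    if lag > max_lag:
        alerts.append(f"feed is {lag} old, allowed lag is {max_lag}")
    if row_count < min_rows:
        alerts.append(f"only {row_count} rows arrived, expected at least {min_rows}")
    return alerts
```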
How Can I Ensure Data Quality and Gain Data Insight Using Augmented Analytics? There are many business issues surrounding the use of data to make decisions. One such issue is the inability of an organization to gather and analyze data.
These layers help teams delineate different stages of data processing, storage, and access, offering a structured approach to data management. In the context of Data in Place, validating data quality automatically with Business Domain Tests is imperative for ensuring the trustworthiness of your data assets.
Extrinsic Control Deficit: Many of these changes stem from tools and processes beyond the immediate control of the data team. Unregulated ETL/ELT Processes: The absence of stringent data quality tests in ETL (Extract, Transform, Load) or ELT (Extract, Load, Transform) processes further exacerbates the problem.
Poor data quality is one of the top barriers faced by organizations aspiring to be more data-driven. Ill-timed business decisions and misinformed business processes, missed revenue opportunities, failed business initiatives and complex data systems can all stem from data quality issues.
The Five Use Cases in Data Observability: Mastering Data Production (#3). Introduction: Managing the production phase of data analytics is a daunting challenge. Overseeing multi-tool, multi-dataset, and multi-hop data processes ensures high-quality outputs. Is the business logic producing correct outcomes?
Companies rely heavily on data and analytics to find and retain talent, drive engagement, improve productivity and more across enterprise talent management. However, analytics are only as good as the quality of the data, which must be error-free, trustworthy and transparent. What is data quality? million each year.
In a sea of questionable data, how do you know what to trust? Data quality tells you the answer. It signals what data is trustworthy, reliable, and safe to use. It empowers engineers to oversee data pipelines that deliver trusted data to the wider organization. Today, as part of its 2022.2
Data is the new oil and organizations of all stripes are tapping this resource to fuel growth. However, data quality and consistency are among the top barriers faced by organizations in their quest to become more data-driven. Unlock quality data with IBM. and its leading data observability offerings.
Data contracts are a new idea in data and analytics team development to ensure that data is transmitted accurately and consistently between different systems or teams. One of the primary benefits of using data contracts is that they help to ensure data integrity and compatibility.
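To make the idea concrete, a contract can be written down as a schema that the producing team validates against before publishing; this sketch uses pydantic, and the event fields are assumptions:

```python
from datetime import datetime
from pydantic import BaseModel, ValidationError

class OrderEvent(BaseModel):
    """The agreed shape of order events exchanged between teams."""
    order_id: str
    amount: float
    created_at: datetime

def publish(payload: dict) -> OrderEvent:
    """Reject any payload that would break the contract for downstream consumers."""
    try:
        return OrderEvent(**payload)
    except ValidationError as exc:
        raise RuntimeError(f"contract violation: {exc}") from exc
```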
In today’s data-driven landscape, the integration of raw source data into usable business objects is a pivotal step in ensuring that organizations can make informed decisions and maximize the value of their data assets. To achieve these goals, a well-structured.
One surprising statistic from the Rand Corporation is that 80% of artificial intelligence (AI). The post How Do You Know When You’re Ready for AI? appeared first on Data Management Blog - Data Integration and Modern Data Management Articles, Analysis and Information.
What is Data Quality? Data quality is defined as: the degree to which data meets a company’s expectations of accuracy, validity, completeness, and consistency. By tracking data quality, a business can pinpoint potential issues harming quality, and ensure that shared data is fit to be used for a given purpose.
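Those dimensions only become trackable once they are computed; here is a hedged pandas sketch, with the key column and validity rule assumed for illustration:

```python
import pandas as pd

def quality_report(df: pd.DataFrame) -> dict:
    """Score a dataset from 0.0 to 1.0 on three of the classic dimensions."""
    return {
        # Completeness: share of non-null cells across all columns.
        "completeness": float(df.notna().mean().mean()),
        # Validity: share of rows passing an assumed domain rule.
        "validity": float((df["amount"] >= 0).mean()),
        # Consistency: share of rows whose key is not duplicated.
        "consistency": 1.0 - float(df["order_id"].duplicated().mean()),
    }
```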
And if it isn’t changing, it’s likely not being used within our organizations, so why would we use stagnant data to facilitate our use of AI? The key is understanding not IF, but HOW, our data fluctuates, and data observability can help us do just that. And let’s not forget about the controls.
Make sure the data and the artifacts that you create from data are correct before your customer sees them. It’s not about data quality. In governance, people sometimes perform manual data quality assessments. It’s not only about the data. Data Quality. Location Balance Tests.
At DataKitchen, we think of this as a ‘meta-orchestration’ of the code and tools acting upon the data. Data Pipeline Observability: Optimizes pipelines by monitoring data quality, detecting issues, tracing data lineage, and identifying anomalies using live and historical metadata.
When making decisions that are critical to national security, governments rely on data, and those that leverage the cutting-edge technology of generative AI foundation models will have a distinct advantage over their adversaries. Pros and Cons of generative AI.
A data fabric is an architectural approach that enables organizations to simplify data access and data governance across a hybrid multicloud landscape for better 360-degree views of the customer and enhanced MLOps and trustworthy AI. The post What is a data fabric architecture? appeared first on Journey to AI Blog.
Gartner predicts that by the end of this year, 30%. The post O’Reilly Releases First Chapters of a New Book about Logical Data Management appeared first on Data Management Blog - Data Integration and Modern Data Management Articles, Analysis and Information.
The next step is to link the data graph to the shapes graph: ex:TolkienDragonShape sh:shapesGraph ex:TolkienShapesGraph. This technique can be especially useful in data integration projects where you are combining related, potentially overlapping data from multiple sources. Ontotext’s GraphDB. Give it a try today!
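For a runnable illustration of validating a data graph against a shapes graph, here is a sketch using the pySHACL library; the file names and the RDFS inference setting are assumptions:

```python
from rdflib import Graph
from pyshacl import validate

data_graph = Graph().parse("tolkien_data.ttl", format="turtle")
shapes_graph = Graph().parse("tolkien_shapes.ttl", format="turtle")

# Validate the data graph against the SHACL shapes graph.
conforms, _results_graph, results_text = validate(
    data_graph,
    shacl_graph=shapes_graph,
    inference="rdfs",  # expand RDFS entailments before validating
)
print(conforms)
print(results_text)
```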
Deploying a Data Journey Instance unique to each customer’s payload is vital to fill this gap. Such an instance answers the critical question of ‘Dude, where is my data?’ while maintaining operational efficiency and ensuring data quality—thus preserving customer satisfaction and the team’s credibility.
It addresses many of the shortcomings of traditional data lakes by providing features such as ACID transactions, schema evolution, row-level updates and deletes, and time travel. In this blog post, we’ll discuss how the metadata layer of Apache Iceberg can be used to make data lakes more efficient.
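To make the metadata layer concrete, here is a hedged PySpark sketch of time travel and a row-level delete against an Iceberg table; the catalog, table, and snapshot id are illustrative:

```python
from pyspark.sql import SparkSession

# Assumes a Spark session already configured with an Iceberg catalog named "demo".
spark = SparkSession.builder.appName("iceberg-metadata").getOrCreate()

# Time travel: read the table as of an earlier snapshot,
# resolved entirely from Iceberg's metadata layer.
df_past = (
    spark.read.format("iceberg")
    .option("snapshot-id", 10963874102873)  # illustrative snapshot id
    .load("demo.db.orders")
)

# A row-level delete, committed as a new snapshot under ACID guarantees.
spark.sql("DELETE FROM demo.db.orders WHERE status = 'CANCELLED'")
```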
However, the foundation of their success rests not just on sophisticated algorithms or computational power but on the quality and integrity of the data they are trained on and interact with. The Imperative of Data Quality Validation Testing. Data quality validation testing is not just a best practice; it’s imperative.
This blog post is co-written with Hardeep Randhawa and Abhay Kumar from HPE. AWS Transfer Family seamlessly integrates with other AWS services, automates transfer, and makes sure data is protected with encryption and access controls. HPE Aruba Networking is the industry leader in wired, wireless, and network security solutions.
It involves establishing policies and processes to ensure information can be integrated, accessed, shared, linked, analyzed and maintained across an organization. Better data quality. It harvests metadata from various data sources, maps any data element from source to target, and harmonizes data integration across platforms.
Many large organizations, in their desire to modernize with technology, have acquired several different systems with various data entry points and transformation rules for data as it moves into and across the organization. Seeing data pipelines and information flows further supports compliance efforts. Data Quality.