1) What Is Data Quality Management? 4) Data Quality Best Practices. 5) How Do You Measure Data Quality? 6) Data Quality Metrics Examples. 7) Data Quality Control: Use Case. 8) The Consequences Of Bad Data Quality. 9) 3 Sources Of Low-Quality Data.
As technology and business leaders, your strategic initiatives, from AI-powered decision-making to predictive insights and personalized experiences, are all fueled by data. Yet, despite growing investments in advanced analytics and AI, organizations continue to grapple with a persistent and often underestimated challenge: poor data quality.
Concurrent UPDATE/DELETE on overlapping partitions: When multiple processes attempt to modify the same partition simultaneously, data conflicts can arise. For example, imagine a data quality process updating customer records with corrected addresses while another process is deleting outdated customer records.
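Below is a minimal sketch of how such a conflict plays out under optimistic concurrency, the commit-and-retry model used by table formats like Apache Iceberg: each writer commits only if the partition snapshot it read is still current, and retries otherwise. All names here (the jobs, the partition key, the `commit` helper) are illustrative, not any particular engine's API.

```python
import random
import threading
import time

class CommitConflict(Exception):
    """Raised when another writer committed to the same partition first."""

# Shared state standing in for a table's per-partition snapshot version.
snapshot_lock = threading.Lock()
partition_snapshot = {"customers/2024-06": 0}

def commit(partition: str, expected_snapshot: int) -> None:
    """Optimistic commit: succeed only if nobody else committed in between."""
    with snapshot_lock:
        if partition_snapshot[partition] != expected_snapshot:
            raise CommitConflict(partition)
        partition_snapshot[partition] += 1

def write_with_retry(name: str, partition: str, attempts: int = 5) -> None:
    """Read the snapshot, do the work, and retry the commit on conflict."""
    for attempt in range(attempts):
        seen = partition_snapshot[partition]
        time.sleep(random.uniform(0.01, 0.05))  # simulated UPDATE/DELETE work
        try:
            commit(partition, seen)
            print(f"{name}: committed on attempt {attempt + 1}")
            return
        except CommitConflict:
            print(f"{name}: conflict on attempt {attempt + 1}, retrying")
    raise RuntimeError(f"{name}: gave up after {attempts} attempts")

# Two processes touching the same partition: an address-correction job and a
# stale-record purge, as in the example above.
jobs = [
    threading.Thread(target=write_with_retry, args=("update_addresses", "customers/2024-06")),
    threading.Thread(target=write_with_retry, args=("purge_stale_customers", "customers/2024-06")),
]
for j in jobs: j.start()
for j in jobs: j.join()
```

Whichever job commits second sees the conflict and retries against the new snapshot, which is what prevents the two writers from silently overwriting each other's changes.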
Data collections are the ones and zeroes that encode the actionable insights (patterns, trends, relationships) that we seek to extract from our data through machine learning and data science. Source: [link] SAP also announced key partners that further enhance Datasphere as a powerful business data fabric.
Metadata management is key to wringing all the value possible from data assets. However, most organizations don’t use all the data at their disposal to reach deeper conclusions about how to drive revenue, achieve regulatory compliance or accomplish other strategic objectives. What Is Metadata? Harvest data.
Once the province of the data warehouse team, data management has increasingly become a C-suite priority, with data quality seen as key for both customer experience and business performance. But along with siloed data and compliance concerns, poor data quality is holding back enterprise AI projects.
When an organization's data governance and metadata management programs work in harmony, then everything is easier. Data governance is a complex but critical practice. There's always more data to handle, much of it unstructured; more data sources, like IoT; more points of integration; and more regulatory compliance requirements.
It addresses many of the shortcomings of traditional data lakes by providing features such as ACID transactions, schema evolution, row-level updates and deletes, and time travel. In this blog post, we’ll discuss how the metadata layer of Apache Iceberg can be used to make data lakes more efficient.
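As a concrete illustration, here is a minimal time-travel sketch using the pyiceberg library. The catalog name "default" and the table "db.customers" are stand-ins for your own names, and reading a scan into pandas assumes pyarrow and pandas are installed.

```python
from pyiceberg.catalog import load_catalog

# Assumes a catalog named "default" is configured (e.g. in ~/.pyiceberg.yaml)
# and that a table "db.customers" exists -- both names are illustrative.
catalog = load_catalog("default")
table = catalog.load_table("db.customers")

# The metadata layer records every snapshot the table has ever had.
for snapshot in table.snapshots():
    print(snapshot.snapshot_id, snapshot.timestamp_ms)

# Time travel: scan the table as of an older snapshot instead of the latest.
old_snapshot = table.snapshots()[0]
df = table.scan(snapshot_id=old_snapshot.snapshot_id).to_pandas()
print(df.head())
```

The point is that snapshots, schemas, and data-file statistics all live in the metadata layer, so operations like this never need to rescan the underlying data files to find what changed.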
What Is Metadata? Metadata is information about data. A clothing catalog and a dictionary are both examples of metadata repositories. Indeed, a popular online catalog, like Amazon, offers rich metadata around products to guide shoppers: ratings, reviews, and product details are all examples of metadata.
Untapped data, if mined, represents tremendous potential for your organization. While there has been a lot of talk about big data over the years, the real hero in unlocking the value of enterprise data is metadata , or the data about the data. Metadata Is the Heart of Data Intelligence.
Not Every Graph is a Knowledge Graph: Schemas and Semantic Metadata Matter. To be able to automate these operations and maintain sufficient data quality, enterprises have started implementing so-called data fabrics, which employ diverse metadata sourced from different systems. Examples include provenance (e.g.
Metadata is an important part of data governance, and as a result, most nascent data governance programs are rife with project plans for assessing and documenting metadata. But are these rampant and often uncontrolled projects to collect metadata properly motivated? What Is Metadata?
generally available on May 24, Alation introduces the Open Data Quality Initiative for the modern data stack, giving customers the freedom to choose the data quality vendor that's best for them with the added confidence that those tools will integrate seamlessly with Alation's Data Catalog and Data Governance application.
Because ChatGPT is built from large language models trained against massive data sets (mostly business documents, internal text repositories, and similar resources within your organization), attention must be given to the stability, accessibility, and reliability of those resources.
A NoSQL database can use documents for the storage and retrieval of data. The central concept is the idea of a document. Documents encompass and encode data (or information) in a standard format. A document is susceptible to change. The documents can be in PDF format. Speaking of which.
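Here is a minimal sketch of document storage and retrieval using MongoDB's pymongo driver; the connection string, database, and field names are illustrative, and it assumes a MongoDB server is reachable locally.

```python
from pymongo import MongoClient

# Connection details are illustrative; assumes a MongoDB server is running.
client = MongoClient("mongodb://localhost:27017")
collection = client["shop"]["customers"]

# A document encodes data in a standard format (here, BSON via a Python dict);
# documents in the same collection need not share a rigid schema.
collection.insert_one({"name": "Ada Lovelace", "email": "ada@example.com", "orders": 3})

# Retrieval by field value.
doc = collection.find_one({"email": "ada@example.com"})
print(doc)

# Documents are susceptible to change: add a field no other document has.
collection.update_one({"email": "ada@example.com"}, {"$set": {"loyalty_tier": "gold"}})
```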
It must be clear to all participants and auditors how and when data-related decisions and controls were introduced into the processes. Data-related decisions, processes, and controls subject to data governance must be auditable. The program must introduce and support standardization of enterprise data.
It will do this, it said, with bidirectional integration between its platform and Salesforce's to seamlessly deliver data governance and end-to-end lineage within Salesforce Data Cloud. "Additional to that, we are also allowing the metadata inside of Alation to be read into these agents."
Anomaly detection is well-known in the financial industry, where it's frequently used to detect fraudulent transactions, but it can also be used to catch and fix data quality issues automatically. If you suddenly see unexpected patterns in your social data, that may mean adversaries are attempting to poison your data sources.
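Below is a minimal sketch of that idea using a simple z-score check; the threshold, the column of daily counts, and the quarantine action are all illustrative, and production systems typically use more robust detectors.

```python
import statistics

def zscore_anomalies(values, threshold=3.0):
    """Flag values more than `threshold` standard deviations from the mean."""
    mean = statistics.fmean(values)
    stdev = statistics.stdev(values)
    if stdev == 0:
        return []
    return [(i, v) for i, v in enumerate(values) if abs(v - mean) / stdev > threshold]

# Daily transaction counts; the final value is an injected quality issue.
daily_counts = [1020, 998, 1011, 1005, 987, 1003, 10500]
for index, value in zscore_anomalies(daily_counts, threshold=2.0):
    print(f"row {index}: {value} looks anomalous -- quarantine or auto-correct")
```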
The results of our new research show that organizations are still trying to master data governance, including adjusting their strategies to address changing priorities and overcoming challenges related to data discovery, preparation, quality and traceability. Top Five: Benefits of An Automation Framework for Data Governance.
An understanding of the data's origins and history helps answer questions about the origin of data in Key Performance Indicator (KPI) reports, including: How are the report tables and columns defined in the metadata? Who are the data owners? Data lineage offers proof that the data provided is reflected accurately.
For example, automatically importing mappings from developers' Excel sheets, flat files, Access and ETL tools into a comprehensive mappings inventory, complete with auto-generated and meaningful documentation of the mappings, is a powerful way to support overall data governance. Data quality is crucial to every organization.
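A minimal sketch of that Excel-harvesting step, using pandas: the file name and column headings below are assumptions about the developer's sheet layout, so adjust them to your actual workbook.

```python
import pandas as pd

# Assumed layout of the developer's sheet: one mapping per row with
# source/target columns -- adjust the headings to your actual workbook.
mappings = pd.read_excel("etl_mappings.xlsx",
                         usecols=["source_table", "source_column",
                                  "target_table", "target_column", "transform"])

# Auto-generate a human-readable description for each mapping.
mappings["documentation"] = (
    mappings["source_table"] + "." + mappings["source_column"]
    + " -> " + mappings["target_table"] + "." + mappings["target_column"]
    + " via " + mappings["transform"].fillna("direct copy")
)

# Append to a comprehensive inventory that governance tools can read.
mappings.to_csv("mappings_inventory.csv", mode="a", index=False, header=False)
```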
A data catalog serves the same purpose. By using metadata (or short descriptions), data catalogs help companies gather, organize, retrieve, and manage information. You can think of a data catalog as an enhanced Access database or library card catalog system. What Does a Data Catalog Do?
Data governance is a critical building block across all these approaches, and we see two emerging areas of focus. First, many LLM use cases rely on enterprise knowledge that needs to be drawn from unstructured data such as documents, transcripts, and images, in addition to structured data from data warehouses.
In natural language processing (NLP) and computational linguistics, the Gold Standard typically represents a corpus of text or a set of documents annotated or tagged with the desired results for the analysis, be it designation of the corresponding part of speech, syntactic parsing, concept or relationship.
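For instance, here is a minimal sketch of scoring a tagger's output against a gold-standard annotation; the tags and the token-level accuracy metric are illustrative.

```python
# Gold-standard part-of-speech tags for a sentence, as annotated by experts.
gold = ["DET", "NOUN", "VERB", "DET", "ADJ", "NOUN"]
# Output of the system under evaluation.
predicted = ["DET", "NOUN", "VERB", "DET", "NOUN", "NOUN"]

# Token-level accuracy: the fraction of tags matching the gold standard.
correct = sum(g == p for g, p in zip(gold, predicted))
accuracy = correct / len(gold)
print(f"accuracy vs. gold standard: {accuracy:.2%}")  # 83.33%
```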
With seven operating centers, nine research facilities, and more than 18,000 staff, the agency continually generates an overwhelming amount of data, which it stores in more than 30 science data repositories across five topical areas — astrophysics, heliophysics, biological science, physical science, earth science, and planetary science.
As organizations become data-driven and awash in an overwhelming amount of data from multiple data sources (AI, IoT, ML, etc.), they will find new ways to get a handle on data quality and focus on data management processes and best practices.
Earlier this year, erwin conducted a research project in partnership with Dataversity, the 2020 State of Data Governance and Automation. We asked participants to “talk to us about data value chain bottlenecks.” Data quality and accuracy are recurring themes as well.
A data catalog benefits organizations in a myriad of ways. With the right data catalog tool, organizations can automate enterprise metadata management – including data cataloging, data mapping, data quality and code generation for faster time to value and greater accuracy for data movement and/or deployment projects.
How to Automate Data Management. Here are our eight recommendations for how to transition from manual to automated data management: 1) Put Data Quality First: Automating and matching business terms with data assets and documenting lineage down to the column level are critical to good decision making.
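A minimal sketch of that term-to-asset matching using Python's difflib; the glossary entries and column names are illustrative, and real tools layer lineage and stewardship on top of the match.

```python
from difflib import get_close_matches

# Business glossary terms and physical column names (illustrative).
glossary = ["customer name", "order date", "net revenue", "shipping address"]
columns = ["cust_name", "order_dt", "net_rev_amt", "ship_addr", "updated_at"]

def normalize(name: str) -> str:
    """Reduce naming-convention noise before matching."""
    return name.lower().replace("_", " ")

# Map normalized forms back to the original column names.
normalized = {normalize(c): c for c in columns}

for term in glossary:
    candidates = get_close_matches(term, list(normalized), n=1, cutoff=0.5)
    match = normalized[candidates[0]] if candidates else "no match"
    print(f"{term!r} -> {match!r}")
```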
Modern, strategic data governance , which involves both IT and the business, enables organizations to plan and document how they will discover and understand their data within context, track its physical existence and lineage, and maximize its security, quality and value. How erwin Can Help.
BCBS 239 is a document published by the Basel Committee on Banking Supervision entitled Principles for Effective Risk Data Aggregation and Risk Reporting. The document, first published in 2013, outlines best practices for global and domestic banks to identify, manage, and report risks, including credit, market, liquidity, and operational risks.
Your organization won’t be able to take complete advantage of analytics tools to become data-driven unless you establish a foundation for agile and complete data management. You need automated data mapping and cataloging through the integration lifecycle process, inclusive of data at rest and data in motion.
What, then, should users look for in a data modeling product to support their governance/intelligence requirements in the data-driven enterprise? Nine Steps to Data Modeling. Provide metadata and schema visualization regardless of where data is stored.
This also includes building an industry-standard integrated data repository as a single source of truth, operational reporting through real-time metrics, data quality monitoring, a 24/7 helpdesk, and revenue forecasting through financial projections and supply availability projections.
Most enterprise data is unstructured or semi-structured: documents and code, as well as images and video. For example, gen AI can be used to extract metadata from documents, create indexes of information and knowledge graphs, and query, summarize, and analyze this data.
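A minimal sketch of that metadata-extraction step: `complete` below is a hypothetical stand-in for whatever LLM client you use (it returns a canned reply so the sketch runs without a model), and the prompt and fields are illustrative.

```python
import json

def complete(prompt: str) -> str:
    """Placeholder for your LLM provider's completion call (hypothetical).
    Returns a canned reply here so the sketch runs without a model."""
    return ('{"title": "Q3 Vendor Contract", "author": "unknown", '
            '"date": "2023-09-14", "topics": ["procurement", "legal"]}')

def extract_metadata(document_text: str) -> dict:
    """Ask the model for structured metadata and parse its JSON reply."""
    prompt = (
        "Extract metadata from the document below. Reply with JSON containing "
        '"title", "author", "date", and "topics" (a list).\n\n' + document_text
    )
    return json.loads(complete(prompt))

print(extract_metadata("AGREEMENT made this 14th day of September..."))
```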
It uncovered a number of obstacles that organizations have to overcome to improve their data operations. The No. 1 bottleneck, according to 62 percent of respondents, was documenting complete data lineage. Overcoming Data Governance Bottlenecks.
Organizations also can use a BPM tool to identify the staff who function as “unofficial data repositories.” Organizations can document employee processes to ensure vital information isn’t lost should an employee choose to leave. The lack of a central metadata repository is a far too common thorn in an organization’s side.
In the modern data stack, dbt is a key tool to make data ready for analysis. Data analysts and engineers use dbt to transform, test, and document data in the cloud data warehouse. Yet every dbt transformation contains vital metadata that is not captured – until now. Conclusion.
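A minimal sketch of pulling that metadata out of the manifest.json artifact dbt writes under target/ after a run; the path and the fields selected are illustrative.

```python
import json
from pathlib import Path

# dbt writes its compiled project metadata here after `dbt run` or
# `dbt docs generate`.
manifest = json.loads(Path("target/manifest.json").read_text())

# Each model node carries the metadata a catalog would want to capture.
for node_id, node in manifest["nodes"].items():
    if node["resource_type"] != "model":
        continue
    print(node["name"], "-", node.get("description", "(no description)"))
    for column, details in node.get("columns", {}).items():
        print("   ", column, "-", details.get("description", ""))
```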
This technique can be especially useful in data integration projects where you are combining related, potentially overlapping data from multiple sources. Remember to set up your shapes graph in a repository that has been configured from the beginning to support SHACL, as described in our documentation.
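For a standalone illustration of shape-based validation outside a SHACL-enabled repository, here is a minimal sketch using the rdflib and pySHACL Python libraries; the shape and data are illustrative.

```python
from rdflib import Graph
from pyshacl import validate

# A shape requiring every Person to have exactly one ex:email (illustrative).
shapes_ttl = """
@prefix sh: <http://www.w3.org/ns/shacl#> .
@prefix ex: <http://example.com/> .
ex:PersonShape a sh:NodeShape ;
    sh:targetClass ex:Person ;
    sh:property [ sh:path ex:email ; sh:minCount 1 ; sh:maxCount 1 ] .
"""

# Merged data from two sources; the second record is missing its email.
data_ttl = """
@prefix ex: <http://example.com/> .
ex:alice a ex:Person ; ex:email "alice@example.com" .
ex:bob a ex:Person .
"""

conforms, _, report_text = validate(
    Graph().parse(data=data_ttl, format="turtle"),
    shacl_graph=Graph().parse(data=shapes_ttl, format="turtle"),
)
print(conforms)      # False: ex:bob violates the minCount constraint
print(report_text)
```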
(BFSI, PHARMA, INSURANCE AND NON-PROFIT) CASE STUDIES FOR AUTOMATED METADATA-DRIVEN AUTOMATION. As well as introducing greater efficiency to the data governance process, automated data mapping tools enable data to be auto-documented from XML that builds mappings for the target repository or reporting structure.
While everyone may subscribe to the same design decisions and agree on an ontology, there may be differences in the data quality. In such situations, data must be validated. Instead, they provide metadata about the shapes. Sometimes there is no room for error.
As the organization receives data from multiple external vendors, it often arrives in different formats, typically Excel or CSV files, with each vendor using their own unique data layout and structure. DataBrew is an excellent tool for data quality and preprocessing. For Matching conditions, choose Match all conditions.
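A minimal pandas sketch of the kind of normalization a DataBrew recipe performs here: read each vendor's file and map its layout onto a shared schema. The file names and column mappings are illustrative.

```python
import pandas as pd

# Each vendor ships the same facts under its own layout (illustrative).
vendor_layouts = {
    "vendor_a.csv": {"CustName": "customer_name", "Amt": "amount"},
    "vendor_b.xlsx": {"customer": "customer_name", "total_usd": "amount"},
}

frames = []
for path, column_map in vendor_layouts.items():
    reader = pd.read_excel if path.endswith(".xlsx") else pd.read_csv
    df = reader(path)
    # Rename to the shared schema and keep only the columns we govern.
    df = df.rename(columns=column_map)[list(column_map.values())]
    frames.append(df)

combined = pd.concat(frames, ignore_index=True)
combined["amount"] = pd.to_numeric(combined["amount"], errors="coerce")
print(combined.dropna(subset=["amount"]).head())
```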
Limiting growth by (data integration) complexity: Most operational IT systems in an enterprise have been developed to serve a single business function, and they use the simplest possible model for this. In both cases, semantic metadata is the glue that turns knowledge graphs into hubs of data, metadata, and content.