Data quality is no longer a back-office concern. Drawing on firsthand experience working with CIOs, CDOs, CTOs, and transformation leaders across industries, this article outlines pragmatic strategies for elevating data quality into an enterprise-wide capability, particularly in complex organizations with mature data capabilities.
Reading Time: 3 minutes While cleaning up our archive recently, I found an old article published in 1976 about data dictionary/directory systems (DD/DS). Nowadays, we no longer use the term DD/DS, but “data catalog” or simply “metadata system”. It was written by L.
Reading Time: 2 minutes As the volume, variety, and velocity of data continue to surge, organizations still struggle to gain meaningful insights. This is where active metadata comes in. What is active metadata? Listen to “Why is Active Metadata Management Essential?” on Spreaker.
Will the new creative, diverse and scalable data pipelines you are building also incorporate the AI governance guardrails needed to manage and limit your organizational risk? We will tackle all these burning questions and more in this article.
Not surprisingly, data integration and ETL were among the top responses, with 60% currently building or evaluating solutions in this area. In an age of data-hungry algorithms, everything really begins with collecting and aggregating data. Data results from a Twitter poll. Metadata and artifacts needed for audits.
While this is a technically demanding task, the advent of ‘Payload’ Data Journeys (DJs) offers a targeted approach to meet the increasingly specific demands of Data Consumers. Payload DJs facilitate capturing metadata, lineage, and test results at each phase, enhancing tracking efficiency and reducing the risk of data loss.
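The idea of capturing metadata, lineage, and test results at each phase can be sketched in a few lines. This is a minimal illustration, not the actual Payload DJ implementation; all class and field names here are hypothetical:

```python
from dataclasses import dataclass, field
from typing import Any

@dataclass
class PhaseRecord:
    """One entry in a payload's journey: what happened to the data at this phase."""
    phase: str
    row_count: int
    checks_passed: bool
    lineage: list[str] = field(default_factory=list)  # upstream sources feeding this phase

class PayloadJourney:
    """Accumulates per-phase metadata so a single payload can be tracked end to end."""

    def __init__(self, payload_id: str):
        self.payload_id = payload_id
        self.phases: list[PhaseRecord] = []

    def record(self, phase: str, rows: list[dict[str, Any]], sources: list[str]) -> None:
        # Illustrative test: every row must carry an "id" key to count as passing.
        checks = all("id" in r for r in rows)
        self.phases.append(PhaseRecord(phase, len(rows), checks, sources))

journey = PayloadJourney("orders-2024-06-01")
journey.record("ingest", [{"id": 1}, {"id": 2}], sources=["s3://raw/orders"])
journey.record("transform", [{"id": 1, "total": 9.5}], sources=["ingest"])
print([(p.phase, p.row_count, p.checks_passed) for p in journey.phases])
```

Because each phase appends its own record, a lost or malformed payload can be traced back to the last phase where its checks still passed.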
Reading Time: 2 minutes In today’s data-driven landscape, the integration of raw source data into usable business objects is a pivotal step in ensuring that organizations can make informed decisions and maximize the value of their data assets. To achieve these goals, a well-structured.
Preparing for an artificial intelligence (AI)-fueled future, one where we can enjoy the clear benefits the technology brings while also mitigating the risks, requires more than one article. This first article emphasizes data as the ‘foundation-stone’ of AI-based initiatives. Establishing a Data Foundation. The AI era is upon us.
KGs bring the Semantic Web paradigm to enterprises by introducing semantic metadata that drives data management and content management to new levels of efficiency, breaking silos and letting them synergize with various forms of knowledge management. Take this restaurant, for example. Enterprise Knowledge Graphs and the Semantic Web.
Data integrity constraints: Many databases don’t allow for strange or unrealistic combinations of input variables, and this could potentially thwart watermarking attacks. Applying data integrity constraints on live, incoming data streams could have the same benefits. Disparate impact analysis: see section 1.
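Applying such constraints to an incoming stream can be as simple as filtering records through a list of predicates. The constraints below are hypothetical examples, not taken from any particular database:

```python
# Minimal sketch: reject records whose value combinations a real database's
# integrity constraints would never allow, applied to a live stream of dicts.
CONSTRAINTS = [
    lambda r: r["age"] >= 0,                          # no impossible values
    lambda r: not (r["age"] < 16 and r["licensed"]),  # no unrealistic combinations
]

def passes_integrity(record: dict) -> bool:
    return all(check(record) for check in CONSTRAINTS)

stream = [
    {"age": 34, "licensed": True},
    {"age": 12, "licensed": True},   # strange combination: rejected
    {"age": -3, "licensed": False},  # impossible value: rejected
]
clean = [r for r in stream if passes_integrity(r)]
print(len(clean))  # → 1
```

A record that a watermarking attack has perturbed into an unrealistic combination is dropped before it ever reaches the model.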
The post My Reflections on the Gartner Hype Cycle for Data Management, 2024 appeared first on Data Management Blog - Data Integration and Modern Data Management Articles, Analysis and Information. Gartner Hype Cycle methodology provides a view of how.
Lower cost data processes. This article will help you understand the critical role of information stewardship as it relates to data and analytics. These stewards monitor the input and output of data integrations and workflows to ensure data quality. More effective business process execution.
In this blog, I will demonstrate the value of Cloudera DataFlow (CDF) , the edge-to-cloud streaming data platform available on the Cloudera Data Platform (CDP) , as a data integration and democratization fabric. Introduction to the Data Mesh Architecture and its Required Capabilities. Introduction.
The post Querying Minds Want to Know: Can a Data Fabric and RAG Clean up LLMs? – Part 4 : Intelligent Autonomous Agents appeared first on Data Management Blog - Data Integration and Modern Data Management Articles, Analysis and Information.
Ontotext’s GraphDB is an enterprise-ready semantic graph database (also called RDF triplestore as it stores data in RDF triples). It provides the core infrastructure for solutions where modeling agility, data integration, relationship exploration, cross-enterprise data publishing and consumption are critical.
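The triple model itself is easy to demonstrate. The sketch below uses plain Python tuples purely to illustrate how a triplestore holds and matches (subject, predicate, object) statements; it is not GraphDB's actual API, and the `ex:` identifiers are made up:

```python
# An RDF triplestore holds statements of the form (subject, predicate, object).
triples = {
    ("ex:GraphDB", "rdf:type", "ex:TripleStore"),
    ("ex:GraphDB", "ex:vendor", "ex:Ontotext"),
    ("ex:Ontotext", "rdf:type", "ex:Company"),
}

def query(s=None, p=None, o=None):
    """Match triples against a pattern; None acts as a wildcard."""
    return [(ts, tp, to) for (ts, tp, to) in triples
            if s in (None, ts) and p in (None, tp) and o in (None, to)]

print(query(s="ex:GraphDB"))  # every statement about ex:GraphDB
print(query(p="rdf:type"))    # every typed resource
```

Relationship exploration falls out of the model: any position of the pattern can be left open, which is the intuition behind SPARQL's triple patterns.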
The particular episode we recommend looks at how WeWork struggled with understanding their data lineage, so they created a metadata repository to increase visibility. Another podcast we think is worth a listen is Agile Data. Techopedia follows the latest trends in data and provides comprehensive tutorials.
We are in the era of graphs. Graphs are hot. Over the last few years, a number of new graph databases came to market. Flexibility is one strong driver: heterogeneous data, integrating new data sources, and analytics all require flexibility, and graphs deliver it in spades. As we start the next decade, dare we say […].
Ozone is also highly available — the Ozone metadata is replicated by Apache Ratis, an implementation of the Raft consensus algorithm for high-performance replication. Since Ozone supports both Hadoop FileSystem interface and Amazon S3 interface, frameworks like Apache Spark, YARN, Hive, and Impala can automatically use Ozone to store data.
If you do a general internet search for data catalogs, all sorts of possibilities emerge. If you look closely, and ask a lot of questions, you will find that some of these products are not actually fully functional data catalogs at all. Some software products start out life solving a specific use case related to data, […].
As noted in the Gartner Hype Cycle for Finance Data and Analytics Governance, 2023, “Through. The post My Understanding of the Gartner® Hype Cycle™ for Finance Data and Analytics Governance, 2023 appeared first on Data Management Blog - Data Integration and Modern Data Management Articles, Analysis and Information.
This month’s article features updates from one of the early data conferences of the year, Strata Data Conference – which was held just last week in San Francisco. In particular, here’s my Strata SF talk “Overview of Data Governance” presented in article form. Those days are long gone if they ever existed.
As a reminder, here’s Gartner’s definition of data fabric: “A design concept that serves as an integrated layer (fabric) of data and connecting processes. In this blog, we will focus on the “integrated layer” part of this definition by examining each of the key layers of a comprehensive data fabric in more detail.
According to this article , it costs $54,500 for every kilogram you want to send into space. That means removing errors, filling in missing information, and harmonizing the various data sources so that there is consistency. Once that is done, data can be transformed and enriched with metadata to facilitate analysis.
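Those three steps, harmonizing sources, filling gaps, and enriching with metadata, can be sketched concretely. The two sources and field names below are hypothetical:

```python
# Hedged sketch: unify two sources that name their fields differently,
# fill a missing value, then attach metadata to aid downstream analysis.
source_a = [{"name": "Acme", "revenue": "1200"}]
source_b = [{"company": "Beta Corp", "revenue": None}]

def harmonize(record: dict) -> dict:
    name = record.get("name") or record.get("company")  # unify field names
    revenue = record.get("revenue")
    return {
        "name": name,
        "revenue": int(revenue) if revenue is not None else 0,  # fill missing value
    }

cleaned = [harmonize(r) for r in source_a + source_b]
# Enrichment: every record now carries metadata about its provenance and units.
enriched = [{**r, "_meta": {"source_count": 2, "currency": "USD"}} for r in cleaned]
print(enriched)
```

The point is the ordering: harmonize and fill first, so the metadata you attach afterwards describes a consistent dataset rather than a pile of mismatched inputs.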
While transformations edit or restructure data to meet business objectives (such as aggregating sales data, enhancing customer information, or standardizing addresses), conversions typically deal with changing data formats, such as from CSV to JSON, or from string to integer types.
And data fabric is a self-service data layer that is supported in an orchestrated fashion to serve. The post Data Governance in a Data Mesh or Data Fabric Architecture appeared first on Data Management Blog - Data Integration and Modern Data Management Articles, Analysis and Information.
The post Navigating the New Data Landscape: Trends and Opportunities appeared first on Data Management Blog - Data Integration and Modern Data Management Articles, Analysis and Information. At TDWI, we see companies collecting traditional structured.
The post Improving the Accuracy of LLM-Based Text-to-SQL Generation with a Semantic Layer in the Denodo Platform appeared first on Data Management Blog - Data Integration and Modern Data Management Articles, Analysis and Information.
The post Harnessing the Power of Generative AI for Your Enterprise appeared first on Data Management Blog - Data Integration and Modern Data Management Articles, Analysis and Information.
Published as a special topic article in AI Magazine, Volume 43, Issue 1 , Spring 2022. The paper introduces KnowWhereGraph (KWG) as a solution to the ever-growing challenge of integrating heterogeneous data and building services on top of already existing open data. The catalog stores the asset’s metadata in RDF.
In this article, we are bringing science fiction to the semantic technology (and data management) talk to shed some light on three common data challenges: the storage, retrieval and security of information. We will talk through these from the perspective of Linked Data (and cyberpunk).
Reading Time: 11 minutes The post Data Strategies for Getting Greater Business Value from Distributed Data appeared first on Data Management Blog - Data Integration and Modern Data Management Articles, Analysis and Information.
The POC was for a data. The post Getting the Fundamentals Right for Gen AI appeared first on Data Management Blog - Data Integration and Modern Data Management Articles, Analysis and Information.
To be truly “data-driven,” an organization must view data as more than a byproduct. The post How to Shop for Data? appeared first on Data Management Blog - Data Integration and Modern Data Management Articles, Analysis and Information.
But what is a data lakehouse and why should we develop one? The post The Data Lakehouse: Blending Data Warehouses and Data Lakes appeared first on Data Virtualization blog - Data Integration and Modern Data Management Articles, Analysis and Information.
The idea seems, on the face of it, easy to understand: a data catalog is simply a centralized inventory of the data assets within an organization. Data catalogs also seek to be the. The post Choosing a Data Catalog: Data Map or Data Delivery App?
The Denodo Platform is a logical data management platform, powered by. The post Denodo Joins Forces with Presto appeared first on Data Management Blog - Data Integration and Modern Data Management Articles, Analysis and Information.
With the development of large language models (LLMs) and other generative AI (GenAI) technologies in recent years, we have doubled our efforts. The post Welcome to the Era of Denodo Assistant appeared first on Data Management Blog - Data Integration and Modern Data Management Articles, Analysis and Information.
Instead, it creates a unified way, sometimes called a data fabric, of accessing an organization’s data as well as 3rd party or global data in a seamless manner. Data is represented in a holistic, human-friendly and meaningful way. For efficient drug discovery, linked data is key.
One of the key considerations is how best to handle data, and this is where data mesh and data fabric come into play. The post Data Mesh vs Data Fabric: Understanding the Key Differences appeared first on Data Virtualization blog - Data Integration and Modern Data Management Articles, Analysis and Information.
The post The Secret Sauce of LeasePlan’s Award-Winning Logical Data Fabric appeared first on Data Virtualization blog - Data Integration and Modern Data Management Articles, Analysis and Information. This is a testament to the maturity of.
When workers get their hands on the right data, it not only gives them what they need to solve problems, but also prompts them to ask, “What else can I do with data?” through a truly data literate organization. What is data democratization? Security: Data security is a high priority.
From a technological perspective, RED combines a sophisticated knowledge graph with large language models (LLMs) for improved natural language processing (NLP), data integration, search and information discovery, built on top of the metaphactory platform.
Ontotext’s GraphDB is an enterprise-ready semantic graph database (also called RDF triplestore because it stores data in RDF triples). It provides the core infrastructure for solutions where modelling agility, data integration, relationship exploration, cross-enterprise data publishing and consumption are critical.
In this article, I will explain the modern data stack in detail, list some benefits, and discuss what the future holds. What Is the Modern Data Stack? The modern data stack is a combination of various software tools used to collect, process, and store data on a well-integrated cloud-based data platform.