Apply fair and private models, white-hat and forensic model debugging, and common sense to protect machine learning models from malicious actors. Like many others, I’ve known for some time that machine learning models themselves could pose security risks. One such risk resembles a denial-of-service (DoS) attack on the model itself.
A high hurdle many enterprises have yet to overcome is accessing mainframe data via the cloud. Mainframes hold an enormous amount of critical and sensitive business data including transactional information, healthcare records, customer data, and inventory metrics.
Iceberg’s metadata layer offers distinct advantages over plain Parquet, such as improved data management, performance optimization, and integration with various query engines. Unlike direct Amazon S3 access, Iceberg supports these operations on petabyte-scale data lakes without requiring complex custom code.
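For a sense of what that metadata layer looks like in practice, here is a minimal sketch assuming a Spark session already configured with an Iceberg catalog; the catalog, schema, and table names (`demo.sales.orders`) are hypothetical. Iceberg exposes its metadata as queryable system tables, so you can inspect snapshots and per-file statistics without listing S3 objects yourself:

```python
# A sketch, assuming Spark is configured with an Iceberg catalog named
# "demo"; all catalog/schema/table names are hypothetical.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("iceberg-metadata-demo").getOrCreate()

# The snapshots metadata table records every commit to the table.
spark.sql("SELECT snapshot_id, committed_at, operation "
          "FROM demo.sales.orders.snapshots").show()

# The files metadata table holds the per-file statistics Iceberg uses
# to prune data during query planning, instead of scanning S3 listings.
spark.sql("SELECT file_path, record_count "
          "FROM demo.sales.orders.files").show()
```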
And yeah, the real-world relationships among the entities represented in the data had to be fudged a bit to fit in the counterintuitive model of tabular data, but, in trade, you get reliability and speed. Not Every Graph is a Knowledge Graph: Schemas and Semantic Metadata Matter. Graph Databases vs Relational Databases.
Metadata management is key to wringing all the value possible from data assets. However, most organizations don’t use all the data at their disposal to reach deeper conclusions about how to drive revenue, achieve regulatory compliance or accomplish other strategic objectives. What Is Metadata? Harvest data.
Then there’s unstructured data with no contextual framework to govern data flows across the enterprise, not to mention time-consuming manual data preparation and limited views of data lineage. Today’s data modeling is not your father’s data modeling software.
As artificial intelligence (AI) and machine learning (ML) continue to reshape industries, robust data management has become essential for organizations of all sizes. This means organizations must cover their bases in all areas surrounding data management including security, regulations, efficiency, and architecture.
The core of their problem is applying AI technology to the data they already have, whether in the cloud, on their premises, or more likely both. Imagine that you’re a data engineer. The data is spread out across your different storage systems, and you don’t know what is where. What does the next generation of AI workloads need?
These strategies, such as investing in AI-powered cleansing tools and adopting federated governance models, not only address the current data quality challenges but also pave the way for improved decision-making, operational efficiency and customer satisfaction. When financial data is inconsistent, reporting becomes unreliable.
It addresses many of the shortcomings of traditional data lakes by providing features such as ACID transactions, schema evolution, row-level updates and deletes, and time travel. In this blog post, we’ll discuss how the metadata layer of Apache Iceberg can be used to make data lakes more efficient.
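Two of the features named above, row-level deletes and time travel, can be sketched in a few lines; this assumes a Spark session with the Iceberg SQL extensions enabled and uses the same hypothetical table names as earlier:

```python
# A sketch, assuming Spark with Iceberg SQL extensions configured;
# table and timestamp values are placeholders.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("iceberg-features-demo").getOrCreate()

# Row-level delete: Iceberg rewrites or marks only the affected rows,
# inside an ACID transaction.
spark.sql("DELETE FROM demo.sales.orders WHERE order_status = 'CANCELLED'")

# Time travel: query the table as of an earlier point in time.
spark.sql("SELECT count(*) FROM demo.sales.orders "
          "TIMESTAMP AS OF '2024-06-01 00:00:00'").show()
```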
What is Data Modeling? Data modeling is a process that enables organizations to discover, design, visualize, standardize and deploy high-quality data assets through an intuitive, graphical interface. Data models provide visualization, create additional metadata and standardize data design across the enterprise.
In addition to real-time analytics and visualization, the data needs to be shared for long-term data analytics and machine learning applications. To achieve this, EUROGATE designed an architecture that uses Amazon DataZone to publish specific digital twin data sets, enabling access to them with SageMaker in a separate AWS account.
“The challenge that a lot of our customers have is that it requires you to copy that data, store it in Salesforce; you have to create a place to store it; you have to create an object or field in which to store it; and then you have to maintain that pipeline of data synchronization and make sure that data is updated,” Carlson said.
In most companies, an incredible amount of data flows from multiple sources in a variety of formats and is constantly being moved and federated across a changing system landscape. And this time, you guessed it – we’re focusing on data automation and how it could impact metadata management and data governance.
Metadata is an important part of data governance, and as a result, most nascent data governance programs are rife with project plans for assessing and documenting metadata. But in many scenarios, it seems that the underlying driver of metadata collection projects is that it’s just something you do for data governance.
This is accomplished through tags, annotations, and metadata (TAM). Collecting, curating, and cataloging the pieces (i.e., granules) of the data collection for fast search, access, and retrieval is also important for efficient orchestration and delivery of the data that fuels AI, automation, and machine learning operations.
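As a toy illustration of the idea (not any specific product), tagging granules and indexing by tag makes retrieval an intersection lookup; every name here is hypothetical:

```python
# A toy sketch of a TAM-style index: tag -> granule IDs, with
# annotations stored alongside. All names and paths are hypothetical.
from collections import defaultdict

tag_index: dict[str, set[str]] = defaultdict(set)
annotation_store: dict[str, dict] = {}

def register(granule_id: str, tags: list[str], annotations: dict) -> None:
    """Catalog a granule under each of its tags and keep its annotations."""
    for tag in tags:
        tag_index[tag].add(granule_id)
    annotation_store[granule_id] = annotations

register("s3://bucket/part-0001.parquet", ["sales", "2024", "pii"],
         {"owner": "finance"})
register("s3://bucket/part-0002.parquet", ["sales", "2023"],
         {"owner": "finance"})

# Fast retrieval by tag intersection: all 2024 sales granules.
print(tag_index["sales"] & tag_index["2024"])
```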
We need to do more than automate model building with autoML; we need to automate tasks at every stage of the data pipeline. In a previous post , we talked about applications of machine learning (ML) to software development, which included a tour through sample tools in data science and for managing data infrastructure.
Data modeling supports collaboration among business stakeholders – with different job roles and skills – to coordinate with business objectives. Data resides everywhere in a business, on-premises and in private or public clouds. A single source of data truth helps companies begin to leverage data as a strategic asset.
Companies successfully adopt machine learning either by building on existing data products and services, or by modernizing existing models and algorithms. In this post, I share slides and notes from a keynote I gave at the Strata Data Conference in London earlier this year. A typical data pipeline for machine learning.
The role of data modeling (DM) has expanded to support enterprise data management, including data governance and intelligence efforts. Metadata management is the key to managing and governing your data and drawing intelligence from it. Types of Data Models: Conceptual, Logical and Physical.
When dealing with third-party data sources, AWS Data Exchange simplifies the discovery, subscription, and utilization of third-party data from a diverse range of producers or providers. As a producer, you can also monetize your data through the subscription model using AWS Data Exchange.
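As a minimal sketch of the discovery side, boto3’s AWS Data Exchange client can list the data sets an account is entitled to through subscriptions; the region is a placeholder and error handling is omitted:

```python
# A sketch of discovering entitled third-party data sets with the
# AWS Data Exchange API; region and output handling are simplified.
import boto3

dx = boto3.client("dataexchange", region_name="us-east-1")

# Origin="ENTITLED" lists data sets obtained via subscriptions;
# Origin="OWNED" would list data sets you publish as a producer.
paginator = dx.get_paginator("list_data_sets")
for page in paginator.paginate(Origin="ENTITLED"):
    for data_set in page["DataSets"]:
        print(data_set["Id"], data_set["Name"])
```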
For instance, Large Language Models (LLMs) are known to ultimately perform better when data is structured. And because data is fluid and constantly changing, it’s very easy for bias, bad data and sensitive information to creep into your AI data pipeline. Let’s give a for-instance.
We have enhanced data sharing performance with improved metadata handling, resulting in first-query execution on a data share that is up to four times faster while the producer’s data is being updated.
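For context, a hedged sketch of the producer-side SQL that sets up such a Redshift data share, issued here through the Redshift Data API; the cluster identifier, database, and consumer namespace GUID are placeholders:

```python
# A sketch of producer-side Redshift data sharing setup via the
# Redshift Data API; all identifiers and the GUID are placeholders.
import boto3

rsd = boto3.client("redshift-data", region_name="us-east-1")

statements = [
    "CREATE DATASHARE sales_share;",
    "ALTER DATASHARE sales_share ADD SCHEMA public;",
    "ALTER DATASHARE sales_share ADD ALL TABLES IN SCHEMA public;",
    # Grant the consumer namespace access to the share.
    "GRANT USAGE ON DATASHARE sales_share TO NAMESPACE "
    "'00000000-0000-0000-0000-000000000000';",
]
for sql in statements:
    rsd.execute_statement(
        ClusterIdentifier="producer-cluster",  # placeholder
        Database="dev",
        DbUser="awsuser",
        Sql=sql,
    )
```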
But even with the “need for speed” to market, new applications must be modeled and documented for compliance, transparency and stakeholder literacy. With all these diverse metadata sources, it is difficult to understand the complicated web they form, much less get a simple visual flow of data lineage and impact analysis.
In this post, we discuss how the reimagined data flow works with OR1 instances and how it can provide high indexing throughput and durability using a new physical replication protocol. We also dive deep into some of the challenges we solved to maintain correctness and data integrity.
They’re taking data they’ve historically used for analytics or business reporting and putting it to work in machine learning (ML) models and AI-powered applications. Amazon SageMaker Unified Studio (Preview) solves this challenge by providing an integrated authoring experience to use all your data and tools for analytics and AI.
When we talk about data integrity, we’re referring to the overarching completeness, accuracy, consistency, accessibility, and security of an organization’s data. Together, these factors determine the reliability of the organization’s data. In short, yes.
Part Two of the Digital Transformation Journey … In our last blog on driving digital transformation , we explored how enterprise architecture (EA) and business process (BP) modeling are pivotal factors in a viable digital transformation strategy. Digital Transformation Strategy: Smarter Data.
The results of our new research show that organizations are still trying to master data governance, including adjusting their strategies to address changing priorities and overcoming challenges related to data discovery, preparation, quality and traceability. And close to 50 percent have deployed data catalogs and business glossaries.
Q: Is data modeling cool again? In today’s fast-paced digital landscape, data reigns supreme. The data-driven enterprise relies on accurate, accessible, and actionable information to make strategic decisions and drive innovation. A: It always was and is getting cooler!!
It encompasses the people, processes, and technologies required to manage and protect data assets. The Data Management Association (DAMA) International defines it as the “planning, oversight, and control over management of data and the use of data and data-related sources.”
However, to turn data into business value, organizations need support to move beyond technical issues and start realizing value as quickly as possible. SAP Datasphere simplifies data integration, cataloging, semantic modeling, warehousing, federation, and virtualization through a unified interface.
The Semantic Web started in the late ’90s as a fascinating vision for a web of data that is easy to interpret by both humans and machines. One of its pillars is ontologies, which represent explicit formal conceptual models used to describe semantically both unstructured content and databases. Take this restaurant, for example.
Example 2: The Data Engineering Team Has Many Small, Valuable Files That Need Individual Source File Tracking. In a typical data processing workflow, tracking individual files as they progress through various stages, from file delivery to data ingestion, is crucial.
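One common way to get per-file lineage in a Spark ingestion job is to stamp each row with its source file path at read time; the bucket paths below are hypothetical:

```python
# A sketch of per-source-file tracking during ingestion with PySpark;
# bucket paths are placeholders.
from pyspark.sql import SparkSession
from pyspark.sql.functions import current_timestamp, input_file_name

spark = SparkSession.builder.appName("file-tracking-demo").getOrCreate()

df = (
    spark.read.json("s3://landing-bucket/deliveries/")  # many small files
    .withColumn("source_file", input_file_name())       # per-row file path
    .withColumn("ingested_at", current_timestamp())
)

# Downstream stages can now report status per individual source file.
df.groupBy("source_file").count().show(truncate=False)
```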
To better explain our vision for automating data governance, let’s look at some of the different aspects of how the erwin Data Intelligence Suite (erwin DI) incorporates automation. Data Cataloging: Catalog and sync metadata with data management and governance artifacts according to business requirements in real time.
Data in Use pertains explicitly to how data is actively employed in business intelligence tools, predictive models, visualization platforms, and even during export or reverse ETL processes. These applications are where the rubber meets the road and often where customers first encounter data quality issues.
S3 Tables integration with the AWS Glue Data Catalog is in preview, allowing you to stream, query, and visualize data, including Amazon S3 Metadata tables, using AWS analytics services such as Amazon Data Firehose, Amazon Athena, Amazon Redshift, Amazon EMR, and Amazon QuickSight. With AWS Glue 5.0,
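As a hedged sketch of the query path, once the Glue Data Catalog integration is enabled an S3 Tables table can be queried through Athena like any cataloged table; the catalog, database, table, and output locations below are placeholders:

```python
# A sketch of querying an S3 Tables table via Athena; the catalog name,
# namespace, table, and result bucket are all placeholders.
import boto3

athena = boto3.client("athena", region_name="us-east-1")

resp = athena.start_query_execution(
    QueryString='SELECT * FROM "orders" LIMIT 10',
    QueryExecutionContext={
        "Catalog": "s3tablescatalog",  # placeholder catalog name
        "Database": "sales",           # placeholder namespace
    },
    ResultConfiguration={"OutputLocation": "s3://query-results-bucket/"},
)
print(resp["QueryExecutionId"])
```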
Forward-thinking transformation leaders have realised that more focus needs to be placed on ‘data-centric value creation’ and have made this the pre-eminent organising principle in their organisations. Many organisations focus too heavily on fine-tuning their computational models in their pursuit of ‘quick wins.’ About Andrew P.
SAP unveiled Datasphere a year ago as a comprehensive data service, built on SAP Business Technology Platform (BTP), to provide a unified experience for data integration, data cataloging, semantic modeling, data warehousing, data federation, and data virtualization.
In Computer Science, we are trained to use Occam’s razor – the simplest model of reality that can get the job done is the best one. And each of these gains requires data integration across business lines and divisions. We call this the Bad Data Tax. So, how to manage this complexity better?
Data doubt compounds tough edge challenges. The variety of operational challenges at the edge is compounded by the difficulties of sourcing trustworthy data sets from heterogeneous IT/OT estates. Specifically, what the DCF does is capture metadata related to the application and compute stack.
Data is your generative AI differentiator, and a successful generative AI implementation depends on a robust data strategy incorporating a comprehensive data governance approach. Data governance is a critical building block across all these approaches, and we see two emerging areas of focus.
In most companies, an incredible amount of data flows from multiple sources in a variety of formats and is constantly being moved and federated across a changing system landscape. With an automation framework, data professionals can meet these needs at a fraction of the cost of the traditional manual way. Governing metadata.