With the growing emphasis on data, organizations are constantly seeking more efficient and agile ways to integrate their data, especially from a wide variety of applications. Zero-ETL is a set of fully managed integrations by AWS that minimizes the need to build ETL data pipelines.
Iceberg's metadata layer offers distinct advantages over plain Parquet, such as improved data management, performance optimization, and integration with various query engines. As mentioned earlier, 80% of quantitative research work is attributed to data management tasks.
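A quick sketch of what that metadata layer looks like in practice, assuming a Spark session with the Iceberg runtime JAR on the classpath; the catalog name and table are hypothetical:

```python
# Minimal sketch: inspecting Iceberg's metadata layer from PySpark.
# Assumes the Iceberg Spark runtime is available; "local" and "db.events"
# are placeholder catalog/table names.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("iceberg-metadata-demo")
    .config("spark.sql.extensions",
            "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
    .config("spark.sql.catalog.local", "org.apache.iceberg.spark.SparkCatalog")
    .config("spark.sql.catalog.local.type", "hadoop")
    .config("spark.sql.catalog.local.warehouse", "/tmp/iceberg-warehouse")
    .getOrCreate()
)

# Iceberg exposes its metadata as queryable tables, which bare Parquet files
# cannot do. Each snapshot records what changed and when.
spark.sql("SELECT snapshot_id, committed_at, operation "
          "FROM local.db.events.snapshots").show()

# Per-file statistics that query engines use to prune work at planning time.
spark.sql("SELECT file_path, record_count, file_size_in_bytes "
          "FROM local.db.events.files").show()
```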
A high hurdle many enterprises have yet to overcome is accessing mainframe data via the cloud. Mainframes hold an enormous amount of critical and sensitive business data including transactional information, healthcare records, customer data, and inventory metrics.
Central to this is metadata management, a critical component for driving future success. AI and ML need large amounts of accurate data for companies to get the most out of the technology. Unfortunately, many IT teams struggle to organize and track sensitive data across their environments. This approach is risky and costly.
Metadata management is key to wringing all the value possible from data assets. However, most organizations don’t use all the data at their disposal to reach deeper conclusions about how to drive revenue, achieve regulatory compliance or accomplish other strategic objectives. What Is Metadata? Analyze metadata.
With graph databases, representing relationships as data makes it possible to better represent data in real time, addressing newly discovered types of data and relationships. This is a graph of millions of edges and vertices – in enterprise data management terms, it is a giant piece of master/reference data.
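A toy sketch of the point, in plain Python rather than a real graph engine; all entities and relationship types here are invented:

```python
# In a graph model, relationships are data themselves, so new edge types can
# be added at runtime without a schema change.
from collections import defaultdict

edges = defaultdict(list)  # vertex -> list of (relationship, vertex)

def relate(source, relationship, target):
    edges[source].append((relationship, target))

relate("Customer:42", "PLACED", "Order:1001")
relate("Order:1001", "CONTAINS", "Product:SKU-7")
# A newly discovered relationship type: no table redesign was needed.
relate("Customer:42", "REFERRED_BY", "Customer:17")

# Traversal simply follows edges.
for relationship, target in edges["Customer:42"]:
    print(f"Customer:42 -[{relationship}]-> {target}")
```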
In the era of big data, data lakes have emerged as a cornerstone for storing vast amounts of raw data in its native format. They support structured, semi-structured, and unstructured data, offering a flexible and scalable environment for data ingestion from multiple sources.
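As a minimal sketch of that ingestion pattern, assuming an S3-based lake with a placeholder bucket name, raw objects of each kind can land side by side in their native formats:

```python
# Structured, semi-structured, and unstructured data ingested untransformed.
import json
import boto3

s3 = boto3.client("s3")
bucket = "example-data-lake"  # hypothetical bucket

# Semi-structured: a raw JSON event, stored as-is.
s3.put_object(Bucket=bucket, Key="raw/events/2024/event-001.json",
              Body=json.dumps({"user": 42, "action": "login"}).encode())

# Structured: a CSV export from an operational system.
s3.put_object(Bucket=bucket, Key="raw/exports/customers.csv",
              Body=b"id,name\n1,Alice\n2,Bob\n")

# Unstructured: a local PDF (placeholder filename) uploaded untouched.
s3.upload_file("contract.pdf", bucket, "raw/documents/contract.pdf")
```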
Data quality is no longer a back-office concern. We also examine how centralized, hybrid and decentralized data architectures support scalable, trustworthy ecosystems. One thing is clear for leaders aiming to drive trusted AI, resilient operations and informed decisions at scale: transformation starts with data you can trust.
If you’re a mystery lover, I’m sure you’ve read that classic tale: Sherlock Holmes and the Case of the Deceptive Data, and you know how a metadata catalog was a key plot element. In The Case of the Deceptive Data, Holmes is approached by B.I. He goes on to explain: Reasons for inaccurate data. Big data is BIG.
Organizations cannot hope to make the most of a data-driven strategy without at least some degree of metadata-driven automation. The volume and variety of data have snowballed, and so has its velocity. As such, traditional – and mostly manual – processes associated with data management and data governance have broken down.
erwin released its State of Data Governance Report in February 2018, just a few months before the General Data Protection Regulation (GDPR) took effect. Download the free GDPR guide: Step-by-Step Guide to Data Governance for GDPR. IDC Technology Spotlight, Data Intelligence in Context: get the report (it’s free).
At Salesforce World Tour NYC today, Salesforce unveiled a new global ecosystem of technology and solution providers geared to help its customers leverage third-party data via secure, bidirectional zero-copy integrations with Salesforce Data Cloud. “It works in Salesforce just like any other native Salesforce data,” Carlson said.
While cleaning up our archive recently, I found an old article published in 1976 about data dictionary/directory systems (DD/DS). Nowadays, we no longer use the term DD/DS, but “data catalog” or simply “metadata system”. It was written by L.
In order to figure out why the numbers in the two reports didn’t match, Steve needed to understand everything about the data that made up those reports – when the report was created, who created it, any changes made to it, which system it was created in, etc. Enterprise data governance. Metadata in data governance.
Their terminal operations rely heavily on seamless data flows and the management of vast volumes of data. Recently, EUROGATE has developed a digital twin for its Container Terminal Hamburg (CTH), generating millions of data points every second from Internet of Things (IoT) devices attached to its container handling equipment (CHE).
Metadata is an important part of data governance, and as a result, most nascent data governance programs are rife with project plans for assessing and documenting metadata. But in many scenarios, it seems that the underlying driver of metadata collection projects is that it’s just something you do for data governance.
Some challenges include data infrastructure that allows scaling and optimizing for AI; data management to inform AI workflows where data lives and how it can be used; and associated data services that help data scientists protect AI workflows and keep their models clean. Performance enhancements.
Specifically, in the modern era of massive data collections and exploding content repositories, we can no longer simply rely on keyword searches to be sufficient. This is accomplished through tags, annotations, and metadata (TAM). Smart content includes labeled (tagged, annotated) metadata (TAM). Can you find them all?
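A small illustration of why TAM can outperform bare keyword search, using invented documents and tags: a tag can match a concept the body text never literally mentions.

```python
# Documents carry curated tags and annotations alongside their raw text.
documents = [
    {"id": 1, "text": "Quarterly revenue rose 12% on subscription growth.",
     "tags": {"finance", "earnings", "SaaS"}},
    {"id": 2, "text": "The new compression codec cut storage costs in half.",
     "tags": {"engineering", "storage", "cost-optimization"}},
]

def keyword_search(query):
    return [d["id"] for d in documents if query.lower() in d["text"].lower()]

def tag_search(tag):
    return [d["id"] for d in documents if tag in d["tags"]]

print(keyword_search("SaaS"))  # [] -- the word never appears in any body text
print(tag_search("SaaS"))      # [1] -- the annotation finds it anyway
```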
A data catalog serves the same purpose. By using metadata (or short descriptions), data catalogs help companies gather, organize, retrieve, and manage information.
This would be a straightforward task were it not for the fact that, in the digital era, there has been an explosion of data – collected and stored everywhere – much of it poorly governed, ill-understood, and irrelevant. Data Centricity. There is evidence to suggest that there is a blind spot when it comes to data in the AI context.
are only starting to exist; one big task over the next two years is developing the IDEs for machine learning, plus other tools for data management, pipeline management, data cleaning, data provenance, and data lineage. GitHub is an excellent tool for managing code, but we need to think about [code+data].
The only question is, how do you ensure effective ways of breaking down data silos and bringing data together for self-service access? It starts by modernizing your data integration capabilities – ensuring disparate data sources and cloud environments can come together to deliver data in real time and fuel AI initiatives.
For decades, data modeling has been the optimal way to design and deploy new relational databases with high-quality data sources and support application development. Today’s data modeling is not your father’s data modeling. So here’s why data modeling is so critical to data governance.
Let’s briefly describe the capabilities of the AWS services we referred to above: AWS Glue is a fully managed, serverless, and scalable extract, transform, and load (ETL) service that simplifies the process of discovering, preparing, and loading data for analytics. Amazon Athena is used to query and explore the data.
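As a hedged sketch of that pairing, here is one way to run an Athena query over a Glue-cataloged table with boto3; the database, table, and results bucket are placeholders:

```python
# Athena requires an S3 location for query output; the call is asynchronous.
import time
import boto3

athena = boto3.client("athena")

run = athena.start_query_execution(
    QueryString="SELECT event_type, COUNT(*) AS n FROM events GROUP BY event_type",
    QueryExecutionContext={"Database": "analytics_db"},
    ResultConfiguration={"OutputLocation": "s3://example-athena-results/"},
)

# Poll until the query finishes, then fetch rows.
query_id = run["QueryExecutionId"]
while True:
    state = athena.get_query_execution(QueryExecutionId=query_id)[
        "QueryExecution"]["Status"]["State"]
    if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
        break
    time.sleep(1)

if state == "SUCCEEDED":
    result = athena.get_query_results(QueryExecutionId=query_id)
    for row in result["ResultSet"]["Rows"]:  # first row is the header
        print([col.get("VarCharValue") for col in row["Data"]])
```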
When it comes to using AI and machine learning across your organization, there are many good reasons to provide your data and analytics community with an intelligent data foundation. For instance, Large Language Models (LLMs) are known to ultimately perform better when data is structured.
Many customers also have data in managed operational databases such as MongoDB Atlas and need to combine it with data from Amazon Simple Storage Service (Amazon S3) data lakes to derive insights. Choose the table to view the schema and other metadata.
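The console step above has a programmatic counterpart; a minimal sketch against the Glue Data Catalog API, with placeholder database and table names:

```python
# Fetch a cataloged table's schema and storage metadata.
import boto3

glue = boto3.client("glue")
table = glue.get_table(DatabaseName="analytics_db", Name="mongodb_orders")["Table"]

print("Location:", table["StorageDescriptor"].get("Location"))
for col in table["StorageDescriptor"]["Columns"]:
    print(f'{col["Name"]}: {col["Type"]}')
```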
When we talk about data integrity, we’re referring to the overarching completeness, accuracy, consistency, accessibility, and security of an organization’s data. Together, these factors determine the reliability of the organization’s data.
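A toy sketch mapping a few of those dimensions to concrete checks, over an invented set of records:

```python
# Completeness, consistency, and accuracy checks on sample records.
records = [
    {"id": 1, "email": "a@example.com", "age": 34},
    {"id": 2, "email": None,            "age": 29},
    {"id": 2, "email": "c@example.com", "age": -5},
]

# Completeness: no missing values.
complete = [r for r in records if all(v is not None for v in r.values())]
# Consistency: identifiers should be unique.
unique_ids = len({r["id"] for r in records}) == len(records)
# Accuracy: values should fall in a plausible range.
plausible = [r for r in records if r["age"] is not None and 0 <= r["age"] <= 120]

print(f"complete: {len(complete)}/{len(records)}, "
      f"ids unique: {unique_ids}, plausible ages: {len(plausible)}")
```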
With this new instance family, OpenSearch Service uses OpenSearch innovation and AWS technologies to reimagine how data is indexed and stored in the cloud. Today, customers widely use OpenSearch Service for operational analytics because of its ability to ingest high volumes of data while also providing rich and interactive analytics.
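A minimal sketch of that ingest-then-analyze pattern with the opensearch-py client; the endpoint and index are placeholders and authentication setup is omitted:

```python
# Ingest a log event, then run an interactive query against it.
from opensearchpy import OpenSearch

client = OpenSearch(
    hosts=[{"host": "search-example.us-east-1.es.amazonaws.com", "port": 443}],
    use_ssl=True,
)

# Operational analytics starts with high-volume ingestion of events like this.
client.index(index="app-logs", body={
    "timestamp": "2024-06-01T12:00:00Z",
    "level": "ERROR",
    "message": "payment service timeout",
})

# Rich, interactive analytics over what was just ingested.
result = client.search(index="app-logs", body={
    "query": {"match": {"level": "ERROR"}}
})
print(result["hits"]["total"])
```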
With all the hype around AI, it can be tempting to jump into use cases involving data types with which you aren’t familiar. We found that companies that have successfully adopted machine learning do so either by building on existing data products and services, or by modernizing existing models and algorithms. Source: O'Reilly.
So if you’re going to move your data from on-premises legacy data stores and warehouse systems to the cloud, you should do it right the first time. And as you make this transition, you need to understand what data you have, know where it is located, and govern it along the way. Automated Cloud Migration.
Amazon Redshift, launched in 2013, has undergone significant evolution since its inception, allowing customers to expand the horizons of data warehousing and SQL analytics. Industry-leading price-performance: Amazon Redshift offers up to three times better price-performance than alternative cloud data warehouses.
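For a flavor of the SQL analytics side, here is a hedged sketch using the Redshift Data API, which needs no driver or connection management; the workgroup and database names are placeholders:

```python
# Run SQL against Redshift Serverless through the asynchronous Data API.
import boto3

rsd = boto3.client("redshift-data")

resp = rsd.execute_statement(
    WorkgroupName="example-workgroup",
    Database="dev",
    Sql="SELECT COUNT(*) FROM sales WHERE sale_date >= '2024-01-01'",
)

# The call returns immediately; check the statement's status by Id.
desc = rsd.describe_statement(Id=resp["Id"])
print(desc["Status"])
```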
Digital transformation and data standards/uniformity round out the top five data governance drivers, with 37 and 36 percent, respectively. Constructing a Digital Transformation Strategy: How Data Drives Digital. However, more than 50 percent say they have deployed metadata management, data analytics, and data quality solutions.
Data governance definition Data governance is a system for defining who within an organization has authority and control over data assets and how those data assets may be used. It encompasses the people, processes, and technologies required to manage and protect data assets.
Many organizations prioritize data collection as part of their digital transformation strategy. However, few organizations truly understand their data or know how to consistently maximize its value. How does your business become more adept at wringing all the value it can from its data?
Data modeling is a process that enables organizations to discover, design, visualize, standardize and deploy high-quality data assets through an intuitive, graphical interface. Data models provide visualization, create additional metadata and standardize data design across the enterprise. What is Data Modeling?
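Purely as an illustration of where such a model eventually lands, here is a small logical model expressed as SQLAlchemy declarative classes; the entities and relationship are invented:

```python
# A standardized, deployable data design: entities, attributes, relationship.
from sqlalchemy import Column, ForeignKey, Integer, String, create_engine
from sqlalchemy.orm import declarative_base, relationship

Base = declarative_base()

class Customer(Base):
    __tablename__ = "customers"
    id = Column(Integer, primary_key=True)
    name = Column(String(100), nullable=False)
    orders = relationship("Order", back_populates="customer")

class Order(Base):
    __tablename__ = "orders"
    id = Column(Integer, primary_key=True)
    customer_id = Column(Integer, ForeignKey("customers.id"), nullable=False)
    customer = relationship("Customer", back_populates="orders")

# Deploy the standardized design to a database in one step.
Base.metadata.create_all(create_engine("sqlite:///model_demo.db"))
```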
For this, Cargotec built an Amazon Simple Storage Service (Amazon S3) data lake and cataloged the data assets in the AWS Glue Data Catalog. They chose AWS Glue as their preferred data integration tool due to its serverless nature, low maintenance, ability to control compute resources in advance, and ability to scale when needed.
In today’s data-driven landscape, data and analytics teams increasingly face a unique set of challenges presented by demanding data consumers who require a personalized level of data observability. It assigns unique identifiers to each data item – referred to as ‘payloads’ – related to each event.
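A minimal sketch of the payload-identifier pattern described above; the field names are invented:

```python
# Every payload tied to an event gets a unique identifier so its journey can
# be observed end to end across downstream systems.
import uuid
from datetime import datetime, timezone

def wrap_payload(event_name, data):
    return {
        "payload_id": str(uuid.uuid4()),  # unique per data item
        "event": event_name,
        "emitted_at": datetime.now(timezone.utc).isoformat(),
        "data": data,
    }

p = wrap_payload("order_created", {"order_id": 1001, "total": 99.50})
print(p["payload_id"])  # trace this id through every downstream hop
```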
Salesforce closes acquisition of MuleSoft – May 2018 (business app vendor acquires data integration). But the recent spate of acquisitions just proves the point: the work of data and analytics governance remains an afterthought, even for these large vendors (just as it does for many of their prospects).
They are also starting to realize – and accept – that data is challenging. Post-COVID, companies now understand that IT skills are different from data skills. It is easier to list the symptoms of a problematic data foundation as they are often pretty clear to business users. Why is this interesting?
A modern data strategy redefines and enables sharing data across the enterprise and allows for both reading and writing of a singular instance of the data using an open table format. Why Cloudinary chose Apache Iceberg: Apache Iceberg is a high-performance table format for huge analytic workloads.
The Semantic Web started in the late 1990s as a fascinating vision for a web of data that is easy to interpret by both humans and machines. In this post you will discover the aspects of the Semantic Web that are key to enterprise data, knowledge and content management. Take this restaurant, for example.
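A tiny sketch of that vision with rdflib: the same facts are stored as triples that both humans and machines can read. The restaurant data is invented to echo the example.

```python
# Facts as machine-readable RDF triples, serialized for humans as Turtle.
from rdflib import Graph, Literal, Namespace, RDF

EX = Namespace("http://example.org/")
g = Graph()

g.add((EX.MarioRestaurant, RDF.type, EX.Restaurant))
g.add((EX.MarioRestaurant, EX.servesCuisine, Literal("Italian")))
g.add((EX.MarioRestaurant, EX.locatedIn, EX.Vienna))

# Human-readable view of the same data.
print(g.serialize(format="turtle"))

# Machine-readable access via SPARQL.
for row in g.query("SELECT ?r WHERE { ?r a <http://example.org/Restaurant> }"):
    print(row.r)
```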
Data modeling supports collaboration among business stakeholders – with different job roles and skills – to coordinate with business objectives. Data resides everywhere in a business, on-premises and in private or public clouds. A single source of data truth helps companies begin to leverage data as a strategic asset.
Data fabric and data mesh are also both related to logical data management, which is the approach of providing virtualized access to data across an enterprise without the requirement to first extract and load it into a central repository.
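One concrete flavor of that approach, sketched with DuckDB, which can scan remote Parquet in place instead of extracting and loading it first; the S3 path is a placeholder:

```python
# Query data where it lives: no extract-and-load into a central repository.
import duckdb

con = duckdb.connect()
con.execute("INSTALL httpfs")  # extension for reading over HTTP/S3
con.execute("LOAD httpfs")

# The files are scanned at query time, not copied into a warehouse.
rows = con.execute(
    "SELECT COUNT(*) FROM read_parquet('s3://example-bucket/sales/*.parquet')"
).fetchall()
print(rows)
```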