Data Governance, Interactive and Metadata

Accelerating AI at scale without sacrificing security

CIO Business Intelligence

NOVEMBER 27, 2024

As AI adoption accelerates, it demands increasingly vast amounts of data, leading to more users accessing, transferring, and managing it across diverse environments. Each interaction amplifies the potential for errors, breaches, or misuse, underscoring the critical need for a strong governance framework to mitigate these risks.

Data Governance

Data Governance Risk Insurance Metadata

Integrating Data Governance and Enterprise Architecture

erwin

SEPTEMBER 3, 2020

Why should you integrate data governance (DG) and enterprise architecture (EA)? Data governance provides time-sensitive, current-state architecture information with a high level of quality. Data governance provides time-sensitive, current-state architecture information with a high level of quality.

Data Governance

Data Governance Enterprise Risk Data Lake

What Is Data Governance? (And Why Your Organization Needs It)

erwin

AUGUST 28, 2020

Organizations with a solid understanding of data governance (DG) are better equipped to keep pace with the speed of modern business. In this post, the erwin Experts address: What Is Data Governance? Why Is Data Governance Important? What Is Good Data Governance? What Is Data Governance?

Data Governance

Data Governance IT Cost-Benefit Metadata

Webinars

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Automating Data Governance

erwin

OCTOBER 29, 2020

Automating data governance is key to addressing the exponentially growing volume and variety of data. Data readiness is everything. The State of Data Automation. Data readiness depends on automation to create the data pipeline. We asked participants to “talk to us about data value chain bottlenecks.”

Data Governance

Data Governance Metadata Digital Transformation ROI

7 Benefits of Metadata Management

erwin

FEBRUARY 19, 2021

Metadata management is key to wringing all the value possible from data assets. However, most organizations don’t use all the data at their disposal to reach deeper conclusions about how to drive revenue, achieve regulatory compliance or accomplish other strategic objectives. What Is Metadata? Harvest data.

Metadata

Metadata Management Data Quality Cost-Benefit

Metadata Management Best Practices: How to Plan Your Metadata Management Program

Octopai

NOVEMBER 10, 2021

Metadata has been defined as the who, what, where, when, why, and how of data. Without the context given by metadata, data is just a bunch of numbers and letters. But going on a rampage to define, categorize, and otherwise metadata-ize your data doesn’t necessarily give you the key to the value in your data.

Metadata

Metadata Management Interactive Strategy

Monitoring Apache Iceberg metadata layer using AWS Lambda, AWS Glue, and AWS CloudWatch

AWS Big Data

JULY 29, 2024

It addresses many of the shortcomings of traditional data lakes by providing features such as ACID transactions, schema evolution, row-level updates and deletes, and time travel. In this blog post, we’ll discuss how the metadata layer of Apache Iceberg can be used to make data lakes more efficient.

Metadata

Metadata Snapshot Data Lake Metrics

Data governance in the age of generative AI

AWS Big Data

FEBRUARY 29, 2024

Data is your generative AI differentiator, and a successful generative AI implementation depends on a robust data strategy incorporating a comprehensive data governance approach. Data governance is a critical building block across all these approaches, and we see two emerging areas of focus.

Data Governance

Data Governance Unstructured Data Metadata Data Lake

Cloudera and Snowflake Partner to Deliver the Most Comprehensive Open Data Lakehouse

Cloudera

OCTOBER 23, 2024

In August, we wrote about how in a future where distributed data architectures are inevitable, unifying and managing operational and business metadata is critical to successfully maximizing the value of data, analytics, and AI.

Metadata

Metadata Data Lake Dashboards Interactive

What Does It Mean to Make Data Governance Fun?

Alation

JANUARY 19, 2023

The words “ data governance ” and “fun” are seldom spoken together. The term data governance conjures images of restrictions and control that result in an uphill challenge for most programs and organizations from the beginning. Or they are spending too much time preparing the data for proper use.

Data Governance

Data Governance IT Metadata Recreation/Entertainment

Organize content across business units with enterprise-wide data governance using Amazon DataZone domain units and authorization policies

AWS Big Data

AUGUST 13, 2024

Amazon DataZone has announced a set of new data governance capabilities—domain units and authorization policies—that enable you to create business unit-level or team-level organization and manage policies according to your business needs. Data domains form a foundational pillar in data governance frameworks.

Data Governance

Data Governance Metadata Enterprise Sales

How BMW streamlined data access using AWS Lake Formation fine-grained access control

AWS Big Data

OCTOBER 29, 2024

To achieve this, they aimed to break down data silos and centralize data from various business units and countries into the BMW Cloud Data Hub (CDH). However, the initial version of CDH supported only coarse-grained access control to entire data assets, and hence it was not possible to scope access to data asset subsets.

Data Lake

Data Lake Sales Metadata Machine Learning

Becoming a machine learning company means investing in foundational technologies

O'Reilly on Data

MAY 21, 2019

You also need solutions that let you understand what data you have and who can access it. About a third of the respondents in the survey indicated they are interested in data governance systems and data catalogs. Metadata and artifacts needed for audits. Marquez (WeWork) and Databook (Uber). Source: O'Reilly.

Machine Learning

Machine Learning Technology Deep Learning Data Science

Data Management 20/20: Data Governance Challenges in a Digital Society

TDAN

MAY 5, 2020

The reversal from information scarcity to information abundance and the shift from the primacy of entities to the primacy of interactions has resulted in an increased burden for the data involved in those interactions to be trustworthy.

Data Governance

Data Governance Management Interactive Data Quality

Top 5 Data Catalog Benefits: Understanding Your Organization’s Data Lineage

erwin

AUGUST 7, 2019

A data catalog benefits organizations in a myriad of ways. With the right data catalog tool, organizations can automate enterprise metadata management – including data cataloging, data mapping, data quality and code generation for faster time to value and greater accuracy for data movement and/or deployment projects.

Metadata

Metadata Data Governance Data Quality Data Warehouse

What is Active Metadata & Why it Matters: Key Insights from Gartner’s Market Guide

Alation

MARCH 2, 2023

Instead, we got data. Lots and lots of data. Well, we got jetpacks, too, but we rarely interact with them during the workday. It does feel, however, as if we need jet-like speed to analyze and understand our data, who is using it, how it is used, and if it is being used to drive value. This data about data is valuable.

Metadata

Metadata Marketing IT Data Quality

How Cloudera Data Flow Enables Successful Data Mesh Architectures

Cloudera

OCTOBER 7, 2021

Application Logic: Application logic refers to the type of data processing, and can be anything from analytical or operational systems to data pipelines that ingest data inputs, apply transformations based on some business logic and produce data outputs.

Metadata

Metadata Cost-Benefit Enterprise Interactive

What’s Business Process Modeling Got to Do with It? – Choosing A BPM Tool

erwin

MARCH 21, 2019

With business process modeling (BPM) being a key component of data governance , choosing a BPM tool is part of a dilemma many businesses either have or will soon face. Historically, BPM didn’t necessarily have to be tied to an organization’s data governance initiative. Choosing a BPM Tool: An Overview.

Modeling

Modeling Metadata Data Governance IT

SAP enhances Datasphere and SAC for AI-driven transformation

CIO Business Intelligence

MARCH 6, 2024

SAP announced today a host of new AI copilot and AI governance features for SAP Datasphere and SAP Analytics Cloud (SAC). The company is expanding its partnership with Collibra to integrate Collibra’s AI Governance platform with SAP data assets to facilitate data governance for non-SAP data assets in customer environments. “We

Unstructured Data

Unstructured Data Dashboards Business Intelligence Data Governance

The Ultimate Guide to Modern Data Quality Management (DQM) For An Effective Data Quality Control Driven by The Right Metrics

datapine

SEPTEMBER 29, 2022

This person (or group of individuals) ensures that the theory behind data quality is communicated to the development team. 2 – Data profiling. Data profiling is an essential process in the DQM lifecycle. from the business interactions), but if not available, then through confirmation techniques of an independent nature.

Data Quality

Data Quality Metrics Data-driven Management

Top analytics announcements of AWS re:Invent 2024

AWS Big Data

FEBRUARY 26, 2025

S3 Tables integration with the AWS Glue Data Catalog is in preview, allowing you to stream, query, and visualize dataincluding Amazon S3 Metadata tablesusing AWS analytics services such as Amazon Data Firehose , Amazon Athena , Amazon Redshift, Amazon EMR, and Amazon QuickSight. With AWS Glue 5.0,

Analytics

Analytics Data Lake Metadata Data Warehouse

6 BI challenges IT teams must address

CIO Business Intelligence

DECEMBER 21, 2022

“The number-one issue for our BI team is convincing people that business intelligence will help to make true data-driven decisions,” says Diana Stout, senior business analyst at Schellman, a global cybersecurity assessor based in Tampa, Fl. It’s about being able to find relevant data and connect it through a knowledge graph.

IT

IT Business Intelligence Sales Key Performance Indicator

SAP Datasphere review: turning data from a technical problem to a business data product.

Jen Stirrup

MARCH 29, 2023

By providing a unified view of the data, the semantic layer helps ensure that different users and reports use consistent definitions and calculations, thereby helping to provide a single view of the customer. The new SAP Datasphere catalog provides data lineage, metadata information, and quick searching capabilities across your SAP landscape.

Data Warehouse

Data Warehouse Metadata Data Integration Business Intelligence

Themes and Conferences per Pacoid, Episode 11

Domino Data Lab

JULY 2, 2019

In other words, using metadata about data science work to generate code. In this case, code gets generated for data preparation, where so much of the “time and labor” in data science work is concentrated. Interactive Query Synthesis from Input-Output Examples ” – Chenglong Wang, Alvin Cheung, Rastislav Bodik (2017-05-14).

Metadata

Metadata Data Science Machine Learning Data-driven

Minimizing Supply Chain Disruptions with Advanced Analytics

Cloudera

AUGUST 3, 2021

Advanced analytics and enterprise data are empowering several overarching initiatives in supply chain risk reduction – improved visibility and transparency into all aspects of the supply chain balanced with data governance and security. . Improve Visibility within Supply Chains.

Analytics

Analytics Digital Transformation Forecasting Risk

Cross-account data collaboration with Amazon DataZone and AWS analytical tools

AWS Big Data

MARCH 5, 2025

You will then publish the data assets from these data sources. The Amazon DataZone data sources allow you to connect to various data sources, including databases, data warehouses, and data lakes, and ingest metadata into Amazon DataZone. Add an AWS Glue data source to publish the new AWS Glue table.

Analytics

Analytics Publishing Metadata Sales

Read and write S3 Iceberg table using AWS Glue Iceberg Rest Catalog from Open Source Apache Spark

AWS Big Data

DECEMBER 4, 2024

The post will include details on how to perform read/write data operations against Amazon S3 tables with AWS Lake Formation managing metadata and underlying data access using temporary credential vending. Create a user defined IAM role following the instructions in Requirements for roles used to register locations.

Data Lake

Data Lake Metadata Insurance Data-driven

How gaming companies can use Amazon Redshift Serverless to build scalable analytical applications faster and easier

AWS Big Data

MARCH 7, 2023

Analytics reference architecture for gaming organizations In this section, we discuss how gaming organizations can use a data hub architecture to address the analytical needs of an enterprise, which requires the same data at multiple levels of granularity and different formats, and is standardized for faster consumption.

Analytics

Analytics Data Warehouse Data Lake Metadata

6 benefits of data lineage for financial services

IBM Big Data Hub

FEBRUARY 26, 2024

The financial services industry has been in the process of modernizing its data governance for more than a decade. But as we inch closer to global economic downturn, the need for top-notch governance has become increasingly urgent. Download the Gartner® Market Guide for Active Metadata Management 1.

Cost-Benefit

Cost-Benefit Metadata Data Governance Reporting

How Volkswagen streamlined access to data across multiple data lakes using Amazon DataZone – Part 1

AWS Big Data

JULY 18, 2024

The current method is largely manual, relying on emails and general communication, which not only increases overhead but also varies from one use case to another in terms of data governance. Data domain producers publish data assets using datasource run to Amazon DataZone in the Central Governance account.

Data Lake

Data Lake Publishing Metadata Data-driven

How Amazon Finance Automation built a data mesh to support distributed data ownership and centralize governance

AWS Big Data

JULY 14, 2023

In this post, we discuss how the Amazon Finance Automation team used AWS Lake Formation and the AWS Glue Data Catalog to build a data mesh architecture that simplified data governance at scale and provided seamless data access for analytics, AI, and machine learning (ML) use cases.

Finance

Finance Metadata Big Data Recreation/Entertainment

Why I Joined Alation: A Former Customer’s Story

Alation

JULY 26, 2021

One of the first steps in any digital transformation journey is to understand what data assets exist in the organization. When we began, we had a very technical and archaic tool, an enterprise metadata management platform that cataloged our assets. The people behind the data are key. It was terribly complex.

Insurance

Insurance Digital Transformation Enterprise Data Governance

Accelerate Your Data Mesh in the Cloud with Cloudera Data Engineering and Modak NabuTM

Cloudera

OCTOBER 11, 2021

The platform converges data cataloging, data ingestion, data profiling, data tagging, data discovery, and data exploration into a unified platform, driven by metadata. Modak Nabu automates repetitive tasks in the data preparation process and thus accelerates the data preparation by 4x.

Data Lake

Data Lake Cost-Benefit Data-driven Dashboards

What Is Data Intelligence?

Alation

AUGUST 26, 2021

What Is Data Intelligence? Data intelligence is a system to deliver trustworthy, reliable data. It includes intelligence about data, or metadata. IDC coined the term, stating, “data intelligence helps organizations answer six fundamental questions about data.” Why keep data at all?

Metadata

Metadata Data Governance Dashboards Software

You Cannot Get to the Moon on a Bike!

Ontotext

JANUARY 10, 2024

Limiting growth by (data integration) complexity Most operational IT systems in an enterprise have been developed to serve a single business function and they use the simplest possible model for this. In both cases, semantic metadata is the glue that turns knowledge graphs into hubs of data, metadata, and content.

Metadata

Metadata Slice and Dice Data Integration Enterprise

The importance of governance: What we’re learning from AI advances in 2022

IBM Big Data Hub

DECEMBER 16, 2022

Over the last week, millions of people around the world have interacted with OpenAI’s ChatGPT, which represents a significant advance for generative artificial intelligence (AI) and the foundation models that underpin many of these use cases. It’s a fitting way to end what has been another big year for the industry.

Uncertainty

Uncertainty Metadata Modeling Data Collection

The Enduring Significance of Data Modeling in the Modern Data-Driven Enterprise

erwin

AUGUST 31, 2023

It delivers the ability to capture and unify the business and technical perspectives of data assets, enables effective collaboration between a variety of stakeholders, and delivers metadata-driven automation to accelerate the creation and maintenance of data sources on virtually any data management platform.

Data-driven

Data-driven Modeling Enterprise Structured Data

Themes and Conferences per Pacoid, Episode 8

Domino Data Lab

APRIL 3, 2019

Paco Nathan ‘s latest column dives into data governance. This month’s article features updates from one of the early data conferences of the year, Strata Data Conference – which was held just last week in San Francisco. In particular, here’s my Strata SF talk “Overview of Data Governance” presented in article form.

Machine Learning

Machine Learning Data Governance Metadata Data Science

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

Customer 360 (C360) provides a complete and unified view of a customer’s interactions and behavior across all touchpoints and channels. This view is used to identify patterns and trends in customer behavior, which can inform data-driven decisions to improve business outcomes. Then, you transform this data into a concise format.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

What is a Machine Learning Data Catalog?

Alation

MARCH 17, 2021

According to the Forrester Wave: Machine Learning Data Catalogs, Q4 2020 , “Alation exploits machine learning at every opportunity to improve data management, governance, and consumption by analytic citizens. An MLDC brings many benefits, like: Enhanced data management. Data governance streamlining.

Machine Learning

Machine Learning Metadata Data Governance Data Quality

AWS Lake Formation 2022 year in review

AWS Big Data

JANUARY 31, 2023

Data governance is the collection of policies, processes, and systems that organizations use to ensure the quality and appropriate handling of their data throughout its lifecycle for the purpose of generating business value.

Data Lake

Data Lake Data Governance Data Architecture Machine Learning

Introducing AWS Glue crawler and create table support for Apache Iceberg format

AWS Big Data

AUGUST 16, 2023

Iceberg has become very popular for its support for ACID transactions in data lakes and features like schema and partition evolution, time travel, and rollback. Iceberg captures metadata information on the state of datasets as they evolve and change over time. Choose Create.

Data Lake

Data Lake Metadata Snapshot Management

Insights from Gartner Data & Analytics Summit Orlando 2023

Alation

MARCH 31, 2023

Ehtisham Zaidi, Gartner’s VP of data management, and Robert Thanaraj, Gartner’s director of data management, gave an update on the fabric versus mesh debate in light of what they call the “active metadata era” we’re currently in. The foundations of successful data governance The state of data governance was also top of mind.

Data Analytics

Data Analytics Analytics Metadata Data Governance

Orchestrate an end-to-end ETL pipeline using Amazon S3, AWS Glue, and Amazon Redshift Serverless with Amazon MWAA

AWS Big Data

APRIL 25, 2024

This approach allows the team to process the raw data extracted from Account A to Account B, which is dedicated for data handling tasks. This makes sure the raw and processed data can be maintained securely separated across multiple accounts, if required, for enhanced data governance and security. secretsmanager ).

Metadata

Metadata Data Processing Management Testing

Accelerating AI at scale without sacrificing security

Integrating Data Governance and Enterprise Architecture

Webinars

Trending Sources

What Is Data Governance? (And Why Your Organization Needs It)

Webinars

Automating Data Governance

7 Benefits of Metadata Management

Metadata Management Best Practices: How to Plan Your Metadata Management Program

Monitoring Apache Iceberg metadata layer using AWS Lambda, AWS Glue, and AWS CloudWatch

Data governance in the age of generative AI

Cloudera and Snowflake Partner to Deliver the Most Comprehensive Open Data Lakehouse

What Does It Mean to Make Data Governance Fun?

Organize content across business units with enterprise-wide data governance using Amazon DataZone domain units and authorization policies

How BMW streamlined data access using AWS Lake Formation fine-grained access control

Becoming a machine learning company means investing in foundational technologies

Data Management 20/20: Data Governance Challenges in a Digital Society

Top 5 Data Catalog Benefits: Understanding Your Organization’s Data Lineage

What is Active Metadata & Why it Matters: Key Insights from Gartner’s Market Guide

How Cloudera Data Flow Enables Successful Data Mesh Architectures

What’s Business Process Modeling Got to Do with It? – Choosing A BPM Tool

SAP enhances Datasphere and SAC for AI-driven transformation

The Ultimate Guide to Modern Data Quality Management (DQM) For An Effective Data Quality Control Driven by The Right Metrics

Top analytics announcements of AWS re:Invent 2024

6 BI challenges IT teams must address

SAP Datasphere review: turning data from a technical problem to a business data product.

Themes and Conferences per Pacoid, Episode 11

Minimizing Supply Chain Disruptions with Advanced Analytics

Cross-account data collaboration with Amazon DataZone and AWS analytical tools

Read and write S3 Iceberg table using AWS Glue Iceberg Rest Catalog from Open Source Apache Spark

How gaming companies can use Amazon Redshift Serverless to build scalable analytical applications faster and easier

6 benefits of data lineage for financial services

How Volkswagen streamlined access to data across multiple data lakes using Amazon DataZone – Part 1

How Amazon Finance Automation built a data mesh to support distributed data ownership and centralize governance

Why I Joined Alation: A Former Customer’s Story

Accelerate Your Data Mesh in the Cloud with Cloudera Data Engineering and Modak NabuTM

What Is Data Intelligence?

You Cannot Get to the Moon on a Bike!

The importance of governance: What we’re learning from AI advances in 2022

The Enduring Significance of Data Modeling in the Modern Data-Driven Enterprise

Themes and Conferences per Pacoid, Episode 8

Create an end-to-end data strategy for Customer 360 on AWS

What is a Machine Learning Data Catalog?

AWS Lake Formation 2022 year in review

Introducing AWS Glue crawler and create table support for Apache Iceberg format

Insights from Gartner Data & Analytics Summit Orlando 2023

Orchestrate an end-to-end ETL pipeline using Amazon S3, AWS Glue, and Amazon Redshift Serverless with Amazon MWAA

Stay Connected