A data lake is a centralized repository designed to house big data in structured, semi-structured and unstructured form. I have been covering the data lake topic for several years and encourage you to check out an earlier perspective called Data Lakes: Safe Way to Swim in Big Data?
In this analyst perspective, Dave Menninger takes a look at data lakes. He explains the term “data lake,” describes common use cases and shares his views on some of the latest market trends. He explores the relationship between data warehouses and data lakes and shares some of Ventana Research’s findings on the subject.
Collibra is a data governance software company that offers tools for metadata management and data cataloging. The software enables organizations to find data quickly, identify its source and assure its integrity.
Data architecture describes the structure of an organization's logical and physical data assets and data management resources, according to The Open Group Architecture Framework (TOGAF). An organization's data architecture is the purview of data architects.
This integration enables data teams to efficiently transform and manage data using Athena with dbt Cloud's robust features, enhancing the overall data workflow experience and letting you extract insights from your data without the complexity of managing infrastructure.
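To make the transformation step concrete, here is a minimal sketch of running a cleanup query against Athena from Python with boto3. The database, table, and bucket names are hypothetical placeholders, and this illustrates the kind of SQL a dbt model would own rather than the dbt Cloud integration itself.

```python
# Minimal sketch, assuming boto3 is configured with AWS credentials.
# All names (databases, tables, buckets) are hypothetical placeholders.
import time

import boto3

athena = boto3.client("athena", region_name="us-east-1")

# A CTAS statement that materializes a cleaned table -- the kind of
# transformation a dbt model would manage in the integration described above.
QUERY = """
CREATE TABLE analytics.orders_clean
WITH (format = 'PARQUET',
      external_location = 's3://example-bucket/clean/orders/') AS
SELECT order_id, customer_id, CAST(order_ts AS timestamp) AS order_ts
FROM raw.orders
WHERE order_id IS NOT NULL
"""

run = athena.start_query_execution(
    QueryString=QUERY,
    ResultConfiguration={"OutputLocation": "s3://example-bucket/athena-results/"},
)

# Poll until Athena reports a terminal state.
while True:
    status = athena.get_query_execution(
        QueryExecutionId=run["QueryExecutionId"]
    )["QueryExecution"]["Status"]["State"]
    if status in ("SUCCEEDED", "FAILED", "CANCELLED"):
        break
    time.sleep(2)

print("Query finished with state:", status)
```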
Everyone talks about data quality, as they should. Our research shows that improving the quality of information is the top benefit of data preparation activities. Data quality efforts are focused on clean data. Yes, clean data is important, but so is bad data.
Talend is a data integration and management software company that offers applications for cloud computing, big data integration, application integration, data quality and master data management.
Why should you integrate data governance (DG) and enterprise architecture (EA)? Data governance provides time-sensitive, current-state architecture information with a high level of quality.
So if you're going to move your data from on-premises legacy data stores and warehouse systems to the cloud, you should do it right the first time. That means your cloud data assets must be available for use by the right people for the right purposes to maximize their security, quality and value.
Amazon DataZone now supports authentication through the Amazon Athena JDBC driver, allowing data users to seamlessly query their subscribed data lake assets via popular business intelligence (BI) and analytics tools like Tableau, Power BI, Excel, SQL Workbench, DBeaver, and more.
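The post concerns the Athena JDBC driver used by BI tools; as a rough programmatic analogue, here is a minimal sketch using the PyAthena library, assuming a DataZone-provisioned Athena workgroup. The workgroup, staging bucket, and table names are invented for illustration.

```python
# Minimal sketch, assuming the PyAthena package (pip install pyathena) and
# AWS credentials with access to the subscribed assets.
from pyathena import connect

conn = connect(
    region_name="us-east-1",
    work_group="datazone-consumer-workgroup",  # hypothetical DataZone workgroup
    s3_staging_dir="s3://example-bucket/athena-staging/",
)

cursor = conn.cursor()
cursor.execute("SELECT * FROM subscribed_db.sales_assets LIMIT 10")  # placeholder names
for row in cursor.fetchall():
    print(row)
```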
Under that focus, Informatica's conference emphasized capabilities across six areas (all strong areas for Informatica): data integration, data management, data quality & governance, master data management (MDM), data cataloging, and data security.
Organizations are dealing with exponentially increasing data that ranges broadly from customer-generated information and financial transactions to edge-generated data and even operational IT server logs. A combination of complex data lake and data warehouse capabilities is required to leverage this data.
Organizations are accelerating their digital transformation and looking for innovative ways to engage with customers in this new digital era of data management. The challenge is to ensure that processes, applications and data can still be integrated across cloud and on-premises systems.
However, the initial version of CDH supported only coarse-grained access control to entire data assets, and hence it was not possible to scope access to data asset subsets. This led to inefficiencies in data governance and access control.
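The excerpt does not name the underlying stack, but as one illustration of fine-grained (rather than whole-asset) access control on AWS, the sketch below uses AWS Lake Formation column-level grants via boto3. The role ARN, database, table, and column names are all hypothetical.

```python
# Hedged sketch: column-level access with AWS Lake Formation via boto3.
import boto3

lf = boto3.client("lakeformation", region_name="us-east-1")

lf.grant_permissions(
    Principal={
        "DataLakePrincipalIdentifier": "arn:aws:iam::123456789012:role/AnalystRole"
    },
    Resource={
        "TableWithColumns": {
            "DatabaseName": "sales_db",
            "Name": "orders",
            # SELECT is scoped to a subset of columns, not the entire asset.
            "ColumnNames": ["order_id", "order_ts", "region"],
        }
    },
    Permissions=["SELECT"],
)
```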
Amazon Redshift has established itself as a highly scalable, fully managed cloud data warehouse trusted by tens of thousands of customers for its superior price-performance and advanced data analytics capabilities. This allows you to maintain a comprehensive view of your data while optimizing for cost-efficiency.
Their terminal operations rely heavily on seamless data flows and the management of vast volumes of data. With the addition of these technologies alongside existing systems like terminal operating systems (TOS) and SAP, the number of data producers has grown substantially.
Unlocking the true value of data often gets impeded by siloed information. Traditional data management, wherein each business unit ingests raw data into separate data lakes or warehouses, hinders visibility and cross-functional analysis. A shared approach, by contrast, lets business units access clean, standardized data.
In today’s rapidly evolving digital landscape, enterprises across regulated industries face a critical challenge as they navigate their digital transformation journeys: effectively managing and governing data from legacy systems that are being phased out or replaced. We have created two groups: Data Engineering and Auditor.
However, many companies today still struggle to effectively harness and use their data due to challenges such as data silos, lack of discoverability, poor data quality, and a lack of data literacy and analytical capabilities to quickly access and use data across the organization.
A modern data architecture enables companies to ingest virtually any type of data through automated pipelines into a data lake, which provides highly durable and cost-effective object storage at petabyte or exabyte scale, with support for open table formats such as Apache Iceberg 1.2.0 and Delta Lake 2.3.0.
Amazon DataZone is a data management service that makes it faster and easier for customers to catalog, discover, share, and govern data stored across AWS, on premises, and from third-party sources. When you’re connected, you can query, visualize, and share data, governed by Amazon DataZone, within Tableau.
This would be a straightforward task were it not for the fact that, in the digital era, there has been an explosion of data, collected and stored everywhere, much of it poorly governed, ill-understood, and irrelevant. Further, data management activities don’t end once the AI model has been developed.
Over the years, organizations have invested in creating purpose-built, cloud-based data lakes that are siloed from one another. A major challenge is enabling cross-organization discovery and access to data across these multiple data lakes, each built on different technology stacks.
The Regulatory Rationale for Integrating Data Management & Data Governance. Now, as Cybersecurity Awareness Month comes to a close, and ghosts and goblins roam the streets, we thought it a good time to resurrect some guidance on how data governance can make data security less scary.
Data is your generative AI differentiator, and a successful generative AI implementation depends on a robust data strategy incorporating a comprehensive data governance approach. Data governance is a critical building block across all these approaches, and we see two emerging areas of focus.
Since the deluge of big data over a decade ago, many organizations have learned to build applications to process and analyze petabytes of data. Data lakes have served as a central repository to store structured and unstructured data at any scale and in various formats.
According to Kari Briski, VP of AI models, software, and services at Nvidia, successfully implementing gen AI hinges on effective data management and evaluating how different models work together to serve a specific use case. Data management, when done poorly, results in both diminished returns and extra costs.
Data fabric refers to technology products that can be used to integrate, manage and govern data across distributed environments, supporting the cultural and organizational data ownership and access goals of data mesh.
Organizations are managing more data than ever. With more companies increasingly migrating their data to the cloud to ensure availability and scalability, the risks associated with data management and protection also are growing. Data Security Starts with Data Governance.
In the book, author Zhamak Dehghani reveals that, despite the time, money, and effort poured into them, data warehouses and data lakes fail when applied at the scale and speed of today’s organizations. A distributed data mesh is a better choice. Disrupting Data Governance: A Call to Action, by Laura B.
In this post, we delve into the key aspects of using Amazon EMR for modern data management, covering topics such as data governance, data mesh deployment, and streamlined data discovery. Organizations have multiple Hive data warehouses across EMR clusters, where the metadata gets generated.
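As a small illustration of streamlined data discovery across Hive metadata, here is a hedged sketch that walks the AWS Glue Data Catalog (a common Hive-compatible metastore for EMR) with boto3; the region and catalog contents are assumptions.

```python
# Minimal sketch, assuming the EMR clusters register their Hive metadata in
# the AWS Glue Data Catalog; the region is a placeholder.
import boto3

glue = boto3.client("glue", region_name="us-east-1")

# Enumerate every database and table the catalog knows about.
for db_page in glue.get_paginator("get_databases").paginate():
    for db in db_page["DatabaseList"]:
        for tbl_page in glue.get_paginator("get_tables").paginate(
            DatabaseName=db["Name"]
        ):
            for table in tbl_page["TableList"]:
                location = table.get("StorageDescriptor", {}).get("Location", "?")
                print(f'{db["Name"]}.{table["Name"]} -> {location}')
```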
But collecting data is only half of the equation. As the data grows, it becomes challenging to find the right data at the right time. Many organizations can’t take full advantage of their data lakes because they don’t know what data actually exists.
In today’s data-driven world, organizations face unprecedented challenges in managing and extracting valuable insights from their ever-expanding data ecosystems. As the number of data assets and users grows, the traditional approaches to data management and governance are no longer sufficient.
This past year witnessed a data governance awakening, or as the Wall Street Journal called it, a “global data governance reckoning.” There was tremendous data drama and resulting trauma, from Facebook to Equifax and from Yahoo to Marriott. So what’s on the horizon for data governance in the year ahead?
Leading companies like Cisco, Nielsen, and Finnair turn to Alation + Snowflake for data governance and analytics. By joining forces, we can build more potent, tailored solutions that leverage data governance as a competitive asset. Lastly, active data governance simplifies stewardship tasks of all kinds.
Data governance is the process of ensuring the integrity, availability, usability, and security of an organization’s data. Due to the volume, velocity, and variety of data being ingested in data lakes, it can get challenging to develop and maintain policies and procedures to ensure data governance at scale for your data lake.
Data governance as a concept and practice has been around for as long as data management has been around. It is, however, gaining prominence and interest in recent years due to the increasing volume of data that needs to be managed.
To avoid the inevitable, CIOs must get serious about data management. Data, of course, has been all the rage the past decade, having been declared the “new oil” of the digital economy. Still, to truly create lasting value with data, organizations must develop data management mastery.
Recognizing this paradigm shift, ANZ Institutional Division has embarked on a transformative journey to redefine its approach to managing and utilizing data and to extracting significant business value from data insights. This enables global discoverability and collaboration without centralizing ownership or operations.
To address the flood of data and the needs of enterprise businesses to store, sort, and analyze that data, a new storage solution has evolved: the data lake. What’s in a Data Lake? Data warehouses do a great job of standardizing data from disparate sources for analysis. Taking a Dip.
Ventana Research recently announced its 2020 research agenda for data, continuing the guidance we’ve offered for nearly two decades to help organizations derive optimal value and improve business outcomes. Data volumes continue to grow while data latency requirements continue to shrink.
With hackers now working overtime to expose business data or implant ransomware processes, data security is largely IT managers’ top priority. And if data security tops IT concerns, data governance should be their second priority. Effective data governance must extend beyond the IT organization.
Data-driven organizations treat data as an asset and use it across different lines of business (LOBs) to drive timely insights and better business decisions. This leads to having data across many instances of data warehouses and data lakes using a modern data architecture in separate AWS accounts.
Building a data lake on Amazon Simple Storage Service (Amazon S3) provides numerous benefits for an organization. However, many use cases, like performing change data capture (CDC) from an upstream relational database to an Amazon S3-based data lake, require handling data at a record level.
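As one way to handle record-level CDC in an S3 data lake, here is a minimal sketch that applies staged change records to an Apache Iceberg table with an Athena MERGE INTO statement (supported in Athena engine v3). The databases, tables, and bucket are placeholders, not details from the post.

```python
# Minimal sketch, assuming the lake tables use Apache Iceberg so that Athena
# (engine v3) can run MERGE INTO, and that CDC rows land in a staging table
# with an 'op' column marking deletes. All names are placeholders.
import boto3

athena = boto3.client("athena", region_name="us-east-1")

MERGE_SQL = """
MERGE INTO lake_db.customers AS t
USING lake_db.customers_cdc_staging AS s
  ON t.customer_id = s.customer_id
WHEN MATCHED AND s.op = 'D' THEN DELETE
WHEN MATCHED THEN UPDATE SET email = s.email, updated_at = s.updated_at
WHEN NOT MATCHED THEN
  INSERT (customer_id, email, updated_at)
  VALUES (s.customer_id, s.email, s.updated_at)
"""

athena.start_query_execution(
    QueryString=MERGE_SQL,
    ResultConfiguration={"OutputLocation": "s3://example-bucket/athena-results/"},
)
```

A single MERGE pass covers inserts, updates, and deletes, which is what makes record-level CDC practical on object storage.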