Analytics, Data Governance and Data Lake

Data Lakes Meet Data Warehouses

David Menninger's Analyst Perspectives

MAY 7, 2020

In this analyst perspective, Dave Menninger takes a look at data lakes. He explains the term “data lake,” describes common use cases and shares his views on some of the latest market trends. He explores the relationship between data warehouses and data lakes and shares some of Ventana Research’s findings on the subject.

Data Lake

Data Lake Data Warehouse Risk Marketing

Good Data Governance Improves Business Processes

David Menninger's Analyst Perspectives

FEBRUARY 17, 2022

Many organizations invest in data governance out of concern over misuse of data or potential data breaches. These are important considerations and valid aspects of data governance programs. However, good data governance also has positive impacts on organizations.

Data Governance

Data Governance Data Lake Analytics Technology

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

Collibra Brings Effective Data Governance to Line-of-Business

David Menninger's Analyst Perspectives

SEPTEMBER 28, 2021

Collibra is a data governance software company that offers tools for metadata management and data cataloging. The software enables organizations to find data quickly, identify its source and assure its integrity.

Data Governance

Data Governance Metadata Software Management

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

AWS Big Data

NOVEMBER 22, 2024

At AWS, we are committed to empowering organizations with tools that streamline data analytics and transformation processes. This integration enables data teams to efficiently transform and manage data using Athena with dbt Cloud’s robust features, enhancing the overall data workflow experience.

Data Lake

Data Lake Data Warehouse Cost-Benefit Data Transformation

Top Considerations for Building an Open Cloud Data Lake

Increasingly, enterprises are leveraging cloud data lakes as the platform used to store data for analytics, combined with various compute engines for processing that data. Read this paper to learn about: The value of cloud data lakes as the new system of record.

Data Lake

Streamline AI-driven analytics with governance: Integrating Tableau with Amazon DataZone

AWS Big Data

OCTOBER 30, 2024

With this integration, you can now seamlessly query your governed data lake assets in Amazon DataZone using popular business intelligence (BI) and analytics tools, including partner solutions like Tableau. When you’re connected, you can query, visualize, and share data—governed by Amazon DataZone—within Tableau.

Analytics

Analytics Visualization Data Governance Data-driven

Unleash deeper insights with Amazon Redshift data sharing for data lake tables

AWS Big Data

OCTOBER 10, 2024

Amazon Redshift has established itself as a highly scalable, fully managed cloud data warehouse trusted by tens of thousands of customers for its superior price-performance and advanced data analytics capabilities. This allows you to maintain a comprehensive view of your data while optimizing for cost-efficiency.

Data Lake

Data Lake Data Warehouse Recreation/Entertainment Data-driven

Expanding data analysis and visualization options: Amazon DataZone now integrates with Tableau, Power BI, and more

AWS Big Data

OCTOBER 30, 2024

Amazon DataZone has launched authentication support through the Amazon Athena JDBC driver, allowing data users to seamlessly query their subscribed data lake assets via popular business intelligence (BI) and analytics tools like Tableau, Power BI, Excel, SQL Workbench, DBeaver, and more.

Visualization

Visualization Data Lake Testing Data Governance

How EUROGATE established a data mesh architecture using Amazon DataZone

AWS Big Data

JANUARY 15, 2025

For container terminal operators, data-driven decision-making and efficient data sharing are vital to optimizing operations and boosting supply chain efficiency. Enhance agility by localizing changes within business domains and clear data contracts. Eliminate centralized bottlenecks and complex data pipelines.

IoT

IoT Machine Learning Metadata Data-driven

How BMW streamlined data access using AWS Lake Formation fine-grained access control

AWS Big Data

OCTOBER 29, 2024

The CDH is used to create, discover, and consume data products through a central metadata catalog, while enforcing permission policies and tightly integrating data engineering, analytics, and machine learning services to streamline the user journey from data to insight.

Data Lake

Data Lake Sales Metadata Machine Learning

Seamless integration of data lake and data warehouse using Amazon Redshift Spectrum and Amazon DataZone

AWS Big Data

AUGUST 15, 2024

Unlocking the true value of data often gets impeded by siloed information. Traditional data management—wherein each business unit ingests raw data in separate data lakes or warehouses—hinders visibility and cross-functional analysis. Amazon DataZone natively supports data sharing for Amazon Redshift data assets.

Data Lake

Data Lake Data Warehouse Data Governance Publishing

Cloudera Consolidates Its Data Platform

David Menninger's Analyst Perspectives

JANUARY 22, 2021

Organizations are dealing with exponentially increasing data that ranges broadly from customer-generated information and financial transactions to edge-generated data and even operational IT server logs. A combination of complex data lake and data warehouse capabilities is required to leverage this data.

Data Lake

Data Lake IT Data Warehouse Data Governance

Data governance in the age of generative AI

AWS Big Data

FEBRUARY 29, 2024

Data is your generative AI differentiator, and a successful generative AI implementation depends on a robust data strategy incorporating a comprehensive data governance approach. Data governance is a critical building block across all these approaches, and we see two emerging areas of focus.

Data Governance

Data Governance Unstructured Data Metadata Data Lake

Choosing an open table format for your transactional data lake on AWS

AWS Big Data

JUNE 9, 2023

A modern data architecture enables companies to ingest virtually any type of data through automated pipelines into a data lake, which provides highly durable and cost-effective object storage at petabyte or exabyte scale.

Data Lake

Data Lake Metadata Statistics Optimization

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore federation

AWS Big Data

JUNE 10, 2024

In this post, we delve into the key aspects of using Amazon EMR for modern data management, covering topics such as data governance, data mesh deployment, and streamlined data discovery. Organizations have multiple Hive data warehouses across EMR clusters, where the metadata gets generated.

Data Lake

Data Lake Metadata Data Warehouse Data Processing

HEMA accelerates their data governance journey with Amazon DataZone

AWS Big Data

DECEMBER 19, 2024

However, many companies today still struggle to effectively harness and use their data due to challenges such as data silos, lack of discoverability, poor data quality, and a lack of data literacy and analytical capabilities to quickly access and use data across the organization.

Data Governance

Data Governance Publishing Data-driven Metadata

Incorta Streamlines Analytics with Direct Data Access

David Menninger's Analyst Perspectives

APRIL 21, 2021

The amount of data flowing into organizations is growing exponentially, creating a need to process more data more quickly than ever before. Our Data Preparation Benchmark Research shows that accessing and preparing data continues to be the most time-consuming part of making data available for analysis.

Analytics

Analytics Technology Data Lake Data Governance

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

AWS Big Data

MARCH 10, 2023

Since the deluge of big data over a decade ago, many organizations have learned to build applications to process and analyze petabytes of data. Data lakes have served as a central repository to store structured and unstructured data at any scale and in various formats.

Data Lake

Data Lake Sales Data Warehouse Snapshot

How Volkswagen streamlined access to data across multiple data lakes using Amazon DataZone – Part 1

AWS Big Data

JULY 18, 2024

Over the years, organizations have invested in creating purpose-built, cloud-based data lakes that are siloed from one another. A major challenge is enabling cross-organization discovery and access to data across these multiple data lakes, each built on different technology stacks.

Data Lake

Data Lake Publishing Metadata Data-driven

What is data architecture? A framework to manage data

CIO Business Intelligence

DECEMBER 20, 2024

Beyond breaking down silos, modern data architectures need to provide interfaces that make it easy for users to consume data using tools fit for their jobs. Data must be able to freely move to and from data warehouses, data lakes, and data marts, and interfaces must make it easy for users to consume that data.

Data Architecture

Data Architecture Management Consulting Internet of Things

Enhancing Data Catalog with AI

David Menninger's Analyst Perspectives

SEPTEMBER 22, 2022

Organizations are collecting data from multiple data sources and a variety of systems to enrich their analytics and business intelligence (BI). But collecting data is only half of the equation. As the data grows, it becomes challenging to find the right data at the right time.

Data Lake

Data Lake Business Intelligence Analytics IT

How Data Governance Protects Sensitive Data

erwin

APRIL 2, 2021

How can companies protect their enterprise data assets and keep them available to stewards and consumers, while minimizing costs and meeting data privacy requirements? Data Security Starts with Data Governance. Lack of a solid data governance foundation increases the risk of data-security incidents.

Data Governance

Data Governance Cost-Benefit Metadata Risk

Apply enterprise data governance and management using AWS Lake Formation and AWS IAM Identity Center

AWS Big Data

SEPTEMBER 26, 2024

In this blog post, there are three personas: Data Lake Administrator (with admin level access), User Silver from the Data Engineering group, and User Lead Auditor from the Auditor group. You will see how different personas in an organization can access the data without the need to modify their existing enterprise entitlements.

Data Governance

Data Governance Enterprise Management Data Lake

2021 Gift Giving Guide for Data Nerds

DataKitchen

DECEMBER 7, 2021

This book is not available until January 2022, but considering all the hype around the data mesh, we expect it to be a best seller. In the book, author Zhamak Dehghani reveals that, despite the time, money, and effort poured into them, data warehouses and data lakes fail when applied at the scale and speed of today’s organizations.

Data-driven

Data-driven Data Governance Big Data Data Science

Top analytics announcements of AWS re:Invent 2024

AWS Big Data

FEBRUARY 26, 2025

Analytics remained one of the key focus areas this year, with significant updates and innovations aimed at helping businesses harness their data more efficiently and accelerate insights. This zero-ETL integration reduces the complexity and operational burden of data replication to let you focus on deriving insights from your data.

Analytics

Analytics Data Lake Metadata Data Warehouse

Microsoft Azure: Cloud Computing for Data and Analytics

David Menninger's Analyst Perspectives

MARCH 17, 2021

Organizations are increasingly using data as a strategic asset, which makes data services critical. Huge volumes of data need to be stored, managed, discovered and analyzed. Cloud computing and storage approaches provide enterprises with various capabilities to store and process their data in third-party data centers.

Analytics

Analytics Enterprise Management Data Lake

Automated data governance with AWS Glue Data Quality, sensitive data detection, and AWS Lake Formation

AWS Big Data

OCTOBER 10, 2023

Data governance is the process of ensuring the integrity, availability, usability, and security of an organization’s data. Due to the volume, velocity, and variety of data being ingested in data lakes, it can get challenging to develop and maintain policies and procedures to ensure data governance at scale for your data lake.

Data Quality

Data Quality Data Governance Data Lake Testing

Thank You Snowflake for Naming Alation the Data Governance Partner of the Year

Alation

JUNE 17, 2021

Leading companies like Cisco, Nielsen, and Finnair turn to Alation + Snowflake for data governance and analytics. By joining forces, we can build more potent, tailored solutions that leverage data governance as a competitive asset. Lastly, active data governance simplifies stewardship tasks of all kinds.

Data Governance

Data Governance Data Lake Insurance Enterprise

Building end-to-end data lineage for one-time and complex queries using Amazon Athena, Amazon Redshift, Amazon Neptune and dbt

AWS Big Data

DECEMBER 12, 2024

One-time and complex queries are two common scenarios in enterprise data analytics. Complex queries, on the other hand, refer to large-scale data processing and in-depth analysis based on petabyte-level data warehouses in massive data scenarios.

Snapshot

Snapshot Recreation/Entertainment Experimentation Data Lake

Emerging Data Platforms Tackle Big Challenges

David Menninger's Analyst Perspectives

JANUARY 14, 2021

But unlocking value from data requires multiple analytics workloads, data science tools and machine learning algorithms to run against the same diverse data sets. In our ongoing benchmark research project, we are researching the ways in which organizations work with big data and the challenges they face.

Machine Learning

Machine Learning Data Governance Big Data Data Science

Data Lakes: What Are They and Who Needs Them?

Jet Global

JULY 2, 2019

To address the flood of data and the needs of enterprise businesses to store, sort, and analyze that data, a new storage solution has evolved: the data lake. What’s in a Data Lake? Data warehouses do a great job of standardizing data from disparate sources for analysis. Taking a Dip.

Data Lake

Data Lake Data Warehouse Big Data Machine Learning

How ANZ Institutional Division built a federated data platform to enable their domain teams to build data products to support business outcomes

AWS Big Data

DECEMBER 4, 2024

Under the federated mesh architecture, each divisional mesh functions as a node within the broader enterprise data mesh, maintaining a degree of autonomy in managing its data products. These nodes can implement analytical platforms like data lake houses, data warehouses, or data marts, all united by producing data products.

Metadata

Metadata Data Governance Data Quality Data-driven

Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena

AWS Big Data

APRIL 24, 2023

Building a data lake on Amazon Simple Storage Service (Amazon S3) provides numerous benefits for an organization. However, many use cases, like performing change data capture (CDC) from an upstream relational database to an Amazon S3-based data lake, require handling data at a record level.

Data Lake

Data Lake Data Governance Machine Learning Cost-Benefit

TIBCO Broadens Portfolio for Improved Analytics Efficiency

David Menninger's Analyst Perspectives

NOVEMBER 30, 2021

TIBCO is a large, independent cloud-computing and data analytics software company that offers integration, analytics, business intelligence and events processing software. It enables organizations to analyze streaming data in real time and provides the capability to automate analytics processes.

Analytics

Analytics Data Warehouse Business Intelligence Software

Implement tag-based access control for your data lake and Amazon Redshift data sharing with AWS Lake Formation

AWS Big Data

JULY 21, 2023

Data-driven organizations treat data as an asset and use it across different lines of business (LOBs) to drive timely insights and better business decisions. This leads to having data across many instances of data warehouses and data lakes using a modern data architecture in separate AWS accounts.

Data Lake

Data Lake Data Warehouse Marketing Management

Pillars of Knowledge, Best Practices for Data Governance

Cloudera

AUGUST 4, 2021

And if data security tops IT concerns, data governance should be their second priority. Not only is it critical to protect data, but data governance is also the foundation for data-driven businesses and maximizing value from data analytics. But it’s still not easy.

Data Governance

Data Governance Metadata Data-driven Enterprise

Your guide to AWS Analytics at AWS re:Invent 2023

AWS Big Data

NOVEMBER 13, 2023

Join the AWS Analytics team at AWS re:Invent this year, where new ideas and exciting innovations come together. For those in the data world, this post provides a curated guide for all analytics sessions that you can use to quickly schedule and build your itinerary. A shapeshifting guardian and protector of data like Data Lynx?

Analytics

Analytics Data Lake Data Warehouse Data-driven

Your 5-Step Journey from Analytics to AI

CIO Business Intelligence

MARCH 22, 2022

How do you introduce AI into your data and analytics infrastructure? To companies entrenched in decades-old business and IT processes, data fiefdoms, and legacy systems, the task may seem insurmountable. Which type(s) of storage consolidation you use depends on the data you generate and collect. Outcomes you can expect.

Analytics

Analytics Key Performance Indicator Data Warehouse Data-driven

Foundational blocks of Amazon SageMaker Unified Studio: An admin’s guide to implement unified access to all your data, analytics, and AI

AWS Big Data

FEBRUARY 13, 2025

Amazon SageMaker Unified Studio (preview) provides a unified experience for using data, analytics, and AI capabilities. You can use familiar AWS services for model development, generative AI, data processing, and analytics, all within a single, governed environment.

Data Analytics

Data Analytics Analytics Modeling Data-driven

AWS Lake Formation 2023 year in review

AWS Big Data

JANUARY 18, 2024

AWS Lake Formation and the AWS Glue Data Catalog form an integral part of a data governance solution for data lakes built on Amazon Simple Storage Service (Amazon S3) with multiple AWS analytics services integrating with them. We realized that your use cases need more flexibility in data governance.

Data Lake

Data Lake Metadata Data Governance Statistics

Monitoring Apache Iceberg metadata layer using AWS Lambda, AWS Glue, and AWS CloudWatch

AWS Big Data

JULY 29, 2024

In the era of big data, data lakes have emerged as a cornerstone for storing vast amounts of raw data in its native format. They support structured, semi-structured, and unstructured data, offering a flexible and scalable environment for data ingestion from multiple sources.

Metadata

Metadata Snapshot Data Lake Metrics

How Data Governance Supports Analytics

Alation

JANUARY 27, 2022

How do businesses transform raw data into competitive insights? Data analytics. Modern businesses are increasingly leveraging analytics for a range of use cases. Analytics can help a business improve customer relationships, optimize advertising campaigns, develop new products, and much more. What is Data Analytics?

Data Governance

Data Governance Analytics Cost-Benefit Data-driven

Cross-account data collaboration with Amazon DataZone and AWS analytical tools

AWS Big Data

MARCH 5, 2025

This post describes the process of using the business data catalog resource of Amazon DataZone to publish data assets so they’re discoverable by other accounts. In this solution (as shown in the preceding figure), the AWS account that contains the data assets is referred to as the producer account.

Analytics

Analytics Publishing Metadata Sales

Cloudera and Snowflake Partner to Deliver the Most Comprehensive Open Data Lakehouse

Cloudera

OCTOBER 23, 2024

In August, we wrote about how in a future where distributed data architectures are inevitable, unifying and managing operational and business metadata is critical to successfully maximizing the value of data, analytics, and AI.

Metadata

Metadata Data Lake Dashboards Interactive
