Big Data, Data Lake and Data Warehouse

Data Lakes Meet Data Warehouses

David Menninger's Analyst Perspectives

MAY 7, 2020

In this analyst perspective, Dave Menninger takes a look at data lakes. He explains the term “data lake,” describes common use cases and shares his views on some of the latest market trends. He explores the relationship between data warehouses and data lakes and shares some of Ventana Research’s findings on the subject.

A Comprehensive Guide to Data Lake vs. Data Warehouse

Webinars

Trending Sources

Incremental refresh for Amazon Redshift materialized views on data lake tables

Differentiating Between Data Lakes and Data Warehouses

Top Considerations for Building an Open Cloud Data Lake

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

Understanding the Differences Between Data Lakes and Data Warehouses

Load data incrementally from transactional data lakes to data warehouses

Seamless integration of data lake and data warehouse using Amazon Redshift Spectrum and Amazon DataZone

Data Lakes vs. Data Warehouses

Recap of Amazon Redshift key product announcements in 2024

Delta Lake: A Comprehensive Introduction

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

Simplify data ingestion from Amazon S3 to Amazon Redshift using auto-copy

Talend Data Fabric Simplifies Data Life Cycle Management

Accelerate analytics and AI innovation with the next generation of Amazon SageMaker

Run Apache XTable in AWS Lambda for background conversion of open table formats

Unleash deeper insights with Amazon Redshift data sharing for data lake tables

What's the difference between data lakes and data warehouses?

Accelerate SQL code migration from Google BigQuery to Amazon Redshift using BladeBridge

Important Considerations When Migrating to a Data Lake

Query your Iceberg tables in data lake using Amazon Redshift (Preview)

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore federation

Accelerate Amazon Redshift Data Lake queries with AWS Glue Data Catalog Column Statistics

A Comprehensive Guide on Delta Lake

Migrate a petabyte-scale data warehouse from Actian Vectorwise to Amazon Redshift

Expand data access through Apache Iceberg using Delta Lake UniForm on AWS

Understanding Apache Iceberg on AWS with the new technical guide

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

Use Apache Iceberg in a data lake to support incremental data processing

Choosing an open table format for your transactional data lake on AWS

Complexity Drives Costs: A Look Inside BYOD and Azure Data Lakes

Build an Amazon Redshift data warehouse using an Amazon DynamoDB single-table design

Amazon Q data integration adds DataFrame support and in-prompt context-aware job creation

Synchronize data lakes with CDC-based UPSERT using open table format, AWS Glue, and Amazon MSK

Amazon Web Services named a Leader in the 2024 Gartner Magic Quadrant for Data Integration Tools

Ingest data from Google Analytics 4 and Google Sheets to Amazon Redshift using Amazon AppFlow

2021 Gift Giving Guide for Data Nerds

Enriching metadata for accurate text-to-SQL generation for Amazon Athena

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

Enable business users to analyze large datasets in your data lake with Amazon QuickSight
