Data Lake, Data Warehouse and Metadata

Seamless integration of data lake and data warehouse using Amazon Redshift Spectrum and Amazon DataZone

AWS Big Data

AUGUST 15, 2024

Unlocking the true value of data often gets impeded by siloed information. Traditional data management—wherein each business unit ingests raw data in separate data lakes or warehouses—hinders visibility and cross-functional analysis. Business units access clean, standardized data.

Seamless integration of data lake and data warehouse using Amazon Redshift Spectrum and Amazon DataZone

Understanding the Differences Between Data Lakes and Data Warehouses

Webinars

Trending Sources

Accelerate Amazon Redshift Data Lake queries with AWS Glue Data Catalog Column Statistics

Webinars

Enriching metadata for accurate text-to-SQL generation for Amazon Athena

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

The Data Lakehouse: Blending Data Warehouses and Data Lakes

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

Query your Iceberg tables in data lake using Amazon Redshift (Preview)

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore federation

The Increasing Importance of Open Table Formats

Use Apache Iceberg in a data lake to support incremental data processing

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

Choosing an open table format for your transactional data lake on AWS

The Data Warehouse is Dead, Long Live the Data Warehouse, Part I

Improve operational efficiencies of Apache Iceberg tables built on Amazon S3 data lakes

How Morningstar used tag-based access controls in AWS Lake Formation to manage permissions for an Amazon Redshift data warehouse

Data Swamp, Data Lake, Data Lakehouse: What to Know

Manage your data warehouse cost allocations with Amazon Redshift Serverless tagging

Achieve the best price-performance in Amazon Redshift with elastic histograms for selectivity estimation

Implement tag-based access control for your data lake and Amazon Redshift data sharing with AWS Lake Formation

Build a real-time GDPR-aligned Apache Iceberg data lake

Salesforce debuts Zero Copy Partner Network to ease data integration

Open Data Lakehouse powered by Iceberg for all your Data Warehouse needs

Cloudera Data Warehouse Demonstrates Best-in-Class Cloud-Native Price-Performance

Perform data parity at scale for data modernization programs using AWS Glue Data Quality

How Ruparupa gained updated insights with an Amazon S3 data lake, AWS Glue, Apache Hudi, and Amazon QuickSight

Data Lakes: What Are They and Who Needs Them?

Amazon Redshift announcements at AWS re:Invent 2023 to enable analytics on all your data

Expand data access through Apache Iceberg using Delta Lake UniForm on AWS

What is an open data lakehouse and why you should care?

Cloudera Lakehouse Optimizer Makes it Easier Than Ever to Deliver High-Performance Iceberg Tables

Data governance in the age of generative AI

How Knowledge Graphs Power Data Mesh and Data Fabric

The Future of the Data Lakehouse – Open

The Future of the Data Lakehouse – Open

Building a Beautiful Data Lakehouse

Write queries faster with Amazon Q generative SQL for Amazon Redshift

Access Amazon Redshift data from Salesforce Data Cloud with Zero Copy Data Federation

Introducing Apache Hudi support with AWS Glue crawlers

Achieve your AI goals with an open data lakehouse approach

Governing data in relational databases using Amazon DataZone

The Security Challenges of Data Warehousing in the Cloud

Amazon DataZone announces custom blueprints for AWS services

Stay Connected