This article was published as a part of the Data Science Blogathon. Introduction: A data lake is a centralized repository for storing, processing, and securing massive amounts of structured, semi-structured, and unstructured data. It can store data in its native format and process any type of data, regardless of size.
A data lake is a centralized repository designed to house big data in structured, semi-structured and unstructured form. I have been covering the data lake topic for several years and encourage you to check out an earlier perspective called Data Lakes: Safe Way to Swim in Big Data?
Data collection is critical for businesses to make informed decisions, understand customers’ […]. Data can represent facts, figures, and other information that we use to make decisions. The post Data Lake or Data Warehouse: Which is Better? appeared first on Analytics Vidhya.
In this analyst perspective, Dave Menninger takes a look at data lakes. He explains the term “data lake,” describes common use cases and shares his views on some of the latest market trends. He explores the relationship between data warehouses and data lakes and shares some of Ventana Research’s findings on the subject.
Fortunately, a next-gen data architecture enabled by the Dremio data lake service removes the need for replicated data, helping organizations to minimize complexity, boost efficiency and dramatically reduce costs. Read this whitepaper to learn: Why organizations frequently end up with unnecessary data copies.
This is part two of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake (Apache Iceberg) using AWS Glue. source_s3_bucket – The raw S3 bucket name.
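The excerpt's truncated sample code referenced Iceberg's S3FileIO. As a minimal sketch of what such a setup commonly looks like in a Glue PySpark job: the snippet below configures an Iceberg catalog backed by the AWS Glue Data Catalog and S3FileIO. The bucket name, catalog alias (glue_catalog), and table names are illustrative assumptions, not values from the original post.

```python
# Hypothetical sketch: Spark session for a transactional Iceberg data lake,
# backed by the AWS Glue Data Catalog and Iceberg's S3FileIO.
from pyspark.sql import SparkSession

source_s3_bucket = "my-raw-bucket"  # the raw S3 bucket name (placeholder)
warehouse_path = f"s3://{source_s3_bucket}/iceberg-warehouse/"

spark = (
    SparkSession.builder
    .config("spark.sql.catalog.glue_catalog", "org.apache.iceberg.spark.SparkCatalog")
    .config("spark.sql.catalog.glue_catalog.warehouse", warehouse_path)
    .config("spark.sql.catalog.glue_catalog.catalog-impl",
            "org.apache.iceberg.aws.glue.GlueCatalog")
    .config("spark.sql.catalog.glue_catalog.io-impl",
            "org.apache.iceberg.aws.s3.S3FileIO")
    .getOrCreate()
)

# Read from the raw zone and append into an existing Iceberg table
# (assumed to have been created elsewhere in the pipeline).
df = spark.read.parquet(f"s3://{source_s3_bucket}/raw/orders/")
df.writeTo("glue_catalog.sales.orders").append()
```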
Introduction: The volume of data is growing rapidly, and tons of data points are produced every second. Businesses are now looking for different types of data storage to store and manage their data effectively.
Amazon Redshift is a fast, fully managed cloud data warehouse that makes it cost-effective to analyze your data using standard SQL and business intelligence tools. Customers use data lake tables to achieve cost-effective storage and interoperability with other tools. The sample files are ‘|’-delimited text files.
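As a rough sketch of loading such ‘|’-delimited files into Redshift, the snippet below issues a COPY statement through the Redshift Data API. The cluster, database, table, bucket, and IAM role names are all placeholders, not values from the original post.

```python
# Hypothetical sketch: load '|'-delimited text files from S3 into Redshift
# via a COPY statement issued through the Redshift Data API (boto3).
import boto3

client = boto3.client("redshift-data", region_name="us-east-1")

copy_sql = """
    COPY sales_staging
    FROM 's3://my-sample-bucket/tickit/'
    IAM_ROLE 'arn:aws:iam::123456789012:role/MyRedshiftRole'
    DELIMITER '|';
"""

client.execute_statement(
    ClusterIdentifier="my-cluster",  # placeholder cluster name
    Database="dev",
    DbUser="awsuser",
    Sql=copy_sql,
)
```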
This article was published as a part of the Data Science Blogathon. Introduction: A data lake is a central data repository that allows us to store all of our structured and unstructured data on a large scale. The post A Detailed Introduction on Data Lakes and Delta Lakes appeared first on Analytics Vidhya.
Data architectures to support reporting, business intelligence, and analytics have evolved dramatically over the past 10 years. Download this TDWI Checklist report to understand: How your organization can make this transition to a modernized data architecture.
Image Source: GitHub. Table of Contents: What is Data Engineering?, Components of Data Engineering, Object Storage, Object Storage MinIO, Install Object Storage MinIO, Data Lake with Buckets Demo, Data Lake Management, Conclusion, References. What is Data Engineering?
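For the MinIO demo the excerpt's table of contents mentions, a minimal sketch using the official MinIO Python SDK is shown below: it creates a bucket and uploads an object. The endpoint and credentials are the defaults of a local demo deployment, and the bucket and object names are assumptions.

```python
# Hypothetical sketch: create a data lake bucket in MinIO object storage
# and upload a file, using the official MinIO Python SDK.
from minio import Minio

client = Minio(
    "localhost:9000",        # local MinIO endpoint (demo default)
    access_key="minioadmin",  # demo default credentials
    secret_key="minioadmin",
    secure=False,             # plain HTTP for a local demo
)

if not client.bucket_exists("datalake-raw"):
    client.make_bucket("datalake-raw")

# Upload a local file as an object in the new bucket.
client.fput_object("datalake-raw", "events/2024/events.json", "events.json")
```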
This integration enables data teams to efficiently transform and manage data using Athena with dbt Cloud’s robust features, enhancing the overall data workflow experience. This enables you to extract insights from your data without the complexity of managing infrastructure.
Introduction: A data lake is a centralized and scalable repository storing structured and unstructured data. The need for a data lake arises from the growing volume, variety, and velocity of data companies need to manage and analyze.
Data architecture definition: Data architecture describes the structure of an organization’s logical and physical data assets, and its data management resources, according to The Open Group Architecture Framework (TOGAF). An organization’s data architecture is the purview of data architects.
However, they often struggle with ever-larger data volumes, reverting to bottlenecked data access to manage large numbers of data engineering requests, along with rising data warehousing costs. This new open data architecture is built to maximize data access with minimal data movement and no data copies.
Everyone talks about data quality, as they should. Our research shows that improving the quality of information is the top benefit of data preparation activities. Data quality efforts are focused on clean data. Yes, clean data is important, but so is bad data.
Introduction: Delta Lake is an open-source storage layer that brings data lakes to the world of Apache Spark. Delta Lake provides an ACID-transaction-compliant, cloud-native platform on top of cloud object stores such as Amazon S3, Microsoft Azure Storage, and Google Cloud Storage.
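To make the ACID-on-object-storage point concrete, here is a minimal sketch using the delta-spark package; the local path and table contents are illustrative assumptions, not from the original article.

```python
# Hypothetical sketch: write and read a Delta table with delta-spark.
from delta import configure_spark_with_delta_pip
from pyspark.sql import SparkSession

builder = (
    SparkSession.builder
    .config("spark.sql.extensions", "io.delta.sql.DeltaSparkSessionExtension")
    .config("spark.sql.catalog.spark_catalog",
            "org.apache.spark.sql.delta.catalog.DeltaCatalog")
)
spark = configure_spark_with_delta_pip(builder).getOrCreate()

# Each write is an ACID transaction on top of plain object storage
# (a local path here; an s3:// or abfss:// path works the same way).
df = spark.createDataFrame([(1, "alice"), (2, "bob")], ["id", "name"])
df.write.format("delta").mode("overwrite").save("/tmp/delta/users")

spark.read.format("delta").load("/tmp/delta/users").show()
```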
Talend is a data integration and management software company that offers applications for cloud computing, big data integration, application integration, data quality and master data management.
The company focused on delivering small increments of customer value (data sets, reports, and other items) as their guiding principle. Small, manageable increments marked the project’s delivery cadence. They opted for Snowflake, a cloud-native data platform ideal for SQL-based analysis.
Speaker: Javier Ramírez, Senior AWS Developer Advocate, AWS
Will the data lake scale when you have twice as much data? Is your data secure? In this session, we address common pitfalls of building data lakes and show how AWS can help you manage data and analytics more efficiently. Javier Ramírez will present: The typical steps for building a data lake.
A data lake is a centralized repository that you can use to store all your structured and unstructured data at any scale. You can store your data as-is, without having to first structure it, and run different types of analytics for better business insights. We will use AWS Region us-east-1.
Many organizations operate data lakes spanning multiple cloud data stores. In these cases, you may want an integrated query layer to seamlessly run analytical queries across these diverse cloud stores and streamline your data analytics processes. This user can only query data from ADLS.
Apache Iceberg is an Apache 2.0-licensed, 100% open-source data table format that helps simplify data processing on large datasets stored in data lakes. Data engineers use Apache Iceberg because it’s fast, efficient, and reliable at any scale and keeps records of how datasets change over time.
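Those change records enable time travel. A minimal sketch of querying an Iceberg table at a past point in time from Spark SQL follows; the catalog alias, table name, and timestamp are placeholders, and the syntax assumes Spark 3.3+ with the Iceberg runtime.

```python
# Hypothetical sketch: Iceberg time travel and snapshot history in Spark SQL.
from pyspark.sql import SparkSession

# Assumes a session configured for Iceberg, as in the earlier sketch.
spark = SparkSession.builder.getOrCreate()

# Query the table as it existed at a given point in time.
spark.sql("""
    SELECT * FROM glue_catalog.sales.orders
    TIMESTAMP AS OF '2024-01-01 00:00:00'
""").show()

# Every commit is recorded as a snapshot in Iceberg's metadata tables.
spark.sql("SELECT * FROM glue_catalog.sales.orders.snapshots").show()
```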
Under that focus, Informatica's conference emphasized capabilities across six areas (all strong areas for Informatica): data integration, data management, data quality & governance, Master Data Management (MDM), data cataloging, and data security.
Today, Amazon Redshift is used by customers across all industries for a variety of use cases, including data warehouse migration and modernization, near real-time analytics, self-service analytics, data lake analytics, machine learning (ML), and data monetization.
Data lakes and data warehouses are two of the most important data storage and management technologies in a modern data architecture. Data lakes store all of an organization’s data, regardless of its format or structure.
Organizations are dealing with exponentially increasing data that ranges broadly from customer-generated information and financial transactions to edge-generated data and even operational IT server logs. A combination of complex data lake and data warehouse capabilities is required to leverage this data.
Organizations are accelerating their digital transformation and looking for innovative ways to engage with customers in this new digital era of data management. The challenge is to ensure that processes, applications and data can still be integrated across cloud and on-premises systems.
Unlocking the true value of data often gets impeded by siloed information. Traditional data management, wherein each business unit ingests raw data into separate data lakes or warehouses, hinders visibility and cross-functional analysis. Business units access clean, standardized data.
A modern data architecture enables companies to ingest virtually any type of data through automated pipelines into a data lake, which provides highly durable and cost-effective object storage at petabyte or exabyte scale. It supports open table formats such as Apache Iceberg 1.2.0 and Delta Lake 2.3.0.
Iceberg has become very popular for its support for ACID transactions in data lakes and features like schema and partition evolution, time travel, and rollback. AWS Glue 3.0 and later supports the Apache Iceberg framework for data lakes. The following diagram illustrates the solution architecture.
This led to inefficiencies in data governance and access control. AWS Lake Formation is a service that streamlines and centralizes the data lake creation and management process. The Solution: How BMW CDH solved data duplication. The CDH is a company-wide data lake built on Amazon Simple Storage Service (Amazon S3).
Given the importance of data in the world today, organizations face the dual challenges of managing large-scale, continuously incoming data while vetting its quality and reliability. AWS Glue is a serverless data integration service that you can use to effectively monitor and manage data quality through AWS Glue Data Quality.
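As a rough sketch of what a Glue Data Quality check looks like, the snippet below registers a DQDL ruleset against a Glue Catalog table via boto3. The database, table, column names, and rule thresholds are placeholder assumptions, not values from the original post.

```python
# Hypothetical sketch: define an AWS Glue Data Quality ruleset (DQDL)
# targeting a Glue Data Catalog table.
import boto3

glue = boto3.client("glue", region_name="us-east-1")

# DQDL rules: completeness, near-uniqueness, and an allowed-values check.
ruleset = """
Rules = [
    IsComplete "order_id",
    Uniqueness "order_id" > 0.99,
    ColumnValues "status" in ["OPEN", "SHIPPED", "CLOSED"]
]
"""

glue.create_data_quality_ruleset(
    Name="orders-quality-checks",
    Ruleset=ruleset,
    TargetTable={"DatabaseName": "sales", "TableName": "orders"},
)
```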
Over the years, organizations have invested in creating purpose-built, cloud-based data lakes that are siloed from one another. A major challenge is enabling cross-organization discovery and access to data across these multiple data lakes, each built on different technology stacks.
Amazon Redshift enables you to efficiently query and retrieve structured and semi-structured data from open-format files in an Amazon S3 data lake without having to load the data into Amazon Redshift tables. Amazon Redshift extends SQL capabilities to your data lake, enabling you to run analytical queries.
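A minimal sketch of that pattern: map a Glue Data Catalog database as an external (Spectrum) schema, then run SQL directly against the S3-backed table. The schema, database, role ARN, and cluster identifiers below are illustrative assumptions.

```python
# Hypothetical sketch: query open-format files in S3 from Redshift through
# an external schema, using the Redshift Data API.
import boto3

client = boto3.client("redshift-data", region_name="us-east-1")

statements = [
    # Map a Glue Data Catalog database as an external schema.
    """
    CREATE EXTERNAL SCHEMA IF NOT EXISTS lake
    FROM DATA CATALOG DATABASE 'sales'
    IAM_ROLE 'arn:aws:iam::123456789012:role/MySpectrumRole';
    """,
    # Analytical SQL runs directly against the S3-backed table.
    "SELECT status, COUNT(*) FROM lake.orders GROUP BY status;",
]

for sql in statements:
    client.execute_statement(
        ClusterIdentifier="my-cluster", Database="dev",
        DbUser="awsuser", Sql=sql,
    )
```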
Data lakes have been gaining popularity for storing vast amounts of data from diverse sources in a scalable and cost-effective way. As the number of data consumers grows, data lake administrators often need to implement fine-grained access controls for different user profiles.
For many organizations, this centralized data store follows a data lake architecture. Although data lakes provide a centralized repository, making sense of this data and extracting valuable insights can be challenging. Preprocessing: Lambda enables you to run code without provisioning or managing servers.
With this new functionality, customers can create up-to-date replicas of their data from applications such as Salesforce, ServiceNow, and Zendesk in an Amazon SageMaker Lakehouse and Amazon Redshift. SageMaker Lakehouse gives you the flexibility to access and query your data in-place with all Apache Iceberg compatible tools and engines.
In the current industry landscape, data lakes have become a cornerstone of modern data architecture, serving as repositories for vast amounts of structured and unstructured data. However, efficiently managing and synchronizing data within these lakes presents a significant challenge.
Amazon Redshift is a fast, fully managed petabyte-scale cloud data warehouse that makes it simple and cost-effective to analyze all your data using standard SQL and your existing business intelligence (BI) tools. Amazon Redshift also supports querying nested data with complex data types such as struct, array, and map.
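To illustrate the nested-data support, here is a sketch of a PartiQL-style query navigating struct fields and unnesting an array column; the table and column names are placeholders, and it would be run via the Data API pattern sketched earlier.

```python
# Hypothetical sketch: PartiQL-style navigation of nested types in Redshift.
nested_query = """
    SELECT o.customer.name, item.sku
    FROM orders AS o, o.items AS item   -- iterate over an array column
    WHERE o.customer.address.state = 'WA';
"""
```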
We often see requests from customers who have started their data journey by building data lakes on Microsoft Azure to extend access to the data to AWS services. In such scenarios, data engineers face challenges in connecting to and extracting data from storage containers on Microsoft Azure.
The open table format accelerates companies’ adoption of a modern data strategy because it allows them to use various tools on top of a single copy of the data. A solution based on Apache Iceberg encompasses complete data management, featuring simple built-in table optimization capabilities within an existing storage solution.
In this post, we delve into the key aspects of using Amazon EMR for modern data management, covering topics such as data governance, data mesh deployment, and streamlined data discovery. Organizations have multiple Hive data warehouses across EMR clusters, where the metadata gets generated.
Ask questions in plain English to find the right datasets, automatically generate SQL queries, or create data pipelines without writing code. This innovation drives an important change: you’ll no longer have to copy or move data between data lakes and data warehouses.