Data Lake and Technology - Data Leaders Brief

Data Lake

Technology

Diving Deeper into the Data Lake

David Menninger's Analyst Perspectives

NOVEMBER 13, 2020

A data lake is a centralized repository designed to house big data in structured, semi-structured and unstructured form. I have been covering the data lake topic for several years and encourage you to check out an earlier perspective called Data Lakes: Safe Way to Swim in Big Data?

Diving Deeper into the Data Lake

Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg

Webinars

Trending Sources

Understanding the Differences Between Data Lakes and Data Warehouses

Webinars

5 things on our data and AI radar for 2021

Building Best-in-Class Enterprise Analytics

Oracle Wants to Be the Database for AI

Load data incrementally from transactional data lakes to data warehouses

Recap of Amazon Redshift key product announcements in 2024

Choosing an open table format for your transactional data lake on AWS

Query your Iceberg tables in data lake using Amazon Redshift (Preview)

Using AWS AppSync and AWS Lake Formation to access a secure data lake through a GraphQL API

How BMW streamlined data access using AWS Lake Formation fine-grained access control

How Volkswagen streamlined access to data across multiple data lakes using Amazon DataZone – Part 1

Accelerate analytics and AI innovation with the next generation of Amazon SageMaker

Migrate Delta tables from Azure Data Lake Storage to Amazon S3 using AWS Glue

Enrich your serverless data lake with Amazon Bedrock

Run Apache XTable in AWS Lambda for background conversion of open table formats

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore federation

Expanding data analysis and visualization options: Amazon DataZone now integrates with Tableau, Power BI, and more

Use External Data Platform to Improve Analytics

Good Data Governance Improves Business Processes

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

Implement slowly changing dimensions in a data lake using AWS Glue and Delta

What is data architecture? A framework to manage data

Enable business users to analyze large datasets in your data lake with Amazon QuickSight

Centralize Your Data Processes With a DataOps Process Hub

United Airlines sets its flight plan for gen AI success

Implement tag-based access control for your data lake and Amazon Redshift data sharing with AWS Lake Formation

Amazon Web Services named a Leader in the 2024 Gartner Magic Quadrant for Data Integration Tools

How EUROGATE established a data mesh architecture using Amazon DataZone

Eight Top DataOps Trends for 2022

Streamline AI-driven analytics with governance: Integrating Tableau with Amazon DataZone

Create an Apache Hudi-based near-real-time transactional data lake using AWS DMS, Amazon Kinesis, AWS Glue streaming ETL, and data visualization using Amazon QuickSight

Introducing the technology behind watsonx.ai, IBM’s AI and data platform for enterprise

Analyzing the business-case approach Perdue Farms takes to derive value from data

The rise of the data lakehouse: A new era of data value

Look Out: Computer Vision in AI is Coming Into Sight

2021 Gift Giving Guide for Data Nerds

Carhartt turns to data under new CIO

The success of GenAI models lies in your data management strategy

Stay Connected