Data Lake and Events - Data Leaders Brief

Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg

AWS Big Data

OCTOBER 30, 2024

This is part two of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake ( Apache Iceberg ) using AWS Glue.

Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

Webinars

Trending Sources

Drug Launch Case Study: Amazing Efficiency Using DataOps

Webinars

Introducing simplified interaction with the Airflow REST API in Amazon MWAA

Unleash deeper insights with Amazon Redshift data sharing for data lake tables

How Volkswagen streamlined access to data across multiple data lakes using Amazon DataZone – Part 1

Using AWS AppSync and AWS Lake Formation to access a secure data lake through a GraphQL API

Monitor data pipelines in a serverless data lake

Run Apache XTable in AWS Lambda for background conversion of open table formats

Choosing an open table format for your transactional data lake on AWS

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

Enrich your serverless data lake with Amazon Bedrock

Synchronize data lakes with CDC-based UPSERT using open table format, AWS Glue, and Amazon MSK

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

Amazon Q data integration adds DataFrame support and in-prompt context-aware job creation

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

United Airlines sets its flight plan for gen AI success

Implementing a Pharma Data Mesh using DataOps

Deriving Value from Data Lakes with AI

Race Ahead of Threats with a Security Data Lake

The Increasing Importance of Open Table Formats

Simplify data ingestion from Amazon S3 to Amazon Redshift using auto-copy

Achieve data resilience using Amazon OpenSearch Service disaster recovery with snapshot and restore

Nexthink scales to trillions of events per day with Amazon MSK

Build a real-time GDPR-aligned Apache Iceberg data lake

Data Lakes: What Are They and Who Needs Them?

Automate replication of relational sources into a transactional data lake with Apache Iceberg and AWS Glue

Empower your Jira data in a data lake with Amazon AppFlow and AWS Glue

Perform upserts in a data lake using Amazon Athena and Apache Iceberg

Implement slowly changing dimensions in a data lake using AWS Glue and Delta

Build an ETL process for Amazon Redshift using Amazon S3 Event Notifications and AWS Step Functions

Enable business users to analyze large datasets in your data lake with Amazon QuickSight

Use open table format libraries on AWS Glue 5.0 for Apache Spark

Building end-to-end data lineage for one-time and complex queries using Amazon Athena, Amazon Redshift, Amazon Neptune and dbt

Create an Apache Hudi-based near-real-time transactional data lake using AWS DMS, Amazon Kinesis, AWS Glue streaming ETL, and data visualization using Amazon QuickSight

Ingest, transform, and deliver events published by Amazon Security Lake to Amazon OpenSearch Service

How Salesforce optimized their detection and response platform using AWS managed services

Apache Iceberg optimization: Solving the small files problem in Amazon EMR

Build a high-performance quant research platform with Apache Iceberg

How EUROGATE established a data mesh architecture using Amazon DataZone

Petabyte-scale log analytics with Amazon S3, Amazon OpenSearch Service, and Amazon OpenSearch Ingestion

The Internet of Things: Real-Time Data and Analytics Enable Business Innovation

Simplify data lake access control for your enterprise users with trusted identity propagation in AWS IAM Identity Center, AWS Lake Formation, and Amazon S3 Access Grants

Stay Connected