Data Lake, Enterprise and Metadata

Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg

AWS Big Data

OCTOBER 30, 2024

This is part two of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake ( Apache Iceberg ) using AWS Glue. Delete the bucket.

Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg

Enriching metadata for accurate text-to-SQL generation for Amazon Athena

Webinars

Trending Sources

Recap of Amazon Redshift key product announcements in 2024

Webinars

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

Migrate an existing data lake to a transactional data lake using Apache Iceberg

Bridging the gap between mainframe data and hybrid cloud environments

Use Apache Iceberg in a data lake to support incremental data processing

Build a high-performance quant research platform with Apache Iceberg

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore federation

Integrating Data Governance and Enterprise Architecture

Data Lakes on Cloud & it’s Usage in Healthcare

Enrich your serverless data lake with Amazon Bedrock

Migrate Delta tables from Azure Data Lake Storage to Amazon S3 using AWS Glue

Write queries faster with Amazon Q generative SQL for Amazon Redshift

What is a Data Mesh?

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

Improve operational efficiencies of Apache Iceberg tables built on Amazon S3 data lakes

Build a real-time GDPR-aligned Apache Iceberg data lake

Use AWS Glue ETL to perform merge, partition evolution, and schema evolution on Apache Iceberg

The Increasing Importance of Open Table Formats

Data Lakes: What Are They and Who Needs Them?

Top analytics announcements of AWS re:Invent 2024

Power enterprise-grade Data Vaults with Amazon Redshift – Part 1

Doing Cloud Migration and Data Governance Right the First Time

Implement tag-based access control for your data lake and Amazon Redshift data sharing with AWS Lake Formation

Data governance in the age of generative AI

Salesforce debuts Zero Copy Partner Network to ease data integration

Modernize your ETL platform with AWS Glue Studio: A case study from BMS

Informatica’s new data management clouds target health, finance services

Unstructured data management and governance using AWS AI/ML and analytics services

Data Swamp, Data Lake, Data Lakehouse: What to Know

3 Surprising Data Catalog Use Cases for Enterprises

Power enterprise-grade Data Vaults with Amazon Redshift – Part 2

Driving Business Value and ROI from a Hybrid Cloud Data Lake

Building a Beautiful Data Lakehouse

Build a multi-Region and highly resilient modern data architecture using AWS Glue and AWS Lake Formation

Denodo Provides a Logical Approach to Data Management

Data Governance Makes Data Security Less Scary

Accelerate SQL code migration from Google BigQuery to Amazon Redshift using BladeBridge

Petabyte-scale log analytics with Amazon S3, Amazon OpenSearch Service, and Amazon OpenSearch Ingestion

Stay Connected