Big Data, Data Architecture and Data Lake

Big Data

Data Architecture

Data Lake

Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg

AWS Big Data

OCTOBER 30, 2024

This is part two of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake ( Apache Iceberg ) using AWS Glue. Delete the bucket.

Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg

Incremental refresh for Amazon Redshift materialized views on data lake tables

Webinars

Trending Sources

Five Modern Data Architecture Trends

Webinars

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

Snowflake Builds on Its Success

Migrate an existing data lake to a transactional data lake using Apache Iceberg

Run Apache XTable in AWS Lambda for background conversion of open table formats

Unleash deeper insights with Amazon Redshift data sharing for data lake tables

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

Load data incrementally from transactional data lakes to data warehouses

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

Query your Iceberg tables in data lake using Amazon Redshift (Preview)

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

Choosing an open table format for your transactional data lake on AWS

Accelerate Amazon Redshift Data Lake queries with AWS Glue Data Catalog Column Statistics

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore federation

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

Synchronize data lakes with CDC-based UPSERT using open table format, AWS Glue, and Amazon MSK

Manage concurrent write conflicts in Apache Iceberg on the AWS Glue Data Catalog

How Volkswagen streamlined access to data across multiple data lakes using Amazon DataZone – Part 1

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

Insiders Cite The Wondrous Benefits Of Big Data In Fortnite

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

Expand data access through Apache Iceberg using Delta Lake UniForm on AWS

Simplify data ingestion from Amazon S3 to Amazon Redshift using auto-copy

Amazon Web Services named a Leader in the 2024 Gartner Magic Quadrant for Data Integration Tools

Automate replication of relational sources into a transactional data lake with Apache Iceberg and AWS Glue

Implement slowly changing dimensions in a data lake using AWS Glue and Delta

Modern Data Architecture for Telecommunications

How EUROGATE established a data mesh architecture using Amazon DataZone

Modern Data Architecture: Data Warehousing, Data Lakes, and Data Mesh Explained

Building a Beautiful Data Lakehouse

Modernizing Data Architectures

Implement tag-based access control for your data lake and Amazon Redshift data sharing with AWS Lake Formation

Accelerate SQL code migration from Google BigQuery to Amazon Redshift using BladeBridge

Ingest data from Google Analytics 4 and Google Sheets to Amazon Redshift using Amazon AppFlow

Accelerate Amazon Redshift secure data use with Satori – Part 2

What is a data architect? Skills, salaries, and how to become a data framework master

Build a multi-Region and highly resilient modern data architecture using AWS Glue and AWS Lake Formation

Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena

Architecture for the Data Lake

Data architecture strategy for data quality

AWS Lake Formation 2022 year in review

Stay Connected