Big Data, Data Analytics and Data Lake

Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg

AWS Big Data

OCTOBER 30, 2024

This is part two of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake ( Apache Iceberg ) using AWS Glue. Delete the bucket.

Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg

Incremental refresh for Amazon Redshift materialized views on data lake tables

Webinars

Trending Sources

Differentiating Between Data Lakes and Data Warehouses

Webinars

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

Here’s Why Automation For Data Lakes Could Be Important

Important Considerations When Migrating to a Data Lake

7 Key Benefits of Proper Data Lake Ingestion

Choosing an open table format for your transactional data lake on AWS

Using AWS AppSync and AWS Lake Formation to access a secure data lake through a GraphQL API

Unleash deeper insights with Amazon Redshift data sharing for data lake tables

Multicloud data lake analytics with Amazon Athena

Use Apache Iceberg in a data lake to support incremental data processing

Migrate Delta tables from Azure Data Lake Storage to Amazon S3 using AWS Glue

Monitor data pipelines in a serverless data lake

Recap of Amazon Redshift key product announcements in 2024

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

Accelerate Amazon Redshift Data Lake queries with AWS Glue Data Catalog Column Statistics

Enrich your serverless data lake with Amazon Bedrock

Reality and misconceptions about big data analytics, data lakes and the future of AI

Insiders Cite The Wondrous Benefits Of Big Data In Fortnite

Complexity Drives Costs: A Look Inside BYOD and Azure Data Lakes

2021 Gift Giving Guide for Data Nerds

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

Did Big Data Deliver Business Transformation & Improved CX?

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore federation

How EUROGATE established a data mesh architecture using Amazon DataZone

Foundational blocks of Amazon SageMaker Unified Studio: An admin’s guide to implement unified access to all your data, analytics, and AI

Amazon Q data integration adds DataFrame support and in-prompt context-aware job creation

Accelerate analytics and AI innovation with the next generation of Amazon SageMaker

Automate replication of relational sources into a transactional data lake with Apache Iceberg and AWS Glue

Building a Beautiful Data Lakehouse

Acquisitions on the Horizon in BI and Data Analytics Industry?

Empower your Jira data in a data lake with Amazon AppFlow and AWS Glue

Accelerate SQL code migration from Google BigQuery to Amazon Redshift using BladeBridge

Implement slowly changing dimensions in a data lake using AWS Glue and Delta

Create an Apache Hudi-based near-real-time transactional data lake using AWS DMS, Amazon Kinesis, AWS Glue streaming ETL, and data visualization using Amazon QuickSight

Ingest data from Google Analytics 4 and Google Sheets to Amazon Redshift using Amazon AppFlow

How Salesforce optimized their detection and response platform using AWS managed services

How Data Management and Big Data Analytics Speed Up Business Growth

5 Best Practices for Extracting, Analyzing, and Visualizing Data

Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena

Read and write S3 Iceberg table using AWS Glue Iceberg Rest Catalog from Open Source Apache Spark

Stay Connected