This is part two of a three-part series in which we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake (Apache Iceberg) using AWS Glue.
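As a rough illustration of the kind of Glue job involved, here is a minimal PySpark sketch that copies one SQL Server table into an Iceberg table; the connection details, table names, and the glue_catalog configuration are placeholders, not values from the post.

# Hypothetical AWS Glue PySpark job: copy a SQL Server table into an
# Apache Iceberg table registered in the AWS Glue Data Catalog.
import sys
from awsglue.context import GlueContext
from awsglue.job import Job
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext

args = getResolvedOptions(sys.argv, ["JOB_NAME"])
glue_context = GlueContext(SparkContext.getOrCreate())
spark = glue_context.spark_session
job = Job(glue_context)
job.init(args["JOB_NAME"], args)

# Read the legacy table over JDBC (host, database, and credentials are placeholders).
orders = (
    spark.read.format("jdbc")
    .option("url", "jdbc:sqlserver://legacy-host:1433;databaseName=sales")
    .option("dbtable", "dbo.orders")
    .option("user", "glue_reader")
    .option("password", "********")
    .load()
)

# Append into an Iceberg table; assumes the job was launched with Iceberg
# support enabled and a Spark catalog named "glue_catalog" configured.
orders.writeTo("glue_catalog.lakehouse.orders").append()

job.commit()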
A data lake is a centralized repository that you can use to store all your structured and unstructured data at any scale. You can store your data as-is, without having to first structure it, and run different types of analytics for better business insights.
Apache Iceberg is an open table format that brings atomicity, consistency, isolation, and durability (ACID) transactions to data lakes, streamlining data management. One of its key features is the ability to manage data using branches. We discuss two common strategies to verify the quality of published data.
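A write-audit-publish flow on an Iceberg branch might look like the following Spark sketch; the catalog, table, and branch names are assumptions for this illustration, and the quality checks are left as a placeholder.

# Illustrative write-audit-publish flow using an Iceberg branch from Spark.
# Catalog, table, and branch names are assumptions for this sketch.
spark.sql("ALTER TABLE glue_catalog.lakehouse.orders CREATE BRANCH audit")

# Route new writes to the audit branch instead of main.
spark.conf.set("spark.wap.branch", "audit")
incoming = spark.table("staged_orders")  # hypothetical staged source
incoming.writeTo("glue_catalog.lakehouse.orders").append()

# ...run data quality checks against the audit branch here...

# Publish: fast-forward main onto the audited branch once checks pass.
spark.sql(
    "CALL glue_catalog.system.fast_forward('lakehouse.orders', 'main', 'audit')"
)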
Amazon Redshift enables you to efficiently query and retrieve structured and semi-structured data from open format files in your Amazon S3 data lake without having to load the data into Amazon Redshift tables. Amazon Redshift extends SQL capabilities to your data lake, enabling you to run analytical queries.
Fortunately, a next-gen data architecture enabled by the Dremio data lake service removes the need for replicated data, helping organizations minimize complexity, boost efficiency, and dramatically reduce costs. Read this whitepaper to learn why organizations frequently end up with unnecessary data copies.
Iceberg has become very popular for its support for ACID transactions in data lakes and for features like schema and partition evolution, time travel, and rollback. AWS Glue 3.0 and later supports the Apache Iceberg framework for data lakes. The following diagram illustrates the solution architecture.
This led to inefficiencies in data governance and access control. AWS Lake Formation is a service that streamlines and centralizes the data lake creation and management process. How BMW's CDH solved data duplication: the CDH is a company-wide data lake built on Amazon Simple Storage Service (Amazon S3).
A modern data strategy redefines and enables sharing data across the enterprise and allows for both reading and writing of a singular instance of the data using an open table format.
A modern data architecture enables companies to ingest virtually any type of data through automated pipelines into a data lake, which provides highly durable and cost-effective object storage at petabyte or exabyte scale.
Today, Amazon Redshift is used by customers across all industries for a variety of use cases, including data warehouse migration and modernization, near real-time analytics, self-service analytics, data lake analytics, machine learning (ML), and data monetization.
Amazon Redshift enables you to directly access data stored in Amazon Simple Storage Service (Amazon S3) using SQL queries and join data across your data warehouse and data lake. With Amazon Redshift, you can query the data in your S3 data lake using a central AWS Glue metastore from your Redshift data warehouse.
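As a hedged illustration of that pattern, the following Python sketch (using the redshift_connector driver) maps a Glue database to an external schema and joins a warehouse table with an S3-backed table; the connection details, IAM role, and table names are all placeholders.

# Illustrative Redshift session: expose a Glue database as an external
# schema, then join warehouse and S3 data lake tables in one query.
import redshift_connector

conn = redshift_connector.connect(
    host="my-cluster.abc123.us-east-1.redshift.amazonaws.com",
    database="dev",
    user="analyst",
    password="********",
)
cur = conn.cursor()

# One-time mapping of the Glue database to an external schema.
cur.execute("""
    CREATE EXTERNAL SCHEMA IF NOT EXISTS lake
    FROM DATA CATALOG DATABASE 'lakehouse'
    IAM_ROLE 'arn:aws:iam::123456789012:role/RedshiftSpectrumRole'
""")

# Join a local warehouse table with an S3-backed external table in place.
cur.execute("""
    SELECT c.segment, SUM(o.amount) AS revenue
    FROM warehouse.customers AS c
    JOIN lake.orders AS o ON o.customer_id = c.id
    GROUP BY c.segment
""")
print(cur.fetchall())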
Since the deluge of big data over a decade ago, many organizations have learned to build applications to process and analyze petabytes of data. Data lakes have served as a central repository to store structured and unstructured data at any scale and in various formats.
For many organizations, this centralized data store follows a data lake architecture. Although data lakes provide a centralized repository, making sense of this data and extracting valuable insights can be challenging. The process creates a JSON file with the original_content and summary fields.
The data preparation process should take place alongside a long-term strategy built around GenAI use cases, such as content creation, digital assistants, and code generation. Known as data engineering, this involves setting up a data lake or lakehouse with data integrated with GenAI models.
Organizations are increasingly using a multi-cloud strategy to run their production workloads. We often see requests from customers who have started their data journey by building data lakes on Microsoft Azure and want to extend access to that data to AWS services. For this post, we use the Shared Key authentication method.
Data lake is a newer IT term created for a new category of data store. But just what is a data lake? According to IBM, “a data lake is a storage repository that holds an enormous amount of raw or refined data in native format until it is accessed.” That makes sense. I think the […].
Option 3: Azure Data Lakes. This leads us to Microsoft’s apparent long-term strategy for D365 F&SCM reporting: Azure Data Lakes. Azure Data Lakes are highly complex and designed with a fundamentally different purpose in mind than financial and operational reporting. Azure Data Lakes are complicated.
A modern data architecture is an evolutionary architecture pattern designed to integrate a data lake, data warehouse, and purpose-built stores with a unified governance model. The company wanted the ability to continue processing operational data in the secondary Region in the rare event of primary Region failure.
This article was co-authored by Duke Dyksterhouse, an Associate at Metis Strategy. Data & Analytics is delivering on its promise. Some are our clients—and more of them are asking for our help with their data strategy. So, they built a data lake. Often their ask is a thinly veiled admission of overwhelm.
The consulting firm will also need to create a data lake that makes it easy to store and share data from across the Spanish sports ecosystem, in order to achieve synergies around the various projects being carried out.
Unified access to your data is provided by Amazon SageMaker Lakehouse, a unified, open, and secure data lakehouse built on Apache Iceberg open standards. To identify the most promising opportunities, the team develops a segmentation strategy. The data analyst then discovers it and creates a comprehensive view of their market.
There is an established body of practice around creating, managing, and accessing OLAP data (known as “cubes”). Data Lakes. There has been a lot of talk over the past year or two in the D365 F&SCM world about “data lakes.” Traditional databases and data warehouses do not lend themselves to that task.
When you build your transactional data lake using Apache Iceberg to solve your functional use cases, you also need to focus on operational use cases for your S3 data lake to keep the production environment optimized and available. You still need to set appropriate EMRFS retries to provide additional resiliency.
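One way to raise those retries is through the emrfs-site configuration classification when launching a cluster; the boto3 sketch below is illustrative only, and the property names and values shown are assumptions to verify against the EMR documentation for your release.

# Sketch: raise EMRFS retry settings via the "emrfs-site" classification
# when launching an EMR cluster with boto3. Property names/values are
# assumptions to verify against the EMR documentation.
import boto3

emr = boto3.client("emr", region_name="us-east-1")
emr.run_job_flow(
    Name="iceberg-lake-jobs",
    ReleaseLabel="emr-6.15.0",
    Instances={
        "InstanceGroups": [
            {"InstanceRole": "MASTER", "InstanceType": "m5.xlarge", "InstanceCount": 1}
        ],
        "KeepJobFlowAliveWhenNoSteps": False,
    },
    Configurations=[
        {
            "Classification": "emrfs-site",
            "Properties": {
                "fs.s3.maxRetries": "20",          # retry throttled S3 calls up to 20 times
                "fs.s3.retryPeriodSeconds": "10",  # base wait between retries
            },
        }
    ],
    JobFlowRole="EMR_EC2_DefaultRole",
    ServiceRole="EMR_DefaultRole",
)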
Data lakes are a popular choice for today’s organizations to store their data around their business activities. As a best practice of data lake design, data should be immutable once stored. A data lake built on AWS uses Amazon Simple Storage Service (Amazon S3) as its primary storage environment.
With data becoming the driving force behind many industries today, having a modern data architecture is pivotal for organizations to be successful. In this post, we describe Orca’s journey building a transactional data lake using Amazon Simple Storage Service (Amazon S3), Apache Iceberg, and AWS Analytics.
In our previous post, Backtesting index rebalancing arbitrage with Amazon EMR and Apache Iceberg, we showed how to use Apache Iceberg in the context of strategy backtesting. Our analysis shows that Iceberg can accelerate query performance by up to 52%, reduce operational costs, and significantly improve data management at scale.
Disaster recovery is vital for organizations, offering a proactive strategy to mitigate the impact of unforeseen events like system failures, natural disasters, or cyberattacks. In Disaster Recovery (DR) Architecture on AWS, Part I: Strategies for Recovery in the Cloud, we introduced four major strategies for disaster recovery (DR) on AWS.
As organizations across the globe modernize their data platforms with data lakes on Amazon Simple Storage Service (Amazon S3), handling slowly changing dimensions (SCDs) in data lakes can be challenging.
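One common way to handle a Type 2 SCD on an Iceberg table is a MERGE that closes out changed rows, followed by an insert of the new versions, roughly as in this Spark SQL sketch; the table, view, and column names are hypothetical.

# Hypothetical SCD Type 2 apply on an Iceberg dimension table via Spark SQL.
# "staged_customer_changes" is assumed to be a registered view of incoming rows.
spark.sql("""
    MERGE INTO glue_catalog.lakehouse.dim_customer AS t
    USING staged_customer_changes AS s
    ON t.customer_id = s.customer_id AND t.is_current = true
    WHEN MATCHED AND t.attributes_hash <> s.attributes_hash THEN
      UPDATE SET t.is_current = false, t.end_date = s.effective_date
""")
spark.sql("""
    INSERT INTO glue_catalog.lakehouse.dim_customer
    SELECT s.customer_id, s.name, s.segment, s.attributes_hash,
           s.effective_date AS start_date, NULL AS end_date, true AS is_current
    FROM staged_customer_changes AS s
    WHERE NOT EXISTS (          -- only keys that are new or were just closed out
        SELECT 1 FROM glue_catalog.lakehouse.dim_customer AS t
        WHERE t.customer_id = s.customer_id
          AND t.is_current = true
          AND t.attributes_hash = s.attributes_hash
    )
""")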
It continues to position its document database product as a developer data platform, which is primarily used to support the development and deployment of net-new applications rather than as a direct replacement for relational databases. The recent launch of MongoDB 8.0 […]
Events and many other security data types are stored in Imperva’s Threat Research Multi-Region data lake. Imperva harnesses data to improve their business outcomes. As part of their solution, they are using Amazon QuickSight to unlock insights from their data.
Data-driven organizations treat data as an asset and use it across different lines of business (LOBs) to drive timely insights and better business decisions. This leads to having data across many instances of data warehouses and data lakes using a modern data architecture in separate AWS accounts.
Decades-old apps designed to retain a limited amount of data due to storage costs at the time are also unlikely to integrate easily with AI tools, says Brian Klingbeil, chief strategy officer at managed services provider Ensono. The aim is to create integration pipelines that seamlessly connect different systems and data sources.
In this first article of a two-part series on data product strategies, I present some of the emerging themes in data product development and how they inform the prerequisites and foundational capabilities of an enterprise data platform that can serve as the backbone for developing successful data product strategies.
But because of the infrastructure, employees spent hours on manual data analysis and spreadsheet jockeying. We had plenty of reporting, but very little data insight, and no real semblance of a data strategy. Second, the manual spreadsheet work resulted in significant manual data entry.
Building a data lake on Amazon Simple Storage Service (Amazon S3) provides numerous benefits for an organization. However, many use cases, like performing change data capture (CDC) from an upstream relational database to an Amazon S3-based data lake, require handling data at a record level.
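A record-level CDC apply step on an Iceberg table is often expressed as a single MERGE over a staged change feed, along these lines; the orders_cdc view, its op column, and the column names are assumptions modeled on typical AWS DMS-style output.

# Hypothetical record-level CDC apply via Spark SQL MERGE on Iceberg.
# "orders_cdc" is assumed to be a staged view with an "op" column
# (I = insert, U = update, D = delete), e.g. produced by AWS DMS.
spark.sql("""
    MERGE INTO glue_catalog.lakehouse.orders AS t
    USING orders_cdc AS s
    ON t.order_id = s.order_id
    WHEN MATCHED AND s.op = 'D' THEN DELETE
    WHEN MATCHED THEN
      UPDATE SET t.status = s.status, t.amount = s.amount
    WHEN NOT MATCHED AND s.op <> 'D' THEN
      INSERT (order_id, status, amount) VALUES (s.order_id, s.status, s.amount)
""")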
For a while now, vendors have been advocating that people put their data in a data lake when they put their data in the cloud. The Data Lake. The idea is that you put your data into a data lake. Then, at a later point in time, the end-user analyst can come along and […].
Beyond breaking down silos, modern data architectures need to provide interfaces that make it easy for users to consume data using tools fit for their jobs. Data must be able to freely move to and from data warehouses, data lakes, and data marts, and interfaces must make it easy for users to consume that data.
But, even with the backdrop of an AI-dominated future, many organizations still find themselves struggling with everything from managing data volumes and complexity to security concerns to rapidly proliferating data silos and governance challenges.
I previously wrote about the importance of open table formats to the evolution of data lakes into data lakehouses. The concept of the data lake was initially proposed as a single environment where data could be combined from multiple sources to be stored and processed to enable analysis by multiple users for multiple purposes.
Previously, Walgreens was attempting to perform that task with its data lake but faced two significant obstacles: cost and time. Those challenges are well known to many organizations as they have sought to obtain analytical knowledge from their vast amounts of data. Lakehouses redeem the failures of some data lakes.
Mark Booth: We have a growth strategy to improve our business, and to support that, we’re driving a transformation in technology and business processes. But the more challenging work is in making our processes as efficient as possible so we capture the right data in our desire to become a more data-driven business.
Our digital transformation strategy is centered around establishing a consumer-oriented model that helps us customize chronic care management based on the ever-changing conditions of each patient.” Tim Scannell: How much of a role do technologies like data analytics and AI play in DaVita’s overall technology and business strategy?
However, they do contain effective data management, organization, and integrity capabilities. As a result, users can easily find what they need, and organizations avoid the operational and cost burdens of storing unneeded or duplicate data copies. Warehouse, data lake convergence. Meet the data lakehouse.
Enterprise data is brought into data lakes and data warehouses to carry out analytical, reporting, and data science use cases using AWS analytical services like Amazon Athena, Amazon Redshift, Amazon EMR, and so on. Subsequently, we’ll explore strategies for overcoming these challenges.