Data Lake - Data Leaders Brief

Data Lake

Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg

AWS Big Data

OCTOBER 30, 2024

This is part two of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake (Apache Iceberg) using AWS Glue.

From Data Lake to Data Products: Operationalising Analytics at Scale

Webinars

Trending Sources

Incremental refresh for Amazon Redshift materialized views on data lake tables

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

Top Considerations for Building an Open Cloud Data Lake

Data Lakehouses Enable Data as a Product

Drug Launch Case Study: Amazing Efficiency Using DataOps

How to Use Apache Iceberg Tables?

The Principality of Asturias bets on smart water management through a 'data lake'

12 Considerations When Evaluating Data Lake Engine Vendors for Analytics and BI

Recap of Amazon Redshift key product announcements in 2024

How BMW streamlined data access using AWS Lake Formation fine-grained access control

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

Accelerate your data quality journey for lakehouse architecture with Amazon SageMaker, Apache Iceberg on AWS, Amazon S3 tables, and AWS Glue Data Quality

The Next-Generation Cloud Data Lake: An Open, No-Copy Data Architecture

Run Apache XTable in AWS Lambda for background conversion of open table formats

Enforce table level access control on data lake tables using AWS Glue 5.0 with AWS Lake Formation

Better together? Why AWS is unifying data analytics and AI services in SageMaker

Microsoft reimagines Fabric with focus on AI

The Unexpected Cost of Data Copies

Introducing simplified interaction with the Airflow REST API in Amazon MWAA

Amazon Q data integration adds DataFrame support and in-prompt context-aware job creation

Amazon Redshift out-of-the-box performance innovations for data lake queries

Building End-to-End Data Pipelines: From Data Ingestion to Analysis

Checklist Report: Preparing for the Next-Generation Cloud Data Architecture

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

How EUROGATE established a data mesh architecture using Amazon DataZone

Accelerate analytics and AI innovation with the next generation of Amazon SageMaker

Building end-to-end data lineage for one-time and complex queries using Amazon Athena, Amazon Redshift, Amazon Neptune and dbt

Building Best-in-Class Enterprise Analytics

Reduce time to access your transactional data for analytical processing using the power of Amazon SageMaker Lakehouse and zero-ETL

Accelerate queries on Apache Iceberg tables through AWS Glue auto compaction

Steps taken to build Sevita’s first enterprise data platform

Introducing AWS Glue 5.0 for Apache Spark

Ultimate Guide to the Cloud Data Lake Engine

Bridging the AI Execution Gap: Why Strong Data Foundations Make or Break Enterprise AI

Build a high-performance quant research platform with Apache Iceberg

Expanding data analysis and visualization options: Amazon DataZone now integrates with Tableau, Power BI, and more

United Airlines sets its flight plan for gen AI success

Data Analytics in the Cloud for Developers and Founders

Cloudera Lakehouse Optimizer Makes it Easier Than Ever to Deliver High-Performance Iceberg Tables

Tapping into the benefits of an open data lakehouse for enterprise AI

Simplify data ingestion from Amazon S3 to Amazon Redshift using auto-copy

Indeed puts AI to work to help job seekers find new roles
