This is part two of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake (Apache Iceberg) using AWS Glue.
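For readers who want a concrete picture of that load step, here is a minimal sketch, assuming a Glue 4.0 PySpark job with the Iceberg integration enabled; the JDBC URL, credentials, and the glue_catalog.lakehouse.orders table name are placeholders, not the series' actual code.

```python
# A minimal sketch, not the series' actual job: read a SQL Server table
# over JDBC with AWS Glue and write it to an Apache Iceberg table.
# All connection details and table names below are placeholders.
from awsglue.context import GlueContext
from pyspark.context import SparkContext

glue_context = GlueContext(SparkContext.getOrCreate())
spark = glue_context.spark_session

# Read the legacy table over JDBC.
df = (spark.read.format("jdbc")
      .option("url", "jdbc:sqlserver://legacy-host:1433;databaseName=sales")
      .option("dbtable", "dbo.orders")
      .option("user", "etl_user")
      .option("password", "***")
      .load())

# Write to an Iceberg table registered in the Glue Data Catalog; assumes
# the job was launched with the Iceberg integration enabled
# (e.g., --datalake-formats iceberg on Glue 4.0).
df.writeTo("glue_catalog.lakehouse.orders").createOrReplace()
```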
At AWS, we are committed to empowering organizations with tools that streamline data analytics and transformation processes. This integration enables data teams to efficiently transform and manage data using Athena with dbt Cloud’s robust features, enhancing the overall data workflow experience.
This week on the keynote stages at AWS re:Invent 2024, you heard Matt Garman, CEO of AWS, and Swami Sivasubramanian, VP of AI and Data at AWS, speak about the next generation of Amazon SageMaker, the center for all of your data, analytics, and AI. The relationship between analytics and AI is rapidly evolving.
Perhaps one of the biggest perks is scalability, which simply means that with good data lake ingestion a small business can begin to handle larger data volumes. The reality is that businesses collecting data will likely be doing so on several levels. Data Analytics Simplified. Proper Scalability.
Today, Amazon Redshift is used by customers across all industries for a variety of use cases, including data warehouse migration and modernization, near real-time analytics, self-service analytics, data lake analytics, machine learning (ML), and data monetization.
Data lakes are among the most complex and sophisticated data storage and processing facilities we have available to us today. Analytics Magazine notes that data lakes are among the most useful tools an enterprise can have at its disposal when aiming to outpace competitors through innovation.
Amazon Redshift has established itself as a highly scalable, fully managed cloud data warehouse trusted by tens of thousands of customers for its superior price-performance and advanced data analytics capabilities. This allows you to maintain a comprehensive view of your data while optimizing for cost-efficiency.
For many organizations, this centralized data store follows a data lake architecture. Although data lakes provide a centralized repository, making sense of this data and extracting valuable insights can be challenging. The Lambda function sends the content to Amazon Bedrock with directions to summarize it.
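To make that summarization step concrete, here is a minimal sketch of such a Lambda handler, assuming the Bedrock Converse API via boto3; the model ID, the event's "document_text" field, and the prompt are illustrative assumptions, not the post's actual implementation.

```python
# A minimal sketch of a Lambda handler that asks Amazon Bedrock to
# summarize extracted text. The model ID and the event's
# "document_text" field are assumptions for illustration.
import boto3

bedrock = boto3.client("bedrock-runtime")

def handler(event, context):
    text = event["document_text"]  # hypothetical input field
    response = bedrock.converse(
        modelId="anthropic.claude-3-haiku-20240307-v1:0",
        messages=[{
            "role": "user",
            "content": [{"text": f"Summarize the following content:\n\n{text}"}],
        }],
    )
    return {"summary": response["output"]["message"]["content"][0]["text"]}
```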
A modern data architecture enables companies to ingest virtually any type of data through automated pipelines into a data lake, which provides highly durable and cost-effective object storage at petabyte or exabyte scale.
At AWS re:Invent 2024, we announced the next generation of Amazon SageMaker, the center for all your data, analytics, and AI. Governance features including fine-grained access control are built into SageMaker Unified Studio using Amazon SageMaker Catalog to help you meet enterprise security requirements across your entire data estate.
That stands for “bring your own database,” and it refers to a model in which core ERP data are replicated to a separate standalone database used exclusively for reporting. OLAP reporting based on a data warehouse model is a well-proven solution for companies with robust reporting requirements. Option 3: Azure Data Lakes.
In addition to real-time analytics and visualization, the data needs to be shared for long-term data analytics and machine learning applications. This approach supports both the immediate needs of visualization tools such as Tableau and the long-term demands of digital twin and IoT data analytics.
In 2022, data organizations will institute robust automated processes around their AI systems to make them more accountable to stakeholders. Model developers will test for AI bias as part of their pre-deployment testing. Continuous testing, monitoring and observability will prevent biased models from deploying or continuing to operate.
Amazon SageMaker Unified Studio (preview) provides a unified experience for using data, analytics, and AI capabilities. You can use familiar AWS services for model development, generative AI, data processing, and analytics, all within a single, governed environment. To use Amazon Bedrock FMs, grant access to base models.
First-generation – expensive, proprietary enterprise data warehouse and business intelligence platforms maintained by a specialized team drowning in technical debt. Second-generation – gigantic, complex data lake maintained by a specialized team drowning in technical debt. See the pattern?
Use cases for Hive metastore federation for Amazon EMR. Hive metastore federation for Amazon EMR is applicable to the following use cases: Governance of Amazon EMR-based data lakes – Producers generate data within their AWS accounts using an Amazon EMR-based data lake supported by EMRFS on Amazon Simple Storage Service (Amazon S3) and HBase.
Figure 3 shows an example processing architecture with data flowing in from internal and external sources. Each data source is updated on its own schedule, for example, daily, weekly, or monthly. The data scientists and analysts have what they need to build analytics for the user. The new Recipes run, and BOOM!
With data becoming the driving force behind many industries today, having a modern data architecture is pivotal for organizations to be successful. In this post, we describe Orca’s journey building a transactional data lake using Amazon Simple Storage Service (Amazon S3), Apache Iceberg, and AWS Analytics.
Once completed within two years, the platform, OneTru, will give TransUnion and its customers access to TransUnion’s behemoth trove of consumer data to fuel next-generation analytical services, machine learning models, and generative AI applications, says Achanta, who is driving the effort and held similar posts at Neustar and Walmart.
Although Jira Cloud provides reporting capability, loading this data into a data lake will facilitate enrichment with other business data, as well as support the use of business intelligence (BI) tools and artificial intelligence (AI) and machine learning (ML) applications. Search for the Jira Cloud connector.
2019 can best be described as an era of modern cloud data analytics. Convergence in an industry like data analytics can take many forms. We have seen industry rollups in which firms create a collection of analytical tools under one brand. Realizing a Flexible, Multi-Cloud, Open-Platform, Data Hub-Driven Future.
A DataOps process hub offers a way for business analytics teams to cope with fast-paced requirements without expanding staff or sacrificing quality. Analytics Hub and Spoke. The data analytics function in large enterprises is generally distributed across departments and roles. Business Analytics Challenges.
In the book, author Zhamak Dehghani reveals that, despite the time, money, and effort poured into them, data warehouses and data lakes fail when applied at the scale and speed of today’s organizations. A distributed data mesh is a better choice for the data scientist, the engineer, and the operations engineer.
Carhartt’s signature workwear is near ubiquitous, and its continuing presence on factory floors and at skate parks alike is fueled in part by an ongoing digital transformation that is advancing the 133-year-old Midwest company’s operations to make the most of advanced digital technologies, including the cloud, data analytics, and AI.
Applying artificial intelligence (AI) to data analytics for deeper, better insights and automation is a growing enterprise IT priority. But the data repository options that have been around for a while tend to fall short in their ability to serve as the foundation for big data analytics powered by AI.
One-time queries and complex queries are two common scenarios in enterprise data analytics. Complex queries refer to large-scale data processing and in-depth analysis based on petabyte-level data warehouses in massive data scenarios. Here, data modeling uses dbt on Amazon Redshift.
Building a data lake on Amazon Simple Storage Service (Amazon S3) provides numerous benefits for an organization. However, many use cases, like performing change data capture (CDC) from an upstream relational database to an Amazon S3-based data lake, require handling data at a record level.
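As an illustration of that record-level handling, here is a hedged sketch that applies a CDC batch to an S3-backed Apache Iceberg table with a Spark SQL MERGE; the catalog and table names, the key column, and the cdc_batch staging view are hypothetical.

```python
# Hedged sketch of applying a CDC batch to an S3-backed Iceberg table at
# the record level via Spark SQL MERGE. The catalog/table names, the key
# column, and the 'cdc_batch' staging view are hypothetical.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()  # Iceberg catalog assumed configured

# 'cdc_batch' holds change rows with an 'op' column: I(nsert)/U(pdate)/D(elete).
spark.sql("""
    MERGE INTO glue_catalog.lakehouse.customers AS t
    USING cdc_batch AS s
      ON t.customer_id = s.customer_id
    WHEN MATCHED AND s.op = 'D' THEN DELETE
    WHEN MATCHED THEN UPDATE SET *
    WHEN NOT MATCHED AND s.op <> 'D' THEN INSERT *
""")
```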
Apache Iceberg is an open table format for very large analytic datasets. It manages large collections of files as tables, and it supports modern analytical data lake operations such as record-level insert, update, delete, and time travel queries.
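A brief sketch of the time-travel capability mentioned above, assuming a Spark session with an Iceberg catalog configured; the table name, timestamp, and snapshot ID are placeholders.

```python
# Sketch of Iceberg time travel from Spark; the table name, timestamp,
# and snapshot ID are placeholders, and an Iceberg-enabled Spark
# session is assumed.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Query the table as it existed at a point in time...
spark.sql("""
    SELECT * FROM glue_catalog.lakehouse.orders
    TIMESTAMP AS OF '2024-01-01 00:00:00'
""").show()

# ...or pin a specific snapshot by ID.
old = spark.read.option("snapshot-id", 123456789).table("glue_catalog.lakehouse.orders")
```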
Though you may encounter the terms “data science” and “data analytics” being used interchangeably in conversations or online, they refer to two distinctly different concepts. Meanwhile, data analytics is the act of examining datasets to extract value and find answers to specific questions.
For instance, for a variety of reasons, in the short term, CDAOs are challenged with quantifying the benefits of analytics investments. Some of the work is very foundational, such as building an enterprise data lake and migrating it to the cloud, which enables other more direct value-added activities such as self-service.
Taking the broadest possible interpretation of data analytics, Azure offers more than a dozen services — and that’s before you include Power BI, with its AI-powered analysis and new datamart option, or governance-oriented approaches such as Microsoft Purview. Azure Data Factory. Azure Data Explorer.
In August, we wrote about how in a future where distributed data architectures are inevitable, unifying and managing operational and business metadata is critical to successfully maximizing the value of data, analytics, and AI.
To look into these processes in more detail, we will now explain the agile BI methodology as well as agile analytics, and provide steps for agile BI development. Agile Business Intelligence & Analytics Methodology. In the traditional model, communication between developers and business users is not a priority.
Custom context enhances the AI model’s understanding of your specific data model, business logic, and query patterns, allowing it to generate more relevant and accurate SQL recommendations. Within this feature, user data is secure and private. Your data is not shared across accounts.
Analytics remained one of the key focus areas this year, with significant updates and innovations aimed at helping businesses harness their data more efficiently and accelerate insights. This zero-ETL integration reduces the complexity and operational burden of data replication to let you focus on deriving insights from your data.
You can’t talk about data analytics without talking about data modeling. These two functions are nearly inseparable as we move further into a world of analytics that blends sources of varying volume, variety, veracity, and velocity. Building the right data model is an important part of your data strategy.
“We’ll work with those scientists and actually build the computer models and go run it, and it can be anything from sub-physical particle imaging to protein folding,” he says. “In other cases, it’s more of a standard computational requirement, and we help them provide the data in the right formats.”
Amazon Redshift integrates with AWS HealthLake and data lakes through Redshift Spectrum and Amazon S3 auto-copy features, enabling you to query data directly from files on Amazon S3. This means you no longer have to create an external schema in Amazon Redshift to use the data lake tables cataloged in the Data Catalog.
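As a rough illustration of querying a cataloged data lake table straight from Redshift, the following sketch uses the redshift_connector driver and three-part naming through the auto-mounted awsdatacatalog database; the cluster endpoint, credentials, and table names are invented.

```python
# Rough sketch: querying a Glue Data Catalog table from Amazon Redshift
# without creating an external schema, via the auto-mounted
# 'awsdatacatalog' database. Endpoint, credentials, and table names
# are invented placeholders.
import redshift_connector

conn = redshift_connector.connect(
    host="my-cluster.abc123.us-east-1.redshift.amazonaws.com",
    database="dev",
    user="awsuser",
    password="***",
)
cur = conn.cursor()
cur.execute("SELECT count(*) FROM awsdatacatalog.healthlake_db.patient")
print(cur.fetchone())
cur.close()
conn.close()
```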
7 Best Platforms to Practice SQL • Explainable AI: 10 Python Libraries for Demystifying Your Model's Decisions • ChatGPT: Everything You Need to Know • Data Lakes and SQL: A Match Made in Data Heaven • Google Data Analytics Certification Review for 2023
Finding similar columns in a data lake has important applications in data cleaning and annotation, schema matching, data discovery, and analytics across multiple data sources. The workflow begins with an AWS Glue job that converts the CSV files into Apache Parquet format.
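That first conversion step might look roughly like the following Glue PySpark sketch; the S3 locations are invented, and the real job's read/write options may differ.

```python
# Hedged sketch of the CSV-to-Parquet conversion step in an AWS Glue
# PySpark job; the S3 locations are invented for illustration.
from awsglue.context import GlueContext
from pyspark.context import SparkContext

glue_context = GlueContext(SparkContext.getOrCreate())
spark = glue_context.spark_session

# Read the raw CSV files (header row assumed).
df = (spark.read
      .option("header", "true")
      .csv("s3://example-bucket/raw/tables/"))

# Rewrite them as Parquet for efficient downstream analytics.
df.write.mode("overwrite").parquet("s3://example-bucket/parquet/tables/")
```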
One modern data platform solution that provides simplicity and flexibility to grow is Snowflake’s data cloud and platform. Sirius has created a lightweight development tool to rapidly build and deploy best-practice data models. Security Data Lake. Learn more about our Security Data Lake Solution.
In this regard, the enterprise data product catalog acts as a federated portal, facilitating cross-domain access and interoperability while maintaining alignment with governance principles. This model balances node or domain-level autonomy with enterprise-level oversight, creating a scalable and consistent framework across ANZ.
You can attach an EMR Studio Workspace to an EMR cluster and use the cluster’s compute power to run data science jobs. Data is often stored in data lakes managed by AWS Lake Formation, enabling you to apply fine-grained access control through a simple grant or revoke mechanism.
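To show how simple that grant/revoke model is, here is a hedged boto3 sketch against the Lake Formation API; the principal ARN, database, and table names are placeholders.

```python
# Hedged illustration of Lake Formation's grant/revoke model via boto3;
# the principal ARN, database, and table names are placeholders.
import boto3

lf = boto3.client("lakeformation")

# Grant SELECT on a single cataloged table to a role.
lf.grant_permissions(
    Principal={"DataLakePrincipalIdentifier": "arn:aws:iam::111122223333:role/EMRStudioUser"},
    Resource={"Table": {"DatabaseName": "lakehouse", "Name": "orders"}},
    Permissions=["SELECT"],
)

# Revoking access is the symmetric call.
lf.revoke_permissions(
    Principal={"DataLakePrincipalIdentifier": "arn:aws:iam::111122223333:role/EMRStudioUser"},
    Resource={"Table": {"DatabaseName": "lakehouse", "Name": "orders"}},
    Permissions=["SELECT"],
)
```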
Tapped to guide the company’s digital journey, as she had for firms such as P&G and Adidas, Kanioura has roughly 1,000 data engineers, software engineers, and data scientists working on a “human-centered model” to transform PepsiCo into a next-generation company.