Blog, Data Architecture and Data Lake

Migrate an existing data lake to a transactional data lake using Apache Iceberg

AWS Big Data

OCTOBER 3, 2023

A data lake is a centralized repository that you can use to store all your structured and unstructured data at any scale. You can store your data as-is, without having to first structure the data and then run different types of analytics for better business insights. Choose Next to create your stack.

Migrate an existing data lake to a transactional data lake using Apache Iceberg

What is a Data Mesh?

Webinars

Trending Sources

Laying the Foundation for Modern Data Architecture

Webinars

Data Minimization as Design Guideline for New Data Architectures

Eight Top DataOps Trends for 2022

Query your Iceberg tables in data lake using Amazon Redshift (Preview)

How Volkswagen streamlined access to data across multiple data lakes using Amazon DataZone – Part 1

Synchronize data lakes with CDC-based UPSERT using open table format, AWS Glue, and Amazon MSK

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

Data Architecture and Strategy in the AI Era

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

Centralize Your Data Processes With a DataOps Process Hub

Modern Data Architecture: Data Warehousing, Data Lakes, and Data Mesh Explained

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

Modern Data Architecture for Telecommunications

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

Breaking State and Local Data Silos with Modern Data Architectures

Implement tag-based access control for your data lake and Amazon Redshift data sharing with AWS Lake Formation

Cloudera and Snowflake Partner to Deliver the Most Comprehensive Open Data Lakehouse

Implement slowly changing dimensions in a data lake using AWS Glue and Delta

Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena

Expand data access through Apache Iceberg using Delta Lake UniForm on AWS

Simplify data ingestion from Amazon S3 to Amazon Redshift using auto-copy

Data architecture strategy for data quality

How to modernize data lakes with a data lakehouse architecture

Data Mart vs. Data Lake: Understanding the Difference

Data democratization: How data architecture can drive business decisions and AI initiatives

Databricks’ new data lakehouse aims at media, entertainment sector

The Future of the Data Lakehouse – Open

DataOps For Business Analytics Teams

Why Data Mesh Needs Data Virtualization

Why Data Mesh Needs Data Virtualization

Supercharge Your Data Lakehouse with Apache Iceberg in Cloudera Data Platform

Deploy and Optimize Your Snowflake Environment Faster With Accelerators

SoftBank Selects Cloudera Data Platform to Leverage Customer Intelligence While Ensuring Data Security

Enhance data security and governance for Amazon Redshift Spectrum with VPC endpoints

Habib Bank manages data at scale with Cloudera Data Platform

Automated data governance with AWS Glue Data Quality, sensitive data detection, and AWS Lake Formation

Perform data parity at scale for data modernization programs using AWS Glue Data Quality

Connecting the Data Lifecycle

How Getir unleashed data democratization using a data mesh architecture with Amazon Redshift

How the Public Sector Can Maximize the Value of Dark Data

How ATPCO enables governed self-service data access to accelerate innovation with Amazon DataZone

Stay Connected