Data Architecture, Data Lake and Optimization

What is data architecture? A framework to manage data

CIO Business Intelligence

DECEMBER 20, 2024

Data architecture definition: Data architecture describes the structure of an organization's logical and physical data assets and data management resources, according to The Open Group Architecture Framework (TOGAF). An organization's data architecture is the purview of data architects.

Data Architecture

Data Architecture, Management Consulting, Internet of Things

Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg

AWS Big Data

OCTOBER 30, 2024

This is part two of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake (Apache Iceberg) using AWS Glue.

Incremental refresh for Amazon Redshift materialized views on data lake tables

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

The Unexpected Cost of Data Copies

Accelerate Amazon Redshift Data Lake queries with AWS Glue Data Catalog Column Statistics

Unleash deeper insights with Amazon Redshift data sharing for data lake tables

Choosing an open table format for your transactional data lake on AWS

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

Query your Iceberg tables in data lake using Amazon Redshift (Preview)

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

Synchronize data lakes with CDC-based UPSERT using open table format, AWS Glue, and Amazon MSK

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

Centralize Your Data Processes With a DataOps Process Hub

Data Architecture and Strategy in the AI Era

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

How EUROGATE established a data mesh architecture using Amazon DataZone

Modern Data Architecture for Telecommunications

Manage concurrent write conflicts in Apache Iceberg on the AWS Glue Data Catalog

Deploy and Optimize Your Snowflake Environment Faster With Accelerators

Automate replication of relational sources into a transactional data lake with Apache Iceberg and AWS Glue

Build a multi-Region and highly resilient modern data architecture using AWS Glue and AWS Lake Formation

Carhartt turns to data under new CIO

Top analytics announcements of AWS re:Invent 2024

Building a Beautiful Data Lakehouse

Simplify data ingestion from Amazon S3 to Amazon Redshift using auto-copy

Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena

Data’s dark secret: Why poor quality cripples AI and growth

Petabyte-scale log analytics with Amazon S3, Amazon OpenSearch Service, and Amazon OpenSearch Ingestion

What is a data architect? Skills, salaries, and how to become a data framework master

Amazon Redshift announcements at AWS re:Invent 2023 to enable analytics on all your data

Accelerate SQL code migration from Google BigQuery to Amazon Redshift using BladeBridge

How to modernize data lakes with a data lakehouse architecture

Data architecture strategy for data quality

How smava makes loans transparent and affordable using Amazon Redshift Serverless

Databricks’ new data lakehouse aims at media, entertainment sector

Detect, mask, and redact PII data using AWS Glue before loading into Amazon OpenSearch Service

The Future of the Data Lakehouse – Open

Data democratization: How data architecture can drive business decisions and AI initiatives

Unlock scalability, cost-efficiency, and faster insights with large-scale data migration to Amazon Redshift

Enhance data security and governance for Amazon Redshift Spectrum with VPC endpoints

AWS re:Invent 2023 Amazon Redshift Sessions Recap
