Data Architecture, Data Lake and IT

What is data architecture? A framework to manage data

CIO Business Intelligence

DECEMBER 20, 2024

Data architecture definition: Data architecture describes the structure of an organization's logical and physical data assets and data management resources, according to The Open Group Architecture Framework (TOGAF). An organization's data architecture is the purview of data architects.

Data Architecture

Data Architecture Management Consulting Internet of Things

Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg

AWS Big Data

OCTOBER 30, 2024

This is part two of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake (Apache Iceberg) using AWS Glue. … To start the job, choose Run. … format(dbname)).config("spark.sql.catalog.glue_catalog.catalog-impl", …
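The excerpt above cuts off in the middle of a Spark catalog setting. As a rough illustration only, and not the post's actual code, the sketch below collects the Spark properties commonly documented for pointing an Iceberg catalog at the AWS Glue Data Catalog. The helper name `iceberg_glue_catalog_conf`, the bucket path, and the default catalog name are assumptions made for this example; the property values are the standard Iceberg class names, which may differ from what the original post uses.

```python
from typing import Dict


def iceberg_glue_catalog_conf(
    warehouse_path: str, catalog_name: str = "glue_catalog"
) -> Dict[str, str]:
    """Return Spark properties for an Iceberg catalog backed by AWS Glue.

    Hypothetical helper for illustration; it only builds a dictionary and
    does not start Spark or contact AWS.
    """
    prefix = f"spark.sql.catalog.{catalog_name}"
    return {
        # Enables Iceberg SQL extensions such as MERGE INTO.
        "spark.sql.extensions": (
            "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions"
        ),
        # Registers the named catalog as an Iceberg Spark catalog.
        prefix: "org.apache.iceberg.spark.SparkCatalog",
        # Root location for table data and metadata (assumed S3 path).
        f"{prefix}.warehouse": warehouse_path,
        # Uses the AWS Glue Data Catalog as the catalog implementation.
        f"{prefix}.catalog-impl": "org.apache.iceberg.aws.glue.GlueCatalog",
        # Reads and writes table files through Iceberg's S3 file IO.
        f"{prefix}.io-impl": "org.apache.iceberg.aws.s3.S3FileIO",
    }


# The bucket name below is a placeholder, not a value from the post.
conf = iceberg_glue_catalog_conf("s3://example-bucket/warehouse/")
for key, value in conf.items():
    print(f"{key}={value}")
```

In a Glue or PySpark job, each pair could then be passed to `SparkSession.builder.config(key, value)` before the session is created.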

Incremental refresh for Amazon Redshift materialized views on data lake tables

Snowflake Builds on Its Success

Checklist Report: Preparing for the Next-Generation Cloud Data Architecture

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

Migrate an existing data lake to a transactional data lake using Apache Iceberg

What is a Data Mesh?

Laying the Foundation for Modern Data Architecture

Unleash deeper insights with Amazon Redshift data sharing for data lake tables

Run Apache XTable in AWS Lambda for background conversion of open table formats

Eight Top DataOps Trends for 2022

Load data incrementally from transactional data lakes to data warehouses

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

Choosing an open table format for your transactional data lake on AWS

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

How Volkswagen streamlined access to data across multiple data lakes using Amazon DataZone – Part 1

Data Minimization as Design Guideline for New Data Architectures

Accelerate Amazon Redshift Data Lake queries with AWS Glue Data Catalog Column Statistics

Query your Iceberg tables in data lake using Amazon Redshift (Preview)

Synchronize data lakes with CDC-based UPSERT using open table format, AWS Glue, and Amazon MSK

Centralize Your Data Processes With a DataOps Process Hub

Data Architecture and Strategy in the AI Era

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

Building a Lakehouse – Try Delta Lake!

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

Data’s dark secret: Why poor quality cripples AI and growth

Modern Data Architecture for Telecommunications

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore federation

Breaking State and Local Data Silos with Modern Data Architectures

Automate replication of relational sources into a transactional data lake with Apache Iceberg and AWS Glue

Amazon Web Services named a Leader in the 2024 Gartner Magic Quadrant for Data Integration Tools

Manage concurrent write conflicts in Apache Iceberg on the AWS Glue Data Catalog

How EUROGATE established a data mesh architecture using Amazon DataZone

Carhartt turns to data under new CIO

Implement tag-based access control for your data lake and Amazon Redshift data sharing with AWS Lake Formation

Implement slowly changing dimensions in a data lake using AWS Glue and Delta

Cloudera and Snowflake Partner to Deliver the Most Comprehensive Open Data Lakehouse

Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena

Building a Beautiful Data Lakehouse

The essential check list for effective data democratization
