TL;DR: Functional, Idempotent, Tested, Two-stage (FITT) data architecture has saved our sanity—no more 3 AM pipeline debugging sessions. The alternative—maintaining three to five copies of data in every environment and spending entire weekends debugging why Level 1 data differs from Level 3 data—is unsustainable.
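To make the FITT idea concrete, here is a minimal sketch (hypothetical dataset and file layout, not the authors' actual pipeline) of what functional, idempotent, two-stage processing can look like in Python: the transform is a pure function, and output is written per partition so re-running a day overwrites rather than duplicates.

```python
# Minimal FITT-style sketch; paths, schema, and dataset names are illustrative only.
from datetime import date
from pathlib import Path
import json

RAW = Path("lake/raw/orders")          # stage 1: immutable landed data
CURATED = Path("lake/curated/orders")  # stage 2: rebuilt purely from raw

def transform(records: list[dict]) -> list[dict]:
    """Functional: a pure function of its input, easy to unit test."""
    return [
        {"order_id": r["id"], "amount_usd": round(r["amount_cents"] / 100, 2)}
        for r in records
        if r.get("status") == "complete"
    ]

def run(partition: date) -> None:
    """Idempotent: re-running a partition overwrites the same output file."""
    raw_file = RAW / f"{partition:%Y-%m-%d}.json"
    out_file = CURATED / f"{partition:%Y-%m-%d}.json"
    records = json.loads(raw_file.read_text())
    out_file.parent.mkdir(parents=True, exist_ok=True)
    out_file.write_text(json.dumps(transform(records)))

if __name__ == "__main__":
    run(date(2024, 1, 15))
```

Because the curated layer is derived entirely from raw, a bad run can be fixed by re-running a partition instead of hand-patching intermediate copies.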
Data architecture definition: Data architecture describes the structure of an organization's logical and physical data assets and data management resources, according to The Open Group Architecture Framework (TOGAF). An organization's data architecture is the purview of data architects.
However, the biggest challenge for most organizations in adopting Operational AI is outdated or inadequate data infrastructure. To succeed, Operational AI requires a modern data architecture.
Taking Ownership of Time: The solution isn’t to abandon modern data architectures, but to explicitly own the timing aspects of data quality. Document not just what data moves where, but when it moves and what depends on that timing. This means treating schedules as first-class design artifacts.
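One way to treat schedules as design artifacts, sketched below with hypothetical dataset names, is to keep the cadence and the downstream timing dependencies in version-controlled code where they can be reviewed and checked, rather than living only in a scheduler UI.

```python
# Sketch of a schedule manifest as a design artifact; all names are hypothetical.
from dataclasses import dataclass, field

@dataclass(frozen=True)
class ScheduledDataset:
    name: str
    cron: str                  # when the data moves
    expected_latency_min: int  # how stale the output may be when consumers read it
    downstream: list[str] = field(default_factory=list)  # who depends on that timing

SCHEDULES = [
    ScheduledDataset("raw.orders", cron="0 2 * * *", expected_latency_min=60,
                     downstream=["curated.orders"]),
    ScheduledDataset("curated.orders", cron="0 4 * * *", expected_latency_min=30,
                     downstream=["reporting.daily_revenue"]),
]

def check_ordering(schedules: list[ScheduledDataset]) -> None:
    """Fail fast if a dataset is scheduled at or before the data it depends on."""
    hour = {s.name: int(s.cron.split()[1]) for s in schedules}
    for s in schedules:
        for child in s.downstream:
            if child in hour and hour[child] <= hour[s.name]:
                raise ValueError(
                    f"{child} runs at {hour[child]}:00 but depends on "
                    f"{s.name}, which only starts at {hour[s.name]}:00"
                )

check_ordering(SCHEDULES)
```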
However, they often struggle with increasingly large data volumes, reverting to bottlenecking data access to manage large numbers of data engineering requests and rising data warehousing costs. This new open data architecture is built to maximize data access with minimal data movement and no data copies.
The path to achieving AI at scale is paved with myriad challenges: data quality and availability, deployment, and integration with existing systems among them. Another challenge here stems from the existing architecture within these organizations. Building a strong, modern foundation: what goes into a modern data architecture?
Building a Trusted AI Data Architecture: The Foundation of Scalable Intelligence. Discover how AI data architecture shapes data quality and governance for successful AI initiatives. What is AI data architecture?
If there’s one thing we’ve learned at Dataiku after talking to thousands of prospects and customers about their data architecture, it’s that data architectures tend to be more aspirational than realistic because, at the enterprise level, data architecture is both complex and constantly changing.
The introduction of these faster, more powerful networks has triggered an explosion of data, which needs to be processed in real time to meet customer demands. Traditional data architectures struggle to handle these workloads, and without a robust, scalable hybrid data platform, the risk of falling behind is real.
By moving analytic workloads to the data lakehouse you can save money, make more of your data accessible to consumers faster, and provide users a better experience. In this webinar, Dremio and AWS will discuss the most common challenges in data architecture and how to overcome them with an open data lakehouse architecture on AWS.
Create a Scalable Data Architecture: Modern AI requires architectures designed for flexibility, performance, and scale: implement cloud-based data platforms, adopt data lake/data mesh architectures, ensure real-time data processing capabilities, design for scalability and performance, and build self-service data access capabilities.
Furthermore, generally speaking, data should not be split across multiple databases on different cloud providers to achieve cloud neutrality. Not my original quote, but a cardinal sin of cloud-native data architecture is copying data from one location to another.
This is part two of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake (Apache Iceberg) using AWS Glue.
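The post itself walks through the AWS Glue specifics; as a rough orientation, the core pattern looks something like the PySpark sketch below. Connection details, catalog, and table names are placeholders, and the Iceberg catalog configuration is assumed to already be set on the Spark session (in Glue, the session and catalog come from the job configuration).

```python
# Illustrative sketch only: legacy SQL Server -> Apache Iceberg with PySpark.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("sqlserver-to-iceberg")
    # Assumes the Iceberg runtime and a catalog named "glue_catalog" are configured.
    .getOrCreate()
)

# Read the source table over JDBC (the SQL Server driver must be on the classpath).
orders = (
    spark.read.format("jdbc")
    .option("url", "jdbc:sqlserver://legacy-db:1433;databaseName=sales")
    .option("dbtable", "dbo.orders")
    .option("user", "etl_user")
    .option("password", "***")
    .load()
)

# Append into a transactional Iceberg table; Iceberg handles the ACID commit.
# For an initial load, .createOrReplace() can be used instead of .append().
orders.writeTo("glue_catalog.sales.orders").append()
```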
We also examine how centralized, hybrid, and decentralized data architectures support scalable, trustworthy ecosystems. As data-centric AI, automated metadata management, and privacy-aware data sharing mature, the opportunity to embed data quality into the enterprise's core has never been more significant.
Unfortunately, data replication, transformation, and movement can result in longer time to insight, reduced efficiency, elevated costs, and increased security and compliance risk.
Need for a data mesh architecture: Because entities in the EUROGATE group generate vast amounts of data from various sources (across departments, locations, and technologies), the traditional centralized data architecture struggles to keep up with the demands for real-time insights, agility, and scalability.
He has over 13 years of professional experience building and optimizing enterprise data warehouses and is passionate about enabling customers to realize the power of their data. He specializes in migrating enterprise data warehouses to AWS Modern Data Architecture.
With this launch, you can query data regardless of where it is stored with support for a wide range of use cases, including analytics, ad-hoc querying, data science, machine learning, and generative AI. We’ve simplified data architectures, saving you time and costs on unnecessary data movement, data duplication, and custom solutions.
“The first pivot was moving to become an agile organization, getting into the hyperscaler model, pivoting our services toward that, and unifying our data strategies to get ready for the next wave of transformation,” Moisant says. Overhauling the company’s data architecture was a top priority.
Every data-driven project calls for a review of your data architecture—and that includes embedded analytics. Before you add new dashboards and reports to your application, you need to evaluate your data architecture with analytics in mind. Here are nine questions to ask yourself when planning your ideal architecture.
“The challenge is that these architectures are convoluted, requiring diverse and multiple models, sophisticated retrieval-augmented generation stacks, advanced data architectures, and niche expertise,” they said. They predicted more mature firms will seek help from AI service providers and systems integrators.
The fact is, even the world’s most powerful large language models (LLMs) are only as good as the data foundations on which they are built. So, unless insurers get their data houses in order, the real gains promised by AI will not materialize.
About the authors: Narayani Ambashta is an Analytics Specialist Solutions Architect at AWS, focusing on the automotive and manufacturing sector, where she guides strategic customers in developing modern data and AI strategies.
If an organization is going to achieve truly impactful, real-time outputs from analytics and AI, it needs to ensure that all data—including structured and unstructured—is properly governed and managed even as the scale of data grows rapidly.
Data architectures to support reporting, business intelligence, and analytics have evolved dramatically over the past 10 years. Download this TDWI Checklist report to understand how your organization can make the transition to a modernized data architecture and the decision-making around that transition.
The landscape of big data management has been transformed by the rising popularity of open table formats such as Apache Iceberg, Apache Hudi, and Linux Foundation Delta Lake. These formats, designed to address the limitations of traditional data storage systems, have become essential in modern data architectures.
In modern data architectures, Apache Iceberg has emerged as a popular table format for data lakes, offering key features including ACID transactions and concurrent write support.
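As a brief illustration of what those ACID guarantees enable, an upsert against an Iceberg table can be expressed as a single atomic MERGE. The sketch below uses placeholder catalog and table names and assumes a Spark session already configured with an Iceberg catalog.

```python
# Hypothetical upsert into an Iceberg table; catalog/table names are placeholders.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("iceberg-upsert").getOrCreate()

# Incoming change records, registered as a temporary view for the MERGE.
updates = spark.createDataFrame(
    [(1001, "shipped"), (1002, "cancelled")], ["order_id", "status"]
)
updates.createOrReplaceTempView("updates")

# The MERGE commits atomically; concurrent writers are reconciled by Iceberg's
# optimistic concurrency control at commit time.
spark.sql("""
    MERGE INTO glue_catalog.sales.orders AS t
    USING updates AS s
    ON t.order_id = s.order_id
    WHEN MATCHED THEN UPDATE SET t.status = s.status
    WHEN NOT MATCHED THEN INSERT (order_id, status) VALUES (s.order_id, s.status)
""")
```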
Speakers: Verizon, Snowflake, Affinity Federal Credit Union, EverQuote, and AtScale
Using predictive/prescriptive analytics, given the available data. The impact that data literacy programs and using a semantic layer can deliver. Avoiding common analytics infrastructure and data architecture challenges. Thursday, July 29th, 2021 at 11AM PDT, 2PM EDT, 7PM GMT.
With Gen AI interest growing, organizations are forced to examine their data architecture and maturity. “This has also led to many data modernization projects, where specialized business and IT services players with data life-cycle services capabilities have started engaging with clients across different vertical markets.”
This enables you to extract insights from your data without the complexity of managing infrastructure. dbt has emerged as a leading framework, allowing data teams to transform and manage data pipelines effectively.
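For readers who haven't used it, a dbt model is essentially a versioned SQL SELECT that dbt materializes as a table or view. A sketch of invoking a project programmatically, assuming dbt-core 1.5+ and an existing project with a hypothetical model name, looks like this:

```python
# Sketch of running a dbt project from Python; requires dbt-core >= 1.5 and an
# existing dbt project in the working directory. The model name is hypothetical.
from dbt.cli.main import dbtRunner

runner = dbtRunner()
# Equivalent to `dbt run --select stg_orders` on the command line.
result = runner.invoke(["run", "--select", "stg_orders"])

if not result.success:
    raise RuntimeError(f"dbt run failed: {result.exception}")
```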
Conclusion: AWS Glue Data Catalog usage metrics are an effective enhancement to your data infrastructure monitoring capabilities. They address the growing need for detailed observability through Amazon CloudWatch in modern data architectures built on top of the Data Catalog.
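The exact namespaces and metric names are in the post and the AWS documentation; as an illustration of the monitoring pattern, the boto3 sketch below pulls hourly datapoints for one metric, with the namespace and metric name shown as placeholders rather than confirmed values.

```python
# Illustrative CloudWatch query via boto3; Namespace and MetricName below are
# placeholders; substitute the actual Data Catalog usage metric names.
from datetime import datetime, timedelta, timezone
import boto3

cloudwatch = boto3.client("cloudwatch")
now = datetime.now(timezone.utc)

response = cloudwatch.get_metric_statistics(
    Namespace="AWS/Glue",        # placeholder
    MetricName="ApiCallCount",   # placeholder
    StartTime=now - timedelta(hours=24),
    EndTime=now,
    Period=3600,                 # one datapoint per hour
    Statistics=["Sum"],
)
for point in sorted(response["Datapoints"], key=lambda p: p["Timestamp"]):
    print(point["Timestamp"], point["Sum"])
```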
He has over 13 years of professional experience building and optimizing enterprise data warehouses and is passionate about enabling customers to realize the power of their data. He specializes in migrating enterprise data warehouses to AWS Modern Data Architecture. Enrico holds an M.Sc.
Go excels at high-throughput data ingestion, real-time stream processing, microservices architectures, system reliability and uptime, and operational simplicity. Go vs. Python: which fits into the modern data stack better? Understanding how these languages fit into modern data architectures requires looking at the bigger picture.
Through this integrated environment, data analysts, data scientists, and ML engineers can use SageMaker Unified Studio to perform advanced SQL analytics on the transactional data. Sudarshan Narasimhan is a Principal Solutions Architect at AWS specialized in data, analytics and databases.
He is deeply passionate about data architecture and helps customers build analytics solutions at scale on AWS. Frank Dattalo is a Software Engineer with Amazon OpenSearch Service. He focuses on the search and plugin experience in Amazon OpenSearch Serverless.
Build up: Databases that have grown in size, complexity, and usage build up the need to rearchitect the model and architecture to support that growth over time.
You know the old saying that you can lead a horse to water, but you can't make it drink? The same sort of logic can be applied to AI adoption by modern businesses: you can roll out AI systems, but you can't force them to use the data they need to operate effectively.
This post was co-written with Dipankar Mazumdar, Staff Data Engineering Advocate with AWS Partner OneHouse. Data architecture has evolved significantly to handle growing data volumes and diverse workloads.
Suvojit Dasgupta is a Principal Data Architect at AWS. He leads a team of skilled engineers in designing and building scalable data solutions for AWS customers. He specializes in developing and implementing innovative data architectures to address complex business challenges.
He helps customers with architectural guidance and optimisation. He leverages his experience to help people bring their ideas to life, focusing on distributed processing and big data architectures. He is passionate about helping customers resolve challenging issues in the Big Data area.