Data Architecture, Data Warehouse and Interactive

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

AWS Big Data

NOVEMBER 22, 2024

The need for streamlined data transformations As organizations increasingly adopt cloud-based data lakes and warehouses, the demand for efficient data transformation tools has grown. This enables you to extract insights from your data without the complexity of managing infrastructure.

Data Lake

Data Lake Data Warehouse Cost-Benefit Data Transformation

Load data incrementally from transactional data lakes to data warehouses

AWS Big Data

OCTOBER 19, 2023

Data lakes and data warehouses are two of the most important data storage and management technologies in a modern data architecture. Data lakes store all of an organization’s data, regardless of its format or structure. Delta Lake doesn’t have a specific concept for incremental queries.

Data Lake

Data Lake Data Warehouse Visualization Snapshot

What is a Data Mesh?

DataKitchen

AUGUST 3, 2021

The data mesh design pattern breaks giant, monolithic enterprise data architectures into subsystems or domains, each managed by a dedicated team. The past decades of enterprise data platform architectures can be summarized in 69 words. Introduction to Data Mesh. Source: Thoughtworks.

Data Architecture

Data Architecture Data Lake Cost-Benefit Data Warehouse

Webinars

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Unlocking near real-time analytics with petabytes of transaction data using Amazon Aurora Zero-ETL integration with Amazon Redshift and dbt Cloud

AWS Big Data

NOVEMBER 27, 2024

While customers can perform some basic analysis within their operational or transactional databases, many still need to build custom data pipelines that use batch or streaming jobs to extract, transform, and load (ETL) data into their data warehouse for more comprehensive analysis. Deploy dbt models to Amazon Redshift.

Data Warehouse

Data Warehouse Analytics Testing Sales

Run Apache XTable in AWS Lambda for background conversion of open table formats

AWS Big Data

NOVEMBER 26, 2024

This post was co-written with Dipankar Mazumdar, Staff Data Engineering Advocate with AWS Partner OneHouse. Data architecture has evolved significantly to handle growing data volumes and diverse workloads. In Amazon S3 and AWS Glue, we can see our Hudi dataset and table along with the metadata folder.hoodie.

Metadata

Metadata Data Lake Snapshot Data Warehouse

Centralize near-real-time governance through alerts on Amazon Redshift data warehouses for sensitive queries

AWS Big Data

JUNE 29, 2023

Amazon Redshift is a fully managed, petabyte-scale data warehouse service in the cloud that delivers powerful and secure insights on all your data with the best price-performance. With Amazon Redshift, you can analyze your data to derive holistic insights about your business and your customers.

Data Warehouse

Data Warehouse Dashboards Testing Visualization

Build a secure data visualization application using the Amazon Redshift Data API with AWS IAM Identity Center

AWS Big Data

MARCH 6, 2025

Tens of thousands of customers use Amazon Redshift for modern data analytics at scale, delivering up to three times better price-performance and seven times better throughput than other cloud data warehouses. The credentials make sure that only authorized users can interact with the Redshift data.

Visualization

Visualization Sales Data Warehouse Management

Unlock scalability, cost-efficiency, and faster insights with large-scale data migration to Amazon Redshift

AWS Big Data

AUGUST 1, 2024

Large-scale data warehouse migration to the cloud is a complex and challenging endeavor that many organizations undertake to modernize their data infrastructure, enhance data management capabilities, and unlock new business opportunities. This makes sure the new data platform can meet current and future business goals.

Data Warehouse

Data Warehouse KPI Optimization Cost-Benefit

Data democratization: How data architecture can drive business decisions and AI initiatives

IBM Big Data Hub

AUGUST 4, 2023

Traditionally, data was seen as information to be put on reserve, only called upon during customer interactions or executing a program. Today, the way businesses use data is much more fluid; data literate employees use data across hundreds of apps, analyze data for better decision-making, and access data from numerous locations.

Data Architecture

Data Architecture Data Lake Machine Learning Data Governance

The Future of the Data Lakehouse – Open

CIO Business Intelligence

JUNE 23, 2022

These lakes power mission critical large scale data analytics, business intelligence (BI), and machine learning use cases, including enterprise data warehouses. In recent years, the term “data lakehouse” was coined to describe this architectural pattern of tabular analytics over data in the data lake.

Data Lake

Data Lake Data Warehouse Machine Learning Data-driven

Automate data loading from your database into Amazon Redshift using AWS Database Migration Service (DMS), AWS Step Functions, and the Redshift Data API

AWS Big Data

JULY 2, 2024

Amazon Redshift is a fast, scalable, secure, and fully managed cloud data warehouse that makes it simple and cost-effective to analyze all your data using standard SQL and your existing ETL (extract, transform, and load), business intelligence (BI), and reporting tools.

Data Warehouse

Data Warehouse Sales Testing Big Data

The Future of the Data Lakehouse – Open

Cloudera

JUNE 18, 2022

These lakes power mission critical large scale data analytics, business intelligence (BI), and machine learning use cases, including enterprise data warehouses. In recent years, the term “data lakehouse” was coined to describe this architectural pattern of tabular analytics over data in the data lake.

Data Lake

Data Lake Data Warehouse Machine Learning Data-driven

Power enterprise-grade Data Vaults with Amazon Redshift – Part 1

AWS Big Data

NOVEMBER 16, 2023

Amazon Redshift is a popular cloud data warehouse, offering a fully managed cloud-based service that seamlessly integrates with an organization’s Amazon Simple Storage Service (Amazon S3) data lake, real-time streams, machine learning (ML) workflows, transactional workflows, and much more—all while providing up to 7.9x

Enterprise

Enterprise Data Warehouse Data Lake Optimization

How smava makes loans transparent and affordable using Amazon Redshift Serverless

AWS Big Data

DECEMBER 21, 2023

To speed up the self-service analytics and foster innovation based on data, a solution was needed to provide ways to allow any team to create data products on their own in a decentralized manner. To create and manage the data products, smava uses Amazon Redshift , a cloud data warehouse.

Data Lake

Data Lake Data Warehouse Data-driven B2B

Accelerate Amazon Redshift secure data use with Satori – Part 1

AWS Big Data

SEPTEMBER 21, 2023

Satori integrates natively with both Amazon Redshift provisioned clusters and Amazon Redshift Serverless for easy setup of your Amazon Redshift data warehouse in the secure Satori portal. In part 2, we will explore how to set up self-service data access with Satori to data stored in Amazon Redshift.

Data Warehouse

Data Warehouse Interactive Data Architecture Data-driven

SAP Datasphere review: turning data from a technical problem to a business data product.

Jen Stirrup

MARCH 29, 2023

Organisations are looking at ways of simplifying data; for example, through simple rebranding efforts to disguise the complexity. However, SAP Datasphere goes much deeper deeper than a simple rebranding; it is the next generation of SAP Data Warehouse Cloud. They fail to get a grip on their data.

Data Warehouse

Data Warehouse Metadata Data Integration Business Intelligence

Combine transactional, streaming, and third-party data on Amazon Redshift for financial services

AWS Big Data

FEBRUARY 1, 2024

This stack creates the following resources and necessary permissions to integrate the services: Data stream – With Amazon Kinesis Data Streams , you can send data from your streaming source to a data stream to ingest the data into a Redshift data warehouse. version cluster. version cluster.

Data Warehouse

Data Warehouse Dashboards Risk Management Risk

Top analytics announcements of AWS re:Invent 2024

AWS Big Data

FEBRUARY 26, 2025

Amazon SageMaker Lakehouse provides an open data architecture that reduces data silos and unifies data across Amazon Simple Storage Service (Amazon S3) data lakes, Redshift data warehouses, and third-party and federated data sources. AWS Glue 5.0 Finally, AWS Glue 5.0

Analytics

Analytics Data Lake Metadata Data Warehouse

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

AWS Big Data

MARCH 10, 2023

They enable transactions on top of data lakes and can simplify data storage, management, ingestion, and processing. These transactional data lakes combine features from both the data lake and the data warehouse. Data can be organized into three different zones, as shown in the following figure.

Data Lake

Data Lake Sales Data Warehouse Snapshot

Empowering data-driven excellence: How the Bluestone Data Platform embraced data mesh for success

AWS Big Data

FEBRUARY 27, 2024

The following are the key components of the Bluestone Data Platform: Data mesh architecture – Bluestone adopted a data mesh architecture, a paradigm that distributes data ownership across different business units. This enables data-driven decision-making across the organization.

Data-driven

Data-driven Data Lake Data Quality Data Governance

Migrate Microsoft Azure Synapse Analytics to Amazon Redshift using AWS SCT

AWS Big Data

OCTOBER 18, 2023

Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse that provides the flexibility to use provisioned or serverless compute for your analytical workloads. The decoupled compute and storage architecture of Amazon Redshift enables you to build highly scalable, resilient, and cost-effective workloads.

Analytics

Analytics Data Warehouse Dashboards Testing

Create a modern data platform using the Data Build Tool (dbt) in the AWS Cloud

AWS Big Data

NOVEMBER 9, 2023

The aim was to bolster their analytical capabilities and improve data accessibility while ensuring a quick time to market and high data quality, all with low total cost of ownership (TCO) and no need for additional tools or licenses. It’s raw, unprocessed data straight from the source.

Data Warehouse

Data Warehouse Testing Data Quality Reporting

Your guide to AWS Analytics at AWS re:Invent 2023

AWS Big Data

NOVEMBER 13, 2023

11:30 AM – 12:30 PM (PDT) Ceasars Forum ANT318 | Accelerate innovation with end-to-end serverless data architecture. 4:30 PM – 5:30 PM (PDT) Wynn ANT207 | Understand your data with business context. 1:00 PM – 2:00 PM (PDT) Venetian ANT201 | Accelerate innovation with real-time data.

Analytics

Analytics Data Lake Data Warehouse Data-driven

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

Customer 360 (C360) provides a complete and unified view of a customer’s interactions and behavior across all touchpoints and channels. This view is used to identify patterns and trends in customer behavior, which can inform data-driven decisions to improve business outcomes. Then, you transform this data into a concise format.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

Generic orchestration framework for data warehousing workloads using Amazon Redshift RSQL

AWS Big Data

APRIL 3, 2023

Tens of thousands of customers run business-critical workloads on Amazon Redshift , AWS’s fast, petabyte-scale cloud data warehouse delivering the best price-performance. With Amazon Redshift, you can query data across your data warehouse, operational data stores, and data lake using standard SQL.

Data Warehouse

Data Warehouse Testing Data Lake Data-driven

AWS Lake Formation 2022 year in review

AWS Big Data

JANUARY 31, 2023

In this post, we are excited to summarize the features that the AWS Glue Data Catalog, AWS Glue crawler, and Lake Formation teams delivered in 2022. Whether you are a data platform builder, data engineer, data scientist, or any technology leader interested in data lake solutions, this post is for you.

Data Lake

Data Lake Data Governance Data Architecture Machine Learning

Lay the groundwork now for advanced analytics and AI

CIO Business Intelligence

AUGUST 3, 2023

In the past, First Service Credit Union’s Chief data officer Ty Robbins struggled to integrate data from the legacy, non-relational, and often proprietary tabular databases on which many credit unions run. Each of the acquired companies had multiple data sets with different primary keys, says Hepworth. “We

Analytics

Analytics Data Lake Metadata Cost-Benefit

96 Percent of Businesses Can’t Be Wrong: How Hybrid Cloud Came to Dominate the Data Sector

Cloudera

JANUARY 26, 2022

Modern, real-time businesses require accelerated cycles of innovation that are expensive and difficult to maintain with legacy data platforms. The hybrid cloud’s premise—two data architectures fused together—gives companies options to leverage those solutions and to address decision-making criteria, on a case-by-case basis. .

Data Processing

Data Processing IoT Data Warehouse Cost-Benefit

Cloudera Uses CDP to Reduce IT Cloud Spend by $12 Million

Cloudera

OCTOBER 18, 2022

Like all of our customers, Cloudera depends on the Cloudera Data Platform (CDP) to manage our day-to-day analytics and operational insights. Many aspects of our business live within this modern data architecture, providing all Clouderans the ability to ask, and answer, important questions for the business.

IT

IT Data Warehouse Dashboards Visualization

BMW Cloud Efficiency Analytics powered by Amazon QuickSight and Amazon Athena

AWS Big Data

NOVEMBER 15, 2023

The difference lies in when and where data transformation takes place. In ETL, data is transformed before it’s loaded into the data warehouse. In ELT, raw data is loaded into the data warehouse first, then it’s transformed directly within the warehouse.

Analytics

Analytics Dashboards Metadata Data Warehouse

The Multifaceted Value Proposition of the Cloudera Data Platform

Cloudera

FEBRUARY 22, 2021

The Cloudera Data Platform (CDP) represents a paradigm shift in modern data architecture by addressing all existing and future analytical needs. without integration delays or having to deal with fragmented data silos that result in operational inefficiencies. . Technology cost reduction / avoidance.

Cost-Benefit

Cost-Benefit Data Warehouse Data Processing Data Governance

Synchronize data lakes with CDC-based UPSERT using open table format, AWS Glue, and Amazon MSK

AWS Big Data

JULY 31, 2024

In the current industry landscape, data lakes have become a cornerstone of modern data architecture, serving as repositories for vast amounts of structured and unstructured data. Later, we use an AWS Glue exchange, transform, and load (ETL) job for batch processing of CDC data from the S3 raw data lake.

Data Lake

Data Lake Marketing Data Processing Management

Shopping for Data

Alation

FEBRUARY 20, 2020

It’s no longer enough to build the data warehouse. Dave Wells, analyst with the Eckerson Group suggests that realizing the promise of the data warehouse requires a paradigm shift in the way we think about data along with a change in how we access and use it. Building the EDM.

Data Warehouse

Data Warehouse Metadata Data Lake Data Architecture

AI agents will transform business processes — and magnify risks

CIO Business Intelligence

AUGUST 21, 2024

When multiple independent but interactive agents are combined, each capable of perceiving the environment and taking actions, you get a multiagent system. The systems are fed the data, and trained, and then improve over time on their own.” They use gen AI to interpret complex questions and identify the most relevant data sources.

Risk

Risk Insurance Cost-Benefit Software

The hidden history of Db2

IBM Big Data Hub

JULY 5, 2022

In today’s world of complex data architectures and emerging technologies, databases can sometimes be undervalued and unrecognized. Nedbank builds a scalable data warehouse architecture . Endless data but your queries aren’t fast enough. Via analyzes customer interactions to improve AI assistance .

Data Lake

Data Lake Data Warehouse Publishing Structured Data

Visualize data quality scores and metrics generated by AWS Glue Data Quality

AWS Big Data

JUNE 6, 2023

After you create the table definition on the AWS Glue Data Catalog, you can use Athena to query the Data Catalog table. Query the Data Catalog table using Athena Athena is an interactive query service that makes it easy to analyze data in Amazon S3 and the AWS Glue Data Catalog using standard SQL.

Data Quality

Data Quality Metrics Visualization Dashboards

Data platform trinity: Competitive or complementary?

IBM Big Data Hub

JANUARY 18, 2023

They defined it as : “ A data lakehouse is a new, open data management architecture that combines the flexibility, cost-efficiency, and scale of data lakes with the data management and ACID transactions of data warehouses, enabling business intelligence (BI) and machine learning (ML) on all data. ”.

Data Lake

Data Lake Data Warehouse Data-driven Metadata

Why Invest Now? Three Investors Share the Story Behind Alation’s Series E

Alation

NOVEMBER 2, 2022

At Databricks, we’re focused on enabling customers to adopt the data lakehouse, and that’s an open data architecture that combines the best of the data warehouse and the data lake into one platform,” Ferguson says. “[The And data governance is critical to driving adoption.”.

Data Governance

Data Governance Marketing Finance Data Lake

Exploring real-time streaming for generative AI Applications

AWS Big Data

MARCH 25, 2024

AWS Glue can interact with streaming data services such as Kinesis Data Streams and Amazon MSK for processing and transforming CDC data. With Amazon Managed Service for Apache Flink Studio , you can build and run Apache Flink stream processing applications using standard SQL, Python, and Scala in an interactive notebook.

Data Lake

Data Lake Unstructured Data Management Snapshot

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

AWS Big Data

SEPTEMBER 13, 2023

The Analytics specialty practice of AWS Professional Services (AWS ProServe) helps customers across the globe with modern data architecture implementations on the AWS Cloud. Amazon Athena is used for interactive querying and AWS Lake Formation is used for access controls. He enjoys reading history and civilizations.

Data Lake

Data Lake Data Processing Metadata Snapshot

What is Data Mesh?

Ontotext

NOVEMBER 16, 2023

Figure 1 Shows the overall idea of a data mesh with the major components: What Is a Data Mesh and How Does It Work? Think of data mesh as an operational mode for organizations with a domain-driven, decentralized data architecture. Figure 2 Shows the concept of a data product: What Is a Data Contract?

Metadata

Metadata Data-driven Data Quality Data Architecture

Improve User Adoption of BI Tools with Embedded BI!

Smarten

MAY 30, 2023

They can use the data within those familiar solutions to gather and analyze data without manually exporting data or dealing with time-consuming delays and overburdened IT teams. Flexible Deployment via public or private cloud, or enterprise on-premises hardware.

Slice and Dice

Slice and Dice Business Intelligence Cost-Benefit Dashboards

How to modernize data lakes with a data lakehouse architecture

IBM Big Data Hub

JULY 5, 2023

A highly available distributed processing framework meant giving up on performance in favor of resiliency (we are talking orders of magnitude performance degradation for interactive analytics and BI). Get the ebook on the benefits of a lakehouse architecture Why modernize your data lake?

Data Lake

Data Lake Metadata Cost-Benefit Data Warehouse

The Cloud Connection: How Governance Supports Security

Alation

APRIL 14, 2022

Supports the ability to interact with the actual data and perform analysis on it. Similar to a data warehouse schema, this prep tool automates the development of the recipe to match. In today’s complex data architecture, governance has grown too complicated for manual methods. Visual Profiling.

Metadata

Metadata Data Governance Data-driven Modeling

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

Load data incrementally from transactional data lakes to data warehouses

Webinars

Trending Sources

What is a Data Mesh?

Webinars

Unlocking near real-time analytics with petabytes of transaction data using Amazon Aurora Zero-ETL integration with Amazon Redshift and dbt Cloud

Run Apache XTable in AWS Lambda for background conversion of open table formats

Centralize near-real-time governance through alerts on Amazon Redshift data warehouses for sensitive queries

Build a secure data visualization application using the Amazon Redshift Data API with AWS IAM Identity Center

Unlock scalability, cost-efficiency, and faster insights with large-scale data migration to Amazon Redshift

Data democratization: How data architecture can drive business decisions and AI initiatives

The Future of the Data Lakehouse – Open

Automate data loading from your database into Amazon Redshift using AWS Database Migration Service (DMS), AWS Step Functions, and the Redshift Data API

The Future of the Data Lakehouse – Open

Power enterprise-grade Data Vaults with Amazon Redshift – Part 1

How smava makes loans transparent and affordable using Amazon Redshift Serverless

Accelerate Amazon Redshift secure data use with Satori – Part 1

SAP Datasphere review: turning data from a technical problem to a business data product.

Combine transactional, streaming, and third-party data on Amazon Redshift for financial services

Top analytics announcements of AWS re:Invent 2024

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

Empowering data-driven excellence: How the Bluestone Data Platform embraced data mesh for success

Migrate Microsoft Azure Synapse Analytics to Amazon Redshift using AWS SCT

Create a modern data platform using the Data Build Tool (dbt) in the AWS Cloud

Your guide to AWS Analytics at AWS re:Invent 2023

Create an end-to-end data strategy for Customer 360 on AWS

Generic orchestration framework for data warehousing workloads using Amazon Redshift RSQL

AWS Lake Formation 2022 year in review

Lay the groundwork now for advanced analytics and AI

96 Percent of Businesses Can’t Be Wrong: How Hybrid Cloud Came to Dominate the Data Sector

Cloudera Uses CDP to Reduce IT Cloud Spend by $12 Million

BMW Cloud Efficiency Analytics powered by Amazon QuickSight and Amazon Athena

The Multifaceted Value Proposition of the Cloudera Data Platform

Synchronize data lakes with CDC-based UPSERT using open table format, AWS Glue, and Amazon MSK

Shopping for Data

AI agents will transform business processes — and magnify risks

The hidden history of Db2

Visualize data quality scores and metrics generated by AWS Glue Data Quality

Data platform trinity: Competitive or complementary?

Why Invest Now? Three Investors Share the Story Behind Alation’s Series E

Exploring real-time streaming for generative AI Applications

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

What is Data Mesh?

Improve User Adoption of BI Tools with Embedded BI!

How to modernize data lakes with a data lakehouse architecture

The Cloud Connection: How Governance Supports Security

Stay Connected