Analytics, Data Architecture and Data Transformation

Ingest data from Google Analytics 4 and Google Sheets to Amazon Redshift using Amazon AppFlow

AWS Big Data

JANUARY 6, 2025

Google Analytics 4 (GA4) provides valuable insights into user behavior across websites and apps. But what if you need to combine GA4 data with other sources or perform deeper analysis? It also helps you securely access your data in operational databases, data lakes, or third-party datasets with minimal movement or copying of data.

Analytics

Analytics Data Warehouse Big Data Metrics

Texas Rangers data transformation modernizes stadium operations

CIO Business Intelligence

OCTOBER 18, 2022

“In the old stadium, we just didn’t have the ability to get the data that we needed,” says Machelle Noel, manager of analytic systems at the Texas Rangers Baseball Club. Analytics, Data Management Some of our systems were old. We just didn’t have the ability that we now have in this new, state-of-the-art facility.”.

Data Transformation

Data Transformation Consulting Data Lake Reporting

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

AWS Big Data

NOVEMBER 22, 2024

At AWS, we are committed to empowering organizations with tools that streamline data analytics and transformation processes. This integration enables data teams to efficiently transform and manage data using Athena with dbt Cloud’s robust features, enhancing the overall data workflow experience.

Data Lake

Data Lake Data Warehouse Cost-Benefit Data Transformation

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Data Talks, CFOs Listen: Why Analytics Is The Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Unlocking near real-time analytics with petabytes of transaction data using Amazon Aurora Zero-ETL integration with Amazon Redshift and dbt Cloud

AWS Big Data

NOVEMBER 27, 2024

Zero-ETL integration with Amazon Redshift reduces the need for custom pipelines, preserves resources for your transactional systems, and gives you access to powerful analytics. The data in Amazon Redshift is transactionally consistent and updates are automatically and continuously propagated.

Data Warehouse

Data Warehouse Analytics Testing Sales

Data’s dark secret: Why poor quality cripples AI and growth

CIO Business Intelligence

APRIL 8, 2025

As technology and business leaders, your strategic initiatives, from AI-powered decision-making to predictive insights and personalized experiences, are all fueled by data. Yet, despite growing investments in advanced analytics and AI, organizations continue to grapple with a persistent and often underestimated challenge: poor data quality.

Data Quality

Data Quality Data-driven Key Performance Indicator Metadata

How EUROGATE established a data mesh architecture using Amazon DataZone

AWS Big Data

JANUARY 15, 2025

For container terminal operators, data-driven decision-making and efficient data sharing are vital to optimizing operations and boosting supply chain efficiency. Enhance agility by localizing changes within business domains and clear data contracts. Eliminate centralized bottlenecks and complex data pipelines.

IoT

IoT Machine Learning Metadata Data-driven

Lay the groundwork now for advanced analytics and AI

CIO Business Intelligence

AUGUST 3, 2023

When global technology company Lenovo started utilizing data analytics, they helped identify a new market niche for its gaming laptops, and powered remote diagnostics so their customers got the most from their servers and other devices.

Analytics

Analytics Data Lake Metadata Cost-Benefit

Improve Business Agility by Hiring a DataOps Engineer

DataKitchen

DECEMBER 20, 2020

Data-driven companies sense change through data analytics. Analytics tell the story of markets and customers. Analytics enable companies to understand their environment. Companies turn to their data organization to provide the analytics that stimulates creative problem-solving.

Data-driven

Data-driven Manufacturing Data Architecture Data Analytics

BMW Cloud Efficiency Analytics powered by Amazon QuickSight and Amazon Athena

AWS Big Data

NOVEMBER 15, 2023

BMW Cloud Efficiency Analytics (CLEA) is a homegrown tool developed within the BMW FinOps CoE (Center of Excellence) aiming to optimize and reduce costs across all these accounts. In this post, we explore how the BMW Group FinOps CoE implemented their Cloud Efficiency Analytics tool (CLEA), powered by Amazon QuickSight and Amazon Athena.

Dashboards

Dashboards Analytics Metadata Data Warehouse

How GamesKraft uses Amazon Redshift data sharing to support growing analytics workloads

AWS Big Data

NOVEMBER 13, 2023

Amazon Redshift is a fully managed data warehousing service that offers both provisioned and serverless options, making it more efficient to run and scale analytics without having to manage your data warehouse. Key considerations Gameskraft embraces a modern data architecture, with the data lake residing in Amazon S3.

Data Warehouse

Data Warehouse Analytics Data Lake Data Science

Introducing blueprint discovery and other UI enhancements for Amazon OpenSearch Ingestion

AWS Big Data

MAY 22, 2024

Amazon OpenSearch Ingestion is a fully managed serverless pipeline that allows you to ingest, filter, transform, enrich, and route data to an Amazon OpenSearch Service domain or Amazon OpenSearch Serverless collection. He is deeply passionate about Data Architecture and helps customers build analytics solutions at scale on AWS.

Data Architecture

Data Architecture Visualization Data Transformation Management

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

AWS Big Data

JULY 20, 2023

With data becoming the driving force behind many industries today, having a modern data architecture is pivotal for organizations to be successful. In this post, we describe Orca’s journey building a transactional data lake using Amazon Simple Storage Service (Amazon S3), Apache Iceberg, and AWS Analytics.

Data Lake

Data Lake Analytics Snapshot Data Quality

How Open Universities Australia modernized their data platform and significantly reduced their ETL costs with AWS Cloud Development Kit and AWS Step Functions

AWS Big Data

JANUARY 30, 2025

Diagram 1: Overall architecture of the solution, using AWS Step Functions, Amazon Redshift and Amazon S3 The following AWS services were used to shape our new ETL architecture: Amazon Redshift A fully managed, petabyte-scale data warehouse service in the cloud. The following Diagram 2 shows this workflow.

Data Warehouse

Data Warehouse Data Architecture Machine Learning Data Transformation

Supercharge Your Data Lakehouse with Apache Iceberg in Cloudera Data Platform

Cloudera

JUNE 30, 2022

These tools empower analysts and data scientists to easily collaborate on the same data, with their choice of tools and analytic engines. No more lock-in, unnecessary data transformations, or data movement across tools and clouds just to extract insights out of the data.

Data Lake

Data Lake Data Warehouse Data Architecture Metadata

Create a modern data platform using the Data Build Tool (dbt) in the AWS Cloud

AWS Big Data

NOVEMBER 9, 2023

It does this by helping teams handle the T in ETL (extract, transform, and load) processes. It allows users to write data transformation code, run it, and test the output, all within the framework it provides. As part of their cloud modernization initiative, they sought to migrate and modernize their legacy data platform.

Data Warehouse

Data Warehouse Testing Data Quality Reporting

Amazon Redshift data ingestion options

AWS Big Data

SEPTEMBER 5, 2024

If storing operational data in a data warehouse is a requirement, synchronization of tables between operational data stores and Amazon Redshift tables is supported. In scenarios where data transformation is required, you can use Redshift stored procedures to modify data in Redshift tables.

IoT

IoT Data Warehouse Cost-Benefit Reporting

Breaking down data silos for digital success

CIO Business Intelligence

NOVEMBER 7, 2023

Given the importance of sharing information among diverse disciplines in the era of digital transformation, this concept is arguably as important as ever. The aim is to normalize, aggregate, and eventually make available to analysts across the organization data that originates in various pockets of the enterprise.

Data Warehouse

Data Warehouse Digital Transformation Data-driven Reporting

How smava makes loans transparent and affordable using Amazon Redshift Serverless

AWS Big Data

DECEMBER 21, 2023

To speed up the self-service analytics and foster innovation based on data, a solution was needed to provide ways to allow any team to create data products on their own in a decentralized manner. To create and manage the data products, smava uses Amazon Redshift , a cloud data warehouse.

Data Lake

Data Lake Data Warehouse Data-driven B2B

Cloudera Data Engineering 2021 Year End Review

Cloudera

DECEMBER 21, 2021

We are excited to offer in Tech Preview this born-in-the-cloud table format that will help future proof data architectures at many of our public cloud customers. This enabled new use-cases with customers that were using a mix of Spark and Hive to perform data transformations. . Modernizing pipelines.

Snapshot

Snapshot Data-driven Optimization Management

Unlock scalability, cost-efficiency, and faster insights with large-scale data migration to Amazon Redshift

AWS Big Data

AUGUST 1, 2024

As data volumes continue to grow exponentially, traditional data warehousing solutions may struggle to keep up with the increasing demands for scalability, performance, and advanced analytics. However, you might face significant challenges when planning for a large-scale data warehouse migration.

Data Warehouse

Data Warehouse KPI Optimization Cost-Benefit

Automate discovery of data relationships using ML and Amazon Neptune graph technology

AWS Big Data

APRIL 19, 2023

Independent data products often only have value if you can connect them, join them, and correlate them to create a higher order data product that creates additional insights. A modern data architecture is critical in order to become a data-driven organization.

Technology

Technology Data-driven Machine Learning Sales

Data Integrity, the Basis for Reliable Insights

Sisense

AUGUST 28, 2020

The former allows us to control the data before it is generated, and the latter allows us to identify if there is an issue with our data that would impact its availability, completeness, or accuracy. Process-driven data integrity: Getting data generation right. Cleaning up data that doesn’t meet data integrity standards.

Data Integration

Data Integration Testing Data Quality Data-driven

Connecting the Data Lifecycle

Cloudera

NOVEMBER 29, 2021

Data transforms businesses. That’s where the data lifecycle comes into play. Managing data and its flow, from the edge to the cloud, is one of the most important tasks in the process of gaining data intelligence. . The company needed a modern data architecture to manage the growing traffic effectively. .

Data Lake

Data Lake Data Warehouse Data Architecture Reporting

Deep dive into the AWS ProServe Hadoop Migration Delivery Kit TCO tool

AWS Big Data

FEBRUARY 6, 2023

He helps customers innovate their business with AWS Analytics, IoT, and AI/ML services. He has a specialty in big data services and technologies and an interest in building customer business outcomes together. Jiseong Kim is a Senior Data Architect at AWS ProServe. George Zhao is a Senior Data Architect at AWS ProServe.

Dashboards

Dashboards Optimization Data Lake Cost-Benefit

Birst automates the creation of data warehouses in Snowflake

Birst BI

FEBRUARY 25, 2020

Managing large-scale data warehouse systems has been known to be very administrative, costly, and lead to analytic silos. The good news is that Snowflake, the cloud data platform, lowers costs and administrative overhead. The result is a lower total cost of ownership and trusted data and analytics.

Data Warehouse

Data Warehouse Cost-Benefit Data Architecture Enterprise

Power enterprise-grade Data Vaults with Amazon Redshift – Part 1

AWS Big Data

NOVEMBER 16, 2023

Data Vault 2.0 allows for the following: Agile data warehouse development Parallel data ingestion A scalable approach to handle multiple data sources even on the same entity A high level of automation Historization Full lineage support However, Data Vault 2.0

Enterprise

Enterprise Data Warehouse Data Lake Optimization

Measuring Maturity

Peter James Thomas

MARCH 9, 2020

Obviously things improve as you climb up the “stairs” Of course organisations may be at a more advanced stage with respect to Data Controls than they are with Analytics. Given my extensive work implementaing modern Data Architectures that deliver positive commercial outcomes, this is not a surprising state of affairs.

Measurement

Measurement Data Strategy Strategy Modeling

Best BI Tools For 2024 You Need to Know

FineReport

MARCH 31, 2024

Acting as a comprehensive solution, the best BI tools collect and analyze company data to generate easily interpretable graphs, reports, and charts , leveraging advanced data mining, analytics, and visualization techniques. Best BI Tools for Data Analysts 3.1

Dashboards

Dashboards Visualization Data mining Data-driven

Data Mesh 101: How Data Mesh Helps Organizations Be Data-Driven and Achieve Velocity

Ontotext

FEBRUARY 12, 2024

Transferring ownership of data/datasets to domain-specific units that possess a deeper understanding of rules around the data empowers teams, improves data quality and trust, and greatly accelerates the building of data models and analytics.

Data-driven

Data-driven Data Lake Data Quality Business Objectives

Introducing the AWS ProServe Hadoop Migration Delivery Kit TCO tool

AWS Big Data

FEBRUARY 6, 2023

Use case overview Migrating Hadoop workloads to Amazon EMR accelerates big data analytics modernization, increases productivity, and reduces operational cost. Refactoring coupled compute and storage to a decoupling architecture is a modern data solution. Jiseong Kim is a Senior Data Architect at AWS ProServe.

Cost-Benefit

Cost-Benefit Data Lake Dashboards Big Data

How to modernize data lakes with a data lakehouse architecture

IBM Big Data Hub

JULY 5, 2023

Data Lakes have been around for well over a decade now, supporting the analytic operations of some of the largest world corporations. This was, without a question, a significant departure from traditional analytic environments, which often meant vendor-lock in and the inability to work with data at scale.

Data Lake

Data Lake Metadata Cost-Benefit Data Warehouse

A step-by-step guide to setting up a data governance program

IBM Big Data Hub

FEBRUARY 9, 2023

In our last blog , we delved into the seven most prevalent data challenges that can be addressed with effective data governance. Today we will share our approach to developing a data governance program to drive data transformation and fuel a data-driven culture. Don’t try to do everything at once!

Data Governance

Data Governance Business Objectives Data Quality Measurement

Empowering data mesh: The tools to deliver BI excellence

erwin

APRIL 16, 2024

The data mesh framework In the dynamic landscape of data management, the search for agility, scalability, and efficiency has led organizations to explore new, innovative approaches. One such innovation gaining traction is the data mesh framework. This empowers individual teams to own and manage their data.

Metadata

Metadata Data Quality Data Governance Modeling

The Chief Marketing Officer and the CDO – A Modern Fable

Peter James Thomas

OCTOBER 30, 2018

It may well be that one thing that a CDO needs to get going is a data transformation programme. This may purely be focused on cultural aspects of how an organisation records, shares and otherwise uses data. It may be to build a new (or a first) Data Architecture. It may be to introduce or expand Data Governance.

Marketing

Marketing Strategy Data Architecture Data Strategy

Data platform trinity: Competitive or complementary?

IBM Big Data Hub

JANUARY 18, 2023

This adds an additional ETL step, making the data even more stale. Data lakehouse was created to solve these problems. The data warehouse storage layer is removed from lakehouse architectures. Instead, continuous data transformation is performed within the BLOB storage. Data mesh: A mostly new culture.

Data Lake

Data Lake Data Warehouse Data-driven Metadata

Data Landscape – Navigating The Data Jungle

Anmut

MARCH 24, 2022

We could give many answers, but they all centre on the same root cause: most data leaders focus on flashy technology and symptomatic fixes instead of approaching data transformation in a way that addresses the root causes of data problems and leads to tangible results and business success. It doesn’t have to be this way.

ROI

ROI Measurement Data-driven Data Transformation

Choosing A Graph Data Model to Best Serve Your Use Case

Ontotext

MARCH 27, 2024

It accelerates data projects with data quality and lineage and contextualizes through ontologies , taxonomies, and vocabularies, making integrations easier. RDF is used extensively for data publishing and data interchange and is based on W3C and other industry standards. LPGs are rudimentary knowledge graphs.

Modeling

Modeling Metadata Data Quality Enterprise

Showpad accelerates data maturity to unlock innovation using Amazon QuickSight

AWS Big Data

APRIL 5, 2023

The company decided to use AWS to unify its business intelligence (BI) and reporting strategy for both internal organization-wide use cases and in-product embedded analytics targeted at its customers. The platform also provides analytics and insights to support successful information sharing and fuel continuous improvement.

Dashboards

Dashboards Reporting Cost-Benefit Visualization

Build incremental data pipelines to load transactional data changes using AWS DMS, Delta 2.0, and Amazon EMR Serverless

AWS Big Data

MARCH 3, 2023

You can then apply transformations and store data in Delta format for managing inserts, updates, and deletes. Amazon EMR Serverless is a serverless option in Amazon EMR that makes it easy for data analysts and engineers to run open-source big data analytics frameworks without configuring, managing, and scaling clusters or servers.

Data Lake

Data Lake Dashboards Metrics Metadata

Successful Data Virtualisation: more than the right choice of platform

Data Virtualization

JANUARY 20, 2021

Learn in 12 minutes: What makes a strong use case for data virtualisation How to come up with a solid Proof of Concept How to prepare your organisation for data virtualisation You’ll have read all about data virtualisation and you’ve.

Data Warehouse

Data Warehouse Data Architecture Data Transformation Big Data

BHP Leverages the Denodo Platform to Create a Logical Data Fabric

Data Virtualization

APRIL 21, 2021

BHP is a global resources company headquartered in Melbourne, Australia. It is among the world’s top producers of major commodities, including iron ore, metallurgical coal, and copper, and has substantial interests in oil and gas. BHP has operations and offices.

IT

IT Data-driven Data Architecture Data Transformation

Accelerate data pipeline creation with the new visual interface in Amazon OpenSearch Ingestion

AWS Big Data

APRIL 22, 2025

Amazon OpenSearch Ingestion is a fully managed serverless pipeline that allows you to ingest, filter, transform, enrich, and route data to an Amazon OpenSearch Service domain or Amazon OpenSearch Serverless collection. He is deeply passionate about Data Architecture and helps customers build analytics solutions at scale on AWS.

Visualization

Visualization Data Transformation Management Risk

CIO 100 Award winners drive business results with IT

CIO Business Intelligence

AUGUST 7, 2024

The company started its New Analytics Era initiative by migrating its data from outdated SQL servers to a modern AWS data lake. It then built a cutting-edge cloud-based analytics platform, designed with an innovative data architecture. So was articulating the business value the data platform could deliver.

IT

IT Insurance Cost-Benefit Testing

What Is Embedded Analytics?

Jet Global

MAY 1, 2023

Introduction Why should I read the definitive guide to embedded analytics? But many companies fail to achieve this goal because they struggle to provide the reporting and analytics users have come to expect. The Definitive Guide to Embedded Analytics is designed to answer any and all questions you have about the topic.

Analytics

Analytics Cost-Benefit Visualization Dashboards

Ingest data from Google Analytics 4 and Google Sheets to Amazon Redshift using Amazon AppFlow

Texas Rangers data transformation modernizes stadium operations

Webinars

Trending Sources

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

Webinars

Unlocking near real-time analytics with petabytes of transaction data using Amazon Aurora Zero-ETL integration with Amazon Redshift and dbt Cloud

Data’s dark secret: Why poor quality cripples AI and growth

How EUROGATE established a data mesh architecture using Amazon DataZone

Lay the groundwork now for advanced analytics and AI

Improve Business Agility by Hiring a DataOps Engineer

BMW Cloud Efficiency Analytics powered by Amazon QuickSight and Amazon Athena

How GamesKraft uses Amazon Redshift data sharing to support growing analytics workloads

Introducing blueprint discovery and other UI enhancements for Amazon OpenSearch Ingestion

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

How Open Universities Australia modernized their data platform and significantly reduced their ETL costs with AWS Cloud Development Kit and AWS Step Functions

Supercharge Your Data Lakehouse with Apache Iceberg in Cloudera Data Platform

Create a modern data platform using the Data Build Tool (dbt) in the AWS Cloud

Amazon Redshift data ingestion options

Breaking down data silos for digital success

How smava makes loans transparent and affordable using Amazon Redshift Serverless

Cloudera Data Engineering 2021 Year End Review

Unlock scalability, cost-efficiency, and faster insights with large-scale data migration to Amazon Redshift

Automate discovery of data relationships using ML and Amazon Neptune graph technology

Data Integrity, the Basis for Reliable Insights

Connecting the Data Lifecycle

Deep dive into the AWS ProServe Hadoop Migration Delivery Kit TCO tool

Birst automates the creation of data warehouses in Snowflake

Power enterprise-grade Data Vaults with Amazon Redshift – Part 1

Measuring Maturity

Best BI Tools For 2024 You Need to Know

Data Mesh 101: How Data Mesh Helps Organizations Be Data-Driven and Achieve Velocity

Introducing the AWS ProServe Hadoop Migration Delivery Kit TCO tool

How to modernize data lakes with a data lakehouse architecture

A step-by-step guide to setting up a data governance program

Empowering data mesh: The tools to deliver BI excellence

The Chief Marketing Officer and the CDO – A Modern Fable

Data platform trinity: Competitive or complementary?

Data Landscape – Navigating The Data Jungle

Choosing A Graph Data Model to Best Serve Your Use Case

Showpad accelerates data maturity to unlock innovation using Amazon QuickSight

Build incremental data pipelines to load transactional data changes using AWS DMS, Delta 2.0, and Amazon EMR Serverless

Successful Data Virtualisation: more than the right choice of platform

BHP Leverages the Denodo Platform to Create a Logical Data Fabric

Accelerate data pipeline creation with the new visual interface in Amazon OpenSearch Ingestion

CIO 100 Award winners drive business results with IT

What Is Embedded Analytics?

Stay Connected