Data Architecture, Data Transformation and Modeling

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

AWS Big Data

NOVEMBER 22, 2024

The need for streamlined data transformations: As organizations increasingly adopt cloud-based data lakes and warehouses, the demand for efficient data transformation tools has grown. This enables you to extract insights from your data without the complexity of managing infrastructure.

Data Lake

Data Lake Data Warehouse Cost-Benefit Data Transformation

Data’s dark secret: Why poor quality cripples AI and growth

CIO Business Intelligence

APRIL 8, 2025

These strategies, such as investing in AI-powered cleansing tools and adopting federated governance models, not only address the current data quality challenges but also pave the way for improved decision-making, operational efficiency and customer satisfaction. When financial data is inconsistent, reporting becomes unreliable.

Data Quality

Data Quality Data-driven Key Performance Indicator Metadata

Texas Rangers data transformation modernizes stadium operations

CIO Business Intelligence

OCTOBER 18, 2022

She decided to bring Resultant in to assist, starting with the firm’s strategic data assessment (SDA) framework, which evaluates a client’s data challenges in terms of people and processes, data models and structures, data architecture and platforms, visual analytics and reporting, and advanced analytics.

Data Transformation

Data Transformation Consulting Data Lake Reporting

Making OT-IT integration a reality with new data architectures and generative AI

CIO Business Intelligence

FEBRUARY 20, 2024

A number of industry leaders are already experimenting with advanced AI use cases, including Denso, a leading mobility supplier that develops advanced technology and components for nearly every vehicle make and model on the road today. Denso uses AI to verify the structuring of unstructured data from across its organisation.

Data Architecture

Data Architecture Unstructured Data Manufacturing IT

How EUROGATE established a data mesh architecture using Amazon DataZone

AWS Big Data

JANUARY 15, 2025

Need for a data mesh architecture: Because entities in the EUROGATE group generate vast amounts of data from various sources (across departments, locations, and technologies), the traditional centralized data architecture struggles to keep up with the demands for real-time insights, agility, and scalability.

IoT

IoT Machine Learning Metadata Data-driven

Unlocking near real-time analytics with petabytes of transaction data using Amazon Aurora Zero-ETL integration with Amazon Redshift and dbt Cloud

AWS Big Data

NOVEMBER 27, 2024

Together with price-performance, Amazon Redshift offers capabilities such as serverless architecture, machine learning integration within your data warehouse and secure data sharing across the organization. dbt Cloud is a hosted service that helps data teams productionize dbt deployments. Create dbt models in dbt Cloud.

Data Warehouse

Data Warehouse Analytics Testing Sales

Lay the groundwork now for advanced analytics and AI

CIO Business Intelligence

AUGUST 3, 2023

But reaching all these goals, as well as using enterprise data for generative AI to streamline the business and develop new services, requires a proper foundation. That hard, ongoing work includes integrating siloed data, modeling, and understanding it, as well as maintaining and securing it over time.

Analytics

Analytics Data Lake Metadata Cost-Benefit

How Open Universities Australia modernized their data platform and significantly reduced their ETL costs with AWS Cloud Development Kit and AWS Step Functions

AWS Big Data

JANUARY 30, 2025

Pattern 1: Data transformation, load, and unload. Several of our data pipelines included significant data transformation steps, which were primarily performed through SQL statements executed by Amazon Redshift. The following Diagram 2 shows this workflow.

Data Warehouse

Data Warehouse Data Architecture Machine Learning Data Transformation

Choosing A Graph Data Model to Best Serve Your Use Case

Ontotext

MARCH 27, 2024

For example, GPS, social media, cell phone handoffs are modeled as graphs while data catalogs, data lineage and MDM tools leverage knowledge graphs for linking metadata with semantics. Knowledge graphs model knowledge of a domain as a graph with a network of entities and relationships.
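As a rough illustration of the excerpt above, a knowledge graph can be held as a set of (subject, predicate, object) triples. The sketch below is a minimal in-memory version only; the entity names (`orders_table`, `raw_orders`, `landing_files`) and the predicates `is_a` and `derived_from` are hypothetical and do not come from the article or from any particular graph product.

```python
from collections import defaultdict

# Hypothetical triples (subject, predicate, object); all names are illustrative.
TRIPLES = [
    ("orders_table", "is_a", "Dataset"),
    ("orders_table", "derived_from", "raw_orders"),
    ("raw_orders", "is_a", "Dataset"),
    ("raw_orders", "derived_from", "landing_files"),
]


def build_index(triples):
    """Group outgoing (predicate, object) edges by subject."""
    index = defaultdict(list)
    for subject, predicate, obj in triples:
        index[subject].append((predicate, obj))
    return index


def lineage(index, start):
    """Follow 'derived_from' edges upstream from start, stopping on a cycle."""
    path, seen, node = [], {start}, start
    while True:
        parents = [o for p, o in index.get(node, []) if p == "derived_from"]
        if not parents or parents[0] in seen:
            return path
        node = parents[0]
        seen.add(node)
        path.append(node)


upstream = lineage(build_index(TRIPLES), "orders_table")
print(upstream)
```

Walking the `derived_from` edges is one simple way a lineage question ("where did this table come from?") turns into a graph traversal.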

Modeling

Modeling Metadata Data Quality Enterprise

Breaking down data silos for digital success

CIO Business Intelligence

NOVEMBER 7, 2023

Given the importance of sharing information among diverse disciplines in the era of digital transformation, this concept is arguably as important as ever. The aim is to normalize, aggregate, and eventually make available to analysts across the organization data that originates in various pockets of the enterprise.

Data Warehouse

Data Warehouse Digital Transformation Data-driven Reporting

Supercharge Your Data Lakehouse with Apache Iceberg in Cloudera Data Platform

Cloudera

JUNE 30, 2022

These tools empower analysts and data scientists to easily collaborate on the same data, with their choice of tools and analytic engines. No more lock-in, unnecessary data transformations, or data movement across tools and clouds just to extract insights out of the data.

Data Lake

Data Lake Data Warehouse Data Architecture Metadata

BMW Cloud Efficiency Analytics powered by Amazon QuickSight and Amazon Athena

AWS Big Data

NOVEMBER 15, 2023

The difference lies in when and where data transformation takes place. In ETL, data is transformed before it’s loaded into the data warehouse. In ELT, raw data is loaded into the data warehouse first, then it’s transformed directly within the warehouse.
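The ETL/ELT distinction in the excerpt can be sketched with Python's built-in `sqlite3` module standing in for a data warehouse. This is only an illustration under that assumption; the table names, the sample rows, and the cleaning rule (trim whitespace, cast to a number) are made up for the example.

```python
import sqlite3

# Hypothetical raw records: amounts arrive as strings with stray whitespace.
RAW_ROWS = [("a-1", " 10.50 "), ("a-2", "3.25"), ("a-3", " 7 ")]


def run_etl(conn):
    """ETL: transform in application code, then load only the cleaned rows."""
    conn.execute("CREATE TABLE etl_costs (id TEXT, amount REAL)")
    cleaned = [(row_id, float(amount.strip())) for row_id, amount in RAW_ROWS]
    conn.executemany("INSERT INTO etl_costs VALUES (?, ?)", cleaned)
    return conn.execute("SELECT SUM(amount) FROM etl_costs").fetchone()[0]


def run_elt(conn):
    """ELT: load raw rows untouched, then transform with SQL inside the store."""
    conn.execute("CREATE TABLE raw_costs (id TEXT, amount TEXT)")
    conn.executemany("INSERT INTO raw_costs VALUES (?, ?)", RAW_ROWS)
    conn.execute(
        "CREATE TABLE elt_costs AS "
        "SELECT id, CAST(TRIM(amount) AS REAL) AS amount FROM raw_costs"
    )
    return conn.execute("SELECT SUM(amount) FROM elt_costs").fetchone()[0]


demo_conn = sqlite3.connect(":memory:")
print(run_etl(demo_conn), run_elt(demo_conn))
```

Both paths produce the same totals; the difference is that the ELT path keeps the untouched `raw_costs` table in the store, so the transformation can be rerun or changed later without re-extracting the data.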

Dashboards

Dashboards Analytics Metadata Data Warehouse

Cloudera Data Engineering 2021 Year End Review

Cloudera

DECEMBER 21, 2021

Cloudera’s Shared Data Experience (SDX) provides all these capabilities allowing seamless data sharing across all the Data Services including CDE. We are excited to offer in Tech Preview this born-in-the-cloud table format that will help future proof data architectures at many of our public cloud customers.

Snapshot

Snapshot Data-driven Optimization Data Architecture

Create a modern data platform using the Data Build Tool (dbt) in the AWS Cloud

AWS Big Data

NOVEMBER 9, 2023

In this post, we delve into a case study for a retail use case, exploring how the Data Build Tool (dbt) was used effectively within an AWS environment to build a high-performing, efficient, and modern data platform. It does this by helping teams handle the T in ETL (extract, transform, and load) processes.

Data Warehouse

Data Warehouse Testing Data Quality Reporting

Connecting the Data Lifecycle

Cloudera

NOVEMBER 29, 2021

Data transforms businesses. That’s where the data lifecycle comes into play. Managing data and its flow, from the edge to the cloud, is one of the most important tasks in the process of gaining data intelligence. The company needed a modern data architecture to manage the growing traffic effectively.

Data Lake

Data Lake Data Warehouse Data Architecture Reporting

Unlock scalability, cost-efficiency, and faster insights with large-scale data migration to Amazon Redshift

AWS Big Data

AUGUST 1, 2024

However, you might face significant challenges when planning for a large-scale data warehouse migration. The following diagram illustrates a scalable migration pattern for extract, transform, and load (ETL) scenario. The success criteria are the key performance indicators (KPIs) for each component of the data workflow.

Data Warehouse

Data Warehouse KPI Optimization Cost-Benefit

Power enterprise-grade Data Vaults with Amazon Redshift – Part 1

AWS Big Data

NOVEMBER 16, 2023

As with all AWS services, Amazon Redshift is a customer-obsessed service that recognizes there isn’t a one-size-fits-all for customers when it comes to data models, which is why Amazon Redshift supports multiple data models such as Star Schemas, Snowflake Schemas and Data Vault. Data Vault 2.0

Enterprise

Enterprise Data Warehouse Data Lake Optimization

Amazon Redshift data ingestion options

AWS Big Data

SEPTEMBER 5, 2024

If storing operational data in a data warehouse is a requirement, synchronization of tables between operational data stores and Amazon Redshift tables is supported. In scenarios where data transformation is required, you can use Redshift stored procedures to modify data in Redshift tables.

IoT

IoT Data Warehouse Cost-Benefit Reporting

Automate discovery of data relationships using ML and Amazon Neptune graph technology

AWS Big Data

APRIL 19, 2023

Independent data products often only have value if you can connect them, join them, and correlate them to create a higher order data product that creates additional insights. A modern data architecture is critical in order to become a data-driven organization.

Technology

Technology Data-driven Machine Learning Sales

How smava makes loans transparent and affordable using Amazon Redshift Serverless

AWS Big Data

DECEMBER 21, 2023

Overview of solution: As a data-driven company, smava relies on the AWS Cloud to power their analytics use cases. smava ingests data from various external and internal data sources into a landing stage on the data lake based on Amazon Simple Storage Service (Amazon S3).

Data Lake

Data Lake Data Warehouse Data-driven B2B

Measuring Maturity

Peter James Thomas

MARCH 9, 2020

I last wrote about the process of creating a Data Strategy back in 2014 and – with the many changes that the field has seen since then – am overdue publishing an update, so watch this space. Such a model presents a series of states into which an organisation may fall with respect to its data.

Measurement

Measurement Data Strategy Strategy Modeling

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

AWS Big Data

JULY 20, 2023

With data becoming the driving force behind many industries today, having a modern data architecture is pivotal for organizations to be successful. This data is then used by various applications for streaming analytics, business intelligence, and reporting. This ensures that the data is suitable for training purposes.

Data Lake

Data Lake Analytics Snapshot Data Quality

Empowering data mesh: The tools to deliver BI excellence

erwin

APRIL 16, 2024

The data mesh approach distributes data ownership and decentralizes data architecture, paving the way for enhanced agility and scalability. With distributed ownership there is a need for effective governance to ensure the success of any data initiative. This empowers individual teams to own and manage their data.

Metadata

Metadata Data Quality Data Governance Modeling

Birst automates the creation of data warehouses in Snowflake

Birst BI

FEBRUARY 25, 2020

Customers such as Crossmark, DJO Global and others use Birst with Snowflake to deliver the ultimate modern data architecture. Data never leaves Snowflake with Birst’s ability to support the reporting and self-service needs of both centralized IT and decentralized LOB teams.

Data Warehouse

Data Warehouse Cost-Benefit Data Architecture Enterprise

Best BI Tools For 2024 You Need to Know

FineReport

MARCH 31, 2024

Furthermore, these tools boast customization options, allowing users to tailor data sources to address areas critical to their business success, thereby generating actionable insights and customizable reports. Best BI Tools for Data Analysts 3.1 Key Features: Extensive library of pre-built connectors for diverse data sources.

Dashboards

Dashboards Visualization Data mining Data-driven

Data Mesh 101: How Data Mesh Helps Organizations Be Data-Driven and Achieve Velocity

Ontotext

FEBRUARY 12, 2024

Transferring ownership of data/datasets to domain-specific units that possess a deeper understanding of rules around the data empowers teams, improves data quality and trust, and greatly accelerates the building of data models and analytics. However, data mesh is not about introducing new technologies.

Data-driven

Data-driven Data Lake Data Quality Business Objectives

A step-by-step guide to setting up a data governance program

IBM Big Data Hub

FEBRUARY 9, 2023

In our last blog, we delved into the seven most prevalent data challenges that can be addressed with effective data governance. Today we will share our approach to developing a data governance program to drive data transformation and fuel a data-driven culture. Don’t try to do everything at once!

Data Governance

Data Governance Business Objectives Data Quality Measurement

Data platform trinity: Competitive or complementary?

IBM Big Data Hub

JANUARY 18, 2023

Furthermore, data warehouse storage cannot support workloads like Artificial Intelligence (AI) or Machine Learning (ML), which require huge amounts of data for model training. For these workloads, data lake vendors usually recommend extracting data into flat files to be used solely for model training and testing purposes.

Data Lake

Data Lake Data Warehouse Data-driven Metadata

The Chief Marketing Officer and the CDO – A Modern Fable

Peter James Thomas

OCTOBER 30, 2018

It may well be that one thing that a CDO needs to get going is a data transformation programme. This may purely be focused on cultural aspects of how an organisation records, shares and otherwise uses data. It may be to build a new (or a first) Data Architecture. It may be to introduce or expand Data Governance.

Marketing

Marketing Strategy Data Architecture Data Strategy

Data Landscape – Navigating The Data Jungle

Anmut

MARCH 24, 2022

After decades in the background, data is currently king of the business world. Visionary companies like Google and Amazon are renowned for figuring out the transformational power of data, using data-driven business models to achieve extraordinary success. It doesn’t have to be this way.

ROI

ROI Measurement Data-driven Data Transformation

Showpad accelerates data maturity to unlock innovation using Amazon QuickSight

AWS Big Data

APRIL 5, 2023

QuickSight would help local data stewards, who weren’t technical but knew the use cases intimately, to create their own dashboards and prototype them with their customers before promoting them through the product. The serverless model was also compelling because we did not have to pay for server instances nor license fees per reader.

Dashboards

Dashboards Reporting Cost-Benefit Visualization

CIO 100 Award winners drive business results with IT

CIO Business Intelligence

AUGUST 7, 2024

“This project represents a transformative initiative designed to address the evolving landscape of cyber threats,” says Kunal Krushev, head of cybersecurity automation and intelligence with the firm’s Corporate IT — Digital Infrastructure Services. “We ... The system complements preconfigured components, workflows, and libraries.

IT

IT Insurance Cost-Benefit Testing

What Is Embedded Analytics?

Jet Global

MAY 1, 2023

These licensing terms are critical: Perpetual license vs subscription: Subscription is a pay-as-you-go model that provides flexibility as you evaluate a vendor. Pricing model: The pricing scale is dependent on several factors. Some cloud applications can even provide new benchmarks based on customer data.

Analytics

Analytics Cost-Benefit Visualization Dashboards

“You Complete Me,” said Data Lineage to DataOps Observability.

DataKitchen

JANUARY 23, 2023

Like an apartment blueprint, Data lineage provides a written document that is only marginally useful during a crisis. This is especially true in the case of the one-to-many, producer-to-consumer relationships we have on our data architecture. Are problems with data tests? Which report tab is wrong? When did it last run?

Testing

Testing Data Governance Data Quality Data-driven
