At IKEA, the global home furnishings leader, data is more than an operational necessity—it’s a strategic asset. In a recent presentation at the SAPSA Impuls event in Stockholm, George Sandu, IKEA’s Master Data Leader, shared the company’s data transformation story, offering valuable lessons for organizations navigating similar challenges.
I was all set to present this session at the European Collaboration Summit in November 2021, but the organizers needed to change the time and date of my session, rescheduling it to take place after I’d left to go back home. Then, we will move towards powerful but simple-to-use data types in R such as data frames.
“In the strategic data assessment, people were like, ‘Oh, you can show us the ice cream sales?’ When the business operations team first presented the new end-of-game report in an executive meeting, the owners asked to be included. I think you have to toot your own horn that, yes, we have this information available.”
The dashboard now in production uses Databricks’ Azure data lake to ingest, clean, store, and analyze the data, and Microsoft’s Power BI to generate graphical analytics that present critical operational data in a single view, such as the number of flights coming into domestic and international terminals and average security wait times.
Data collections are the ones and zeroes that encode the actionable insights (patterns, trends, relationships) that we seek to extract from our data through machine learning and data science. Live online presentations, demos, and customer testimonials were complemented with new content posted at sap.com/datasphere.
It’s Essential: Verifying Data Transformations (Part 4). Uncovering the leading problems in data transformation workflows and practical ways to detect and prevent them. In Parts 1–3 of this series of blogs, categories of data transformations were identified as among the top causes of data quality defects in data pipeline workflows.
Complex Data Transformations: Test Planning Best Practices. Ensuring data accuracy with structured testing and best practices. Introduction: Data transformations and conversions are crucial for data pipelines, enabling organizations to process, integrate, and refine raw data into meaningful insights.
Common challenges and practical mitigation strategies for reliable data transformations. Introduction: Data transformations are important processes in data engineering, enabling organizations to structure, enrich, and integrate data for analytics, reporting, and operational decision-making.
Additionally, integrating mainframe data with the cloud enables enterprises to feed information into data lakes and data lakehouses, making it easy for authorized data professionals to leverage the best and most modern tools for analytics and forecasting. Four key challenges prevent them from doing so: 1.
The Airflow REST API facilitates a wide range of use cases, from centralizing and automating administrative tasks to building event-driven, data-aware data pipelines. In this post, we discuss the enhancement and present several use cases that the enhancement unlocks for your Amazon MWAA environment.
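As a hedged sketch of the first use case, the snippet below triggers a DAG run through Airflow’s stable REST API; the endpoint, credentials, and DAG name are placeholders, and on Amazon MWAA the authentication flow differs (it issues IAM-based tokens), so treat this as the generic open-source pattern:

```python
import requests  # assumption: a vanilla Airflow 2.x deployment with basic auth enabled

AIRFLOW_HOST = "http://localhost:8080"  # placeholder endpoint
DAG_ID = "daily_sales_etl"              # hypothetical DAG

# POST /api/v1/dags/{dag_id}/dagRuns starts a new run -- the same call an
# event-driven producer could make when fresh data lands.
resp = requests.post(
    f"{AIRFLOW_HOST}/api/v1/dags/{DAG_ID}/dagRuns",
    auth=("airflow", "airflow"),  # placeholder credentials
    json={"conf": {"source": "s3://my-bucket/raw/2024-06-01/"}},  # hypothetical run config
    timeout=30,
)
resp.raise_for_status()
print(resp.json()["dag_run_id"])
```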
In this post, we’ll walk through an example ETL process that uses session reuse to efficiently create, populate, and query temporary staging tables across the full data transformation workflow—all within the same persistent Amazon Redshift database session.
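The post’s approach relies on Redshift session reuse; as a minimal stand-in for the idea, the sketch below uses a single persistent connection (one connection, one session), since temporary staging tables survive exactly as long as the session that created them. Connection details and table names are placeholders:

```python
import psycopg2  # assumption: connecting to Redshift over the Postgres wire protocol

# One persistent connection == one database session. A temp table created here
# is visible to every later statement on the same connection and is dropped
# automatically when the session ends.
conn = psycopg2.connect(host="my-cluster.example.com", port=5439,
                        dbname="dev", user="etl_user", password="...")  # placeholders
with conn, conn.cursor() as cur:
    cur.execute("CREATE TEMP TABLE stg_orders (LIKE public.orders);")
    cur.execute("INSERT INTO stg_orders SELECT * FROM public.orders "
                "WHERE order_ts >= current_date;")
    cur.execute("SELECT count(*) FROM stg_orders;")  # same session, table still exists
    print(cur.fetchone()[0])
# Closing the connection ends the session and discards stg_orders.
conn.close()
```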
Once data is deemed high-quality, critical business processes and functions should run more efficiently and accurately, with a higher ROI and lower costs. Data Quality Management Best Practices. This means there are no unintended data errors, and the data corresponds to its appropriate designation (e.g., date, month, and year).
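As a toy illustration of that last point (data matching its designated type), here is a hypothetical pandas check; the column and values are invented:

```python
import pandas as pd

# Invented sample: a column designated to hold dates.
df = pd.DataFrame({"order_date": ["2024-01-05", "2024-02-30", "not a date"]})

# Values that fail to parse violate their designated type (errors="coerce" -> NaT).
parsed = pd.to_datetime(df["order_date"], errors="coerce")
bad_rows = df[parsed.isna()]
print(f"{len(bad_rows)} of {len(df)} rows fail the date check")
```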
The rise of SaaS business intelligence tools is answering that need, providing a dynamic vessel for presenting and interacting with essential insights in a way that is digestible and accessible. The future is bright for logistics companies that are willing to take advantage of big data.
To make the data scientists maximally productive, we want to provide supporting software architecture in terms of APIs and libraries that allow them to focus on data, not on the machines. Data Science Layers. However, none of these layers help with modeling and optimization. Model Development.
Airflow has been adopted by many Cloudera Data Platform (CDP) customers in the public cloud as the next-generation orchestration service to set up and operationalize complex data pipelines. This presented challenges for users in building more complex multi-step pipelines that are typical of DE workflows.
Amazon OpenSearch Ingestion is a fully managed serverless pipeline that allows you to ingest, filter, transform, enrich, and route data to an Amazon OpenSearch Service domain or Amazon OpenSearch Serverless collection. When you create a new pipeline on the OpenSearch Service console, you’re presented with a new catalog page.
We also split the data transformation into several modules (Data Aggregation, Data Filtering, and Data Preparation) to make the system more transparent and easier to maintain. Although each module is specific to a data source or a particular data transformation, we utilize reusable blocks inside of every job.
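A hedged sketch of what such modular jobs can look like in Python; the module names mirror the ones above, but every function and column here is invented:

```python
import pandas as pd

def filter_records(df: pd.DataFrame) -> pd.DataFrame:
    """Data Filtering: keep only active records (invented rule)."""
    return df[df["status"] == "active"]

def aggregate_records(df: pd.DataFrame) -> pd.DataFrame:
    """Data Aggregation: roll amounts up per day."""
    return df.groupby("day", as_index=False)["amount"].sum()

def prepare_output(df: pd.DataFrame) -> pd.DataFrame:
    """Data Preparation: final renaming for downstream consumers."""
    return df.rename(columns={"amount": "daily_amount"})

def run_job(df: pd.DataFrame) -> pd.DataFrame:
    # A job is just a composition of reusable blocks.
    for step in (filter_records, aggregate_records, prepare_output):
        df = step(df)
    return df
```

Keeping each block side-effect-free makes the modules easy to reuse across jobs and to test in isolation.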
The extraction of raw data, its transformation into a format suited to business needs, and its loading into a data warehouse. Data transformation: this process helps to transform raw data into clean data that can be analysed and aggregated. Data analytics and visualisation.
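A compact, hypothetical illustration of those three steps, with invented file names and columns (a real load step would target a warehouse rather than a local file):

```python
import pandas as pd

# Extract: read raw data from a source system (a CSV stands in here).
raw = pd.read_csv("raw_orders.csv")  # hypothetical source file

# Transform: clean raw data so it can be analysed and aggregated.
clean = (raw.dropna(subset=["order_id"])
            .assign(order_ts=lambda d: pd.to_datetime(d["order_ts"], errors="coerce"))
            .dropna(subset=["order_ts"]))

# Load: write to the warehouse (a local parquet file stands in here).
clean.to_parquet("warehouse/orders.parquet", index=False)
```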
Jason is the CDO for Company X, responsible for six teams of data professionals across several locations. The breadth of his teams’ work and the technologies they use present a significant challenge to his main goal: to deliver new and useful analytics solutions to the business.
The company’s orthodontics business, for instance, makes heavy use of image processing to the point that unstructured data is growing at a pace of roughly 20% to 25% per month. Advances in imaging technology present Straumann Group with the opportunity to provide its customers with new capabilities to offer their clients.
Division-level metadata management and data governance – A centrally hosted service provides domain teams with the capability to publish their data products along with relevant metadata, like business definitions and lineage. A data portal for consumers to discover data products and access associated metadata.
If we want to overcome the challenges of such transformations in sustainable ways, we need to look for solutions from multidimensional perspectives. The role of knowledge graphs in AECO transformation: at present, knowledge graphs are the best-known technology capable of offering decentralized ways of going beyond existing data silos.
What is the difference between business analytics and data analytics? Business analytics is a subset of data analytics. Data analytics is used across disciplines to find trends and solve problems using data mining, data cleansing, data transformation, data modeling, and more.
For the updateRequestProcessorChain, OpenSearch provides the ingest pipeline, allowing the enrichment or transformation of data before indexing. Multiple processor stages can be chained to form a pipeline for data transformation. However, it’s strongly recommended to do the data transformation outside OpenSearch.
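As a minimal sketch, the snippet below defines an ingest pipeline with two chained processor stages and indexes a document through it; the endpoint, credentials, index, and field names are placeholders:

```python
import requests  # assumption: an OpenSearch endpoint reachable with basic auth

HOST = "http://localhost:9200"  # placeholder endpoint
AUTH = ("admin", "admin")       # placeholder credentials

pipeline = {
    "description": "Normalize log records before indexing",
    "processors": [                                     # stages run in order
        {"lowercase": {"field": "level"}},              # stage 1: normalize log level
        {"set": {"field": "ingested", "value": True}},  # stage 2: stamp the document
    ],
}
requests.put(f"{HOST}/_ingest/pipeline/logs-pipeline",
             json=pipeline, auth=AUTH, timeout=30).raise_for_status()

# Index a document through the pipeline.
requests.post(f"{HOST}/logs/_doc", params={"pipeline": "logs-pipeline"},
              json={"level": "ERROR", "msg": "disk full"},
              auth=AUTH, timeout=30).raise_for_status()
```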
In this post, we delve into a case study for a retail use case, exploring how the Data Build Tool (dbt) was used effectively within an AWS environment to build a high-performing, efficient, and modern data platform. It does this by helping teams handle the T in ETL (extract, transform, and load) processes.
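As a hedged sketch of what handling the T looks like in practice, the snippet below invokes a single dbt model programmatically, assuming dbt-core 1.5+ and an already-configured project; the model name is invented:

```python
from dbt.cli.main import dbtRunner  # assumption: dbt-core >= 1.5 installed

# A dbt model is just a SELECT saved under models/, e.g. models/stg_orders.sql:
#   select order_id, lower(status) as status
#   from {{ source('shop', 'raw_orders') }}

res = dbtRunner().invoke(["run", "--select", "stg_orders"])  # materialize one model
print("success:", res.success)
```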
These acquisitions usher in a new era of “self-service” by automating complex operations so customers can focus on building great data-driven apps instead of managing infrastructure. Datacoral powers fast and easy data transformations for any type of data via a robust multi-tenant SaaS architecture that runs in AWS.
Before we dive in, let’s define strands of AI, Machine Learning and Data Science: Business intelligence (BI) leverages software and services to transform data into actionable insights that inform an organization’s strategic and tactical business decisions.
Data Vault 2.0 allows for the following: agile data warehouse development, parallel data ingestion, a scalable approach to handling multiple data sources even on the same entity, a high level of automation, historization, and full lineage support. However, Data Vault 2.0
Data analytics draws from a range of disciplines — including computer programming, mathematics, and statistics — to perform analysis on data in an effort to describe, predict, and improve performance. What are the four types of data analytics? In business analytics, this is the purview of business intelligence (BI).
Adding data transformation details to metadata can be challenging because of the dispersed nature of this information across data processing pipelines, making it difficult to extract and incorporate into table-level metadata. Maintaining lists of possible values for the columns requires continuous updates.
The difference lies in when and where data transformation takes place. In ETL, data is transformed before it’s loaded into the data warehouse. In ELT, raw data is loaded into the data warehouse first, then it’s transformed directly within the warehouse.
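A schematic contrast of the two orderings, with an invented source file, a placeholder DSN, and invented table names:

```python
import pandas as pd
from sqlalchemy import create_engine, text  # assumption: a SQL warehouse reachable via SQLAlchemy

engine = create_engine("postgresql://user:pass@warehouse:5432/dw")  # placeholder DSN
raw = pd.read_csv("raw_events.csv")                                 # hypothetical extract

# ETL: transform in the pipeline, then load the finished table.
etl_ready = raw.dropna(subset=["ts"]).assign(ts=lambda d: pd.to_datetime(d["ts"]))
etl_ready.to_sql("events", engine, if_exists="replace", index=False)

# ELT: load the raw data first, then transform inside the warehouse with SQL.
raw.to_sql("raw_events", engine, if_exists="replace", index=False)
with engine.begin() as conn:
    conn.execute(text(
        "CREATE TABLE events_elt AS "
        "SELECT *, CAST(ts AS timestamp) AS ts_parsed "
        "FROM raw_events WHERE ts IS NOT NULL"
    ))
```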
The rapid adoption of serverless data lake architectures—with ever-growing datasets that need to be ingested from a variety of sources, followed by complex data transformation and machine learning (ML) pipelines—can present a challenge.
The data organization wants to run the Value Pipeline as robustly as a Six Sigma factory, and it must be able to implement and deploy process improvements as rapidly as a Silicon Valley start-up. The data engineer builds data transformations. Their product is the data.
After the read query validation stage was complete and we were satisfied with the performance, we reconnected our orchestrator so that the data transformation queries could be run in the new cluster. At this point, only one-time queries and those made by Amazon QuickSight reached the new cluster.
The data science algorithm Valentine is an effective tool for this. Valentine is presented in the paper Valentine: Evaluating Matching Techniques for Dataset Discovery (2021, Koutras et al.). Joel has led data transformation projects on fraud analytics, claims automation, and data governance.
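Valentine’s own API isn’t reproduced here; as a toy illustration of the schema-matching task the paper evaluates, the sketch below scores column-name similarity between two invented schemas:

```python
from difflib import SequenceMatcher
from itertools import product

cols_a = ["customer_id", "claim_amount", "claim_date"]   # invented schema A
cols_b = ["cust_id", "amount_claimed", "date_of_claim"]  # invented schema B

# Score every column pair by name similarity -- a crude stand-in for the
# matchers (COMA, Cupid, and others) that the Valentine paper benchmarks.
pairs = sorted(
    ((a, b, SequenceMatcher(None, a, b).ratio()) for a, b in product(cols_a, cols_b)),
    key=lambda t: -t[2],
)
for a, b, score in pairs[:3]:
    print(f"{a:13s} ~ {b:15s} {score:.2f}")
```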
Whether you’re looking to earn a certification from an accredited university, gain experience as a new grad, hone vendor-specific skills, or demonstrate your knowledge of data analytics, the following certifications (presented in alphabetical order) will work for you. Not finding what you’re looking for?
You can check out a self-service data prep flow from catalog to viz in this recorded version here. Rita Sallam introduces the Data Prep Rodeo. Which brings me to memorable moment #2 on Tuesday, when Rita Sallam took the stage for her drill-down presentation on Self-Service Data Prep.
Typically I break each of the areas appearing in boxes above into sub-areas, score the organisation against these, roll the results back up and present them back to the client with accompanying commentary; normally also including some sort of benchmark for comparison [2].
With these features, you can now build data pipelines completely in standard SQL that are serverless, simpler to build, and able to operate at scale. Typically, data transformation processes are used to perform this operation, and a final consistent view is stored in an S3 bucket or folder.
This post presents one possible approach to addressing this challenge in an Amazon Redshift data warehouse. Stored procedures: stored procedures are commonly used to encapsulate logic for data transformation, data validation, and business-specific logic (all URIDs that are present).
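As a hedged sketch of that pattern, the snippet below creates and calls a simple transformation procedure over a Postgres-protocol connection (Redshift stored procedures are written in PL/pgSQL); connection details and all table and column names are invented:

```python
import psycopg2  # assumption: Redshift reached over the Postgres wire protocol

DDL = """
CREATE OR REPLACE PROCEDURE sp_load_clean_orders()
AS $$
BEGIN
    -- Hypothetical transformation: rebuild the reporting table from valid rows only.
    DELETE FROM reporting.orders;
    INSERT INTO reporting.orders
    SELECT order_id, customer_id, order_ts, amount
    FROM staging.orders
    WHERE amount >= 0 AND order_ts IS NOT NULL;
END;
$$ LANGUAGE plpgsql;
"""

conn = psycopg2.connect(host="my-cluster.example.com", port=5439,
                        dbname="dev", user="etl_user", password="...")  # placeholders
with conn, conn.cursor() as cur:
    cur.execute(DDL)
    cur.execute("CALL sp_load_clean_orders();")
```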
Prerequisites: Before you get started, make sure you have the following: an AWS account, an IAM user with administrator access, and an S3 bucket. Solution architecture: To automate the complete process, we use the following architecture, which integrates Step Functions for orchestration and Amazon EMR Serverless for data transformations.
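A minimal sketch of kicking off such a run, assuming a Step Functions state machine wrapping the EMR Serverless transformation job already exists; the ARN and S3 prefixes are placeholders:

```python
import json
import boto3  # assumption: AWS credentials configured; state machine already deployed

sfn = boto3.client("stepfunctions")

# Kick off one run of the (hypothetical) transformation state machine, passing
# the S3 prefixes the EMR Serverless Spark job should read from and write to.
response = sfn.start_execution(
    stateMachineArn="arn:aws:states:us-east-1:123456789012:stateMachine:etl-pipeline",  # placeholder
    input=json.dumps({
        "source": "s3://my-bucket/raw/",      # hypothetical input prefix
        "target": "s3://my-bucket/curated/",  # hypothetical output prefix
    }),
)
print(response["executionArn"])
```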
It has not been specifically designed for heavy data transformation tasks. The proposed architecture enables extraction and repartitioning of the data in just a few minutes. It’s scalable and cost-effective, and can be adapted to other ETL and data processing use cases.
However, this transition presents significant challenges. Detailed Data and Model Lineage Tracking: Ensures comprehensive tracking and documentation of data transformations and model lifecycle events, enhancing reproducibility and auditability.