This article was published as a part of the Data Science Blogathon. Introduction Azure Data Factory (ADF) is a cloud-based ETL (Extract, Transform, Load) tool and data integration service that allows you to create a data-driven workflow. In this article, I’ll show […].
1) What Is Data Quality Management? 4) Data Quality Best Practices. 5) How Do You Measure Data Quality? 6) Data Quality Metrics Examples. 7) Data Quality Control: Use Case. 8) The Consequences Of Bad Data Quality. 9) 3 Sources Of Low-Quality Data. 10) Data Quality Solutions: Key Attributes.
Introduction Have you ever struggled with managing complex data transformations? In today’s data-driven world, extracting, transforming, and loading (ETL) data is crucial for gaining valuable insights. While many ETL tools exist, dbt (data build tool) is emerging as a game-changer.
Introduction Azure Data Factory (ADF) is a cloud-based data ingestion and ETL (Extract, Transform, Load) tool. The data-driven workflow in ADF orchestrates and automates data movement and data transformation.
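As a hedged illustration of the orchestration the excerpt describes: a minimal sketch of triggering an ADF pipeline run from Python with the azure-mgmt-datafactory SDK. The subscription ID, resource group, factory, pipeline name, and parameters below are hypothetical placeholders.

```python
# Minimal sketch: trigger an ADF pipeline run and poll its status.
# Resource names are hypothetical placeholders, not from the source.
from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient

credential = DefaultAzureCredential()
adf_client = DataFactoryManagementClient(credential, "<subscription-id>")

# Kick off the pipeline, optionally passing runtime parameters.
run = adf_client.pipelines.create_run(
    resource_group_name="my-rg",          # hypothetical
    factory_name="my-data-factory",       # hypothetical
    pipeline_name="copy_sales_pipeline",  # hypothetical
    parameters={"load_date": "2024-01-01"},
)

# ADF executes the pipeline asynchronously; fetch the run's status.
status = adf_client.pipeline_runs.get("my-rg", "my-data-factory", run.run_id)
print(status.status)  # e.g. "InProgress", "Succeeded", "Failed"
```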
For years, IT and data leaders have been striving to help their companies become more data-driven. But technology investment alone is not enough to make your organization data-driven. “A lot of organizations have tried to treat data as a project,” says Traci Gusher, EY Americas data and analytics leader.
At AWS, we are committed to empowering organizations with tools that streamline data analytics and transformation processes. This integration enables data teams to efficiently transform and manage data using Athena with dbt Cloud’s robust features, enhancing the overall data workflow experience.
At Atlanta’s Hartsfield-Jackson International Airport, an IT pilot has led to a wholesale data journey destined to transform operations at the world’s busiest airport, fueled by machine learning and generative AI. Data integrity presented a major challenge for the team, as there were many instances of duplicate data.
The Airflow REST API facilitates a wide range of use cases, from centralizing and automating administrative tasks to building event-driven, data-aware data pipelines. Event-driven architectures – The enhanced API facilitates seamless integration with external events, enabling the triggering of Airflow DAGs based on these events.
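To make the event-driven pattern concrete, here is a hedged sketch of triggering a DAG through the Airflow 2.x stable REST API. The host, credentials, DAG id, and conf payload are assumptions; basic auth must be enabled on the Airflow webserver for this to work as written.

```python
# Sketch: an external event handler triggers an Airflow DAG run
# via the stable REST API (POST /api/v1/dags/{dag_id}/dagRuns).
import requests

AIRFLOW_URL = "http://localhost:8080/api/v1"  # hypothetical deployment

response = requests.post(
    f"{AIRFLOW_URL}/dags/orders_etl/dagRuns",  # "orders_etl" is hypothetical
    auth=("admin", "admin"),                   # assumes basic auth is enabled
    json={"conf": {"source_path": "s3://my-bucket/landing/2024-01-01/"}},
    timeout=30,
)
response.raise_for_status()
print(response.json()["dag_run_id"])  # id of the run that was just created
```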
The healthcare industry is happily embracing big data. Hospitals around the world are finding that data can have a profound impact on their operations. A lot of the emphasis so far has been on the use of big data to better engage with external third-parties, but big data can be equally valuable for managing internal hospital systems.
Data is the foundation of innovation, agility and competitive advantage in today’s digital economy. As technology and business leaders, your strategic initiatives, from AI-powered decision-making to predictive insights and personalized experiences, are all fueled by data. Data quality is no longer a back-office concern.
We live in a data-rich, insights-rich, and content-rich world. Data collections are the ones and zeroes that encode the actionable insights (patterns, trends, relationships) that we seek to extract from our data through machine learning and data science. Plus, AI can also help find key insights encoded in data.
Introduction Data pipelines play a critical role in the processing and management of data in modern organizations. A well-designed data pipeline can help organizations extract valuable insights from their data, automate tedious manual processes, and ensure the accuracy of data processing.
For container terminal operators, data-driven decision-making and efficient data sharing are vital to optimizing operations and boosting supply chain efficiency. Together, these capabilities enable terminal operators to enhance efficiency and competitiveness in an industry that is increasingly data-driven.
Amazon DataZone is a data management service that makes it faster and easier for customers to catalog, discover, share, and govern data stored across AWS, on premises, and from third-party sources. Using Amazon DataZone lets us avoid building and maintaining an in-house platform, allowing our developers to focus on tailored solutions.
In early April 2021, DataKitchen sat down with Jonathan Hodges, VP Data Management & Analytics at Workiva; Chuck Smith, VP of R&D Data Strategy at GlaxoSmithKline (GSK); and Chris Bergh, CEO and Head Chef at DataKitchen, to find out about their enterprise DataOps transformation journey, including key successes and lessons learned.
How dbt Core helps data teams test, validate, and monitor complex data transformations and conversions. Introduction dbt Core, an open-source framework for developing, testing, and documenting SQL-based data transformations, has become a must-have tool for modern data teams as the complexity of data pipelines grows.
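For a sense of how this fits into a pipeline, here is a minimal sketch of driving dbt Core programmatically; dbt-core 1.5+ exposes dbtRunner for exactly this. The "orders" selector is a hypothetical model name.

```python
# Sketch: build a dbt model, then run its tests, from Python
# (equivalent to `dbt run` / `dbt test` on the CLI).
from dbt.cli.main import dbtRunner

runner = dbtRunner()

build = runner.invoke(["run", "--select", "orders"])   # "orders" is hypothetical
tests = runner.invoke(["test", "--select", "orders"])

# Fail the surrounding job if either the transformation or its tests fail.
if not (build.success and tests.success):
    raise SystemExit("dbt transformation or its tests failed")
```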
Selecting the strategies and tools for validating data transformations and data conversions in your data pipelines. Introduction Data transformations and data conversions are crucial to ensure that raw data is organized, processed, and ready for useful analysis.
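One common validation strategy is a reconciliation pass after each transformation: row counts, completeness of required fields, and domain checks. The sketch below is generic and tool-agnostic; the table and column names are hypothetical.

```python
# Sketch: post-transformation sanity checks on a batch of records.
def validate_transformation(source_rows, target_rows):
    """Raise if the transformed output fails basic reconciliation checks."""
    # Row-count reconciliation: no records silently dropped or duplicated.
    if len(target_rows) != len(source_rows):
        raise ValueError(
            f"row count mismatch: {len(source_rows)} in, {len(target_rows)} out"
        )
    # Completeness: required fields must be populated after the transform.
    missing = [r for r in target_rows if r.get("customer_id") is None]
    if missing:
        raise ValueError(f"{len(missing)} rows missing customer_id")
    # Domain check: converted amounts must be non-negative.
    negative = [r for r in target_rows if r.get("amount", 0) < 0]
    if negative:
        raise ValueError(f"{len(negative)} rows with negative amount")
```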
AI is transforming how senior data engineers and data scientists validate data transformations and conversions. Artificial intelligence-based verification approaches aid in the detection of anomalies, the enforcement of data integrity, and the optimization of pipelines for improved efficiency.
As with many burgeoning fields and disciplines, we don’t yet have a shared canonical infrastructure stack or best practices for developing and deploying data-intensive applications. Why: Data Makes It Different. Not only is data larger, but models—deep learning models in particular—are much larger than before.
Common challenges and practical mitigation strategies for reliable data transformations. Introduction Data transformations are important processes in data engineering, enabling organizations to structure, enrich, and integrate data for analytics, reporting, and operational decision-making.
In today’s rapidly evolving financial landscape, data is the bedrock of innovation, enhancing customer and employee experiences and securing a competitive edge. Like many large financial institutions, ANZ Institutional Division operated with siloed data practices and centralized data management teams.
A critical part of effectively exploring your data, transforming it into actionable insights, and enhancing decision-making for your business is being empowered to slice and dice your data and to be less dependent on technical resources for new updates. Improved visibility into insights will enable you to get more out of them.
Data-driven companies sense change through data analytics. Companies turn to their data organization to provide the analytics that stimulates creative problem-solving. The speed at which the data team responds to these requests is critical. The agility of analytics directly relates to data analytics workflows.
Amazon DataZone now supports authentication through the Amazon Athena JDBC driver, allowing data users to seamlessly query their subscribed data lake assets via popular business intelligence (BI) and analytics tools like Tableau, Power BI, Excel, SQL Workbench, DBeaver, and more.
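The excerpt concerns the Athena JDBC driver used from BI tools; as a Python-side illustration of querying the same engine, here is the boto3 route instead (a deliberate substitution, not the JDBC path itself). The database, table, and output location are hypothetical.

```python
# Sketch: run an Athena query with boto3 and poll for completion.
import time

import boto3

athena = boto3.client("athena", region_name="us-east-1")

start = athena.start_query_execution(
    QueryString="SELECT * FROM subscribed_sales LIMIT 10",  # hypothetical table
    QueryExecutionContext={"Database": "analytics_db"},     # hypothetical
    ResultConfiguration={"OutputLocation": "s3://my-athena-results/"},
)

# Athena executes queries asynchronously; poll until a terminal state.
state = "QUEUED"
while state in ("QUEUED", "RUNNING"):
    time.sleep(1)
    state = athena.get_query_execution(
        QueryExecutionId=start["QueryExecutionId"]
    )["QueryExecution"]["Status"]["State"]

if state == "SUCCEEDED":
    rows = athena.get_query_results(QueryExecutionId=start["QueryExecutionId"])
    print(rows["ResultSet"]["Rows"])
```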
Table of Contents: 1) Benefits Of Big Data In Logistics 2) 10 Big Data In Logistics Use Cases. Big data is revolutionizing many fields of business, and logistics analytics is no exception. The complex and ever-evolving nature of logistics makes it an essential use case for big data applications. Did you know?
Amazon Redshift is a fast, scalable, secure, and fully managed cloud data warehouse that you can use to analyze your data at scale. The Redshift Data API provides a secure HTTP endpoint and integration with AWS SDKs. Calls to the Data API are asynchronous.
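A minimal sketch of the asynchronous call pattern the excerpt describes, using boto3's "redshift-data" client. The cluster identifier, database, user, and SQL below are hypothetical placeholders.

```python
# Sketch: submit SQL via the Redshift Data API and poll for the result.
import time

import boto3

rsd = boto3.client("redshift-data", region_name="us-east-1")

run = rsd.execute_statement(
    ClusterIdentifier="my-cluster",  # hypothetical (use WorkgroupName for serverless)
    Database="dev",
    DbUser="analyst",                # hypothetical
    Sql="SELECT event_date, COUNT(*) FROM events GROUP BY 1",
)

# The Data API is asynchronous: poll describe_statement until done.
status = "STARTED"
while status not in ("FINISHED", "FAILED", "ABORTED"):
    time.sleep(1)
    status = rsd.describe_statement(Id=run["Id"])["Status"]

if status == "FINISHED":
    result = rsd.get_statement_result(Id=run["Id"])
    print(result["Records"])
```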
In the final part of this three-part series, we’ll explore how data mesh bolsters performance and helps organizations and data teams work more effectively. Usually, organizations will combine different domain topologies, depending on the trade-offs, and choose to focus on specific aspects of data mesh.
In today’s data-driven world, businesses are drowning in a sea of information. Traditional data integration methods struggle to bridge these gaps, hampered by high costs, data quality concerns, and inconsistencies. Unleashing the Power of Data Connections Zenia Graph isn’t just another data solution company.
Today’s best-performing organizations embrace data for strategic decision-making. Because of the criticality of the data they deal with, we think that finance teams should lead the enterprise adoption of data and analytics solutions. This is because accurate data is “table stakes” for finance teams.
In a world increasingly dominated by data, organizations are grappling with the need to effectively manage and harness this valuable asset. At the same time, the data management […]
While customers can perform some basic analysis within their operational or transactional databases, many still need to build custom data pipelines that use batch or streaming jobs to extract, transform, and load (ETL) data into their data warehouse for more comprehensive analysis.
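A generic sketch of the batch pattern just described: extract from an operational store, transform in Python, load into a warehouse table. It uses sqlite3 purely as a stand-in for both databases; connection targets, tables, and columns are hypothetical.

```python
# Sketch: a tiny batch ETL job (extract -> transform -> load).
import sqlite3  # stand-in for the operational and warehouse databases

def run_batch_etl():
    source = sqlite3.connect("operational.db")   # hypothetical source
    warehouse = sqlite3.connect("warehouse.db")  # hypothetical target

    # Extract: pull the last day's orders from the transactional store.
    rows = source.execute(
        "SELECT id, amount, created_at FROM orders "
        "WHERE created_at >= date('now', '-1 day')"
    ).fetchall()

    # Transform: normalize amounts from cents to dollars.
    transformed = [(oid, amount / 100.0, ts) for oid, amount, ts in rows]

    # Load: append into the warehouse fact table.
    warehouse.executemany(
        "INSERT INTO fact_orders (order_id, amount_usd, created_at) "
        "VALUES (?, ?, ?)",
        transformed,
    )
    warehouse.commit()
```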
ChatGPT> DataOps, or data operations, is a set of practices and technologies that organizations use to improve the speed, quality, and reliability of their data analytics processes. The goal of DataOps is to help organizations make better use of their data to drive business decisions and improve outcomes.
The Ten Standard Tools To Develop Data Pipelines In Microsoft Azure. While working in Azure with our customers, we have noticed several standard Azure tools people use to develop data pipelines and ETL or ELT processes. We counted ten ‘standard’ ways to transform and set up batch data pipelines in Microsoft Azure.
To accelerate growth through innovation, the company is expanding its use of data science and artificial intelligence (AI) across the business to improve patient outcomes. This initiative alone has generated an explosion in the quantity and complexity of data the company collects, stores, and analyzes for insights.
The need to integrate diverse data sources has grown exponentially, but there are several common challenges when integrating and analyzing data from multiple sources, services, and applications. First, you need to create and maintain independent connections to the same data source for different services.
Data lineage is the journey data takes from its creation through its transformations over time. Tracing the source of data is an arduous task. With all these diverse data sources, and if systems are integrated, it is difficult to understand the complicated data web they form, much less get a simple visual flow.
Modern data governance is a strategic, ongoing and collaborative practice that enables organizations to discover and track their data, understand what it means within a business context, and maximize its security, quality and value. The What: Data Governance Defined. Data governance has no standard definition.
Data holds incredible untapped potential for Australian organisations across industries, regardless of individual business goals, and all organisations are at different points in their data transformation journey, with some achieving success faster than others. More importantly, effective data strategies don’t stand still.
For years, IT and business leaders have been talking about breaking down the data silos that exist within their organizations. Given the importance of sharing information among diverse disciplines in the era of digital transformation, this concept is arguably as important as ever. What are the challenges and potential rewards?
Manufacturers have long held a data-driven vision for the future of their industry. It’s one where near real-time data flows seamlessly between IT and operational technology (OT) systems. Legacy data management is holding back manufacturing transformation Until now, however, this vision has remained out of reach.
Organizations are flooded with data, so they’re scrambling to find ways to derive meaningful insights from it – and then act on them to improve the bottom line. In today’s data-driven business, enabling employees to access and understand the data that’s relevant to their roles allows them to use data and put those insights into action.
Replace manual and recurring tasks for fast, reliable data lineage and overall data governance. It’s paramount that organizations understand the benefits of automating end-to-end data lineage. The importance of end-to-end data lineage is widely understood and ignoring it is risky business. Doing Data Lineage Right.
As the world is gradually becoming more dependent on data, the services, tools and infrastructure are all the more important for businesses in every sector. Data management has become a fundamental business concern, and especially for businesses that are going through a digital transformation. What is data management?
Additionally, this forecasting system needs to provide data enrichment steps (including byproducts), serve as the master data source for semiconductor management, and enable further use cases at the BMW Group. To enable this use case, we used the BMW Group’s cloud-native data platform called the Cloud Data Hub.