1) What Is Data Quality Management? 4) Data Quality Best Practices. 5) How Do You Measure Data Quality? 6) Data Quality Metrics Examples. 7) Data Quality Control: Use Case. 8) The Consequences Of Bad Data Quality. 9) 3 Sources Of Low-Quality Data.
There are countless examples of big data transforming many different industries. It can be used for everything from reducing traffic jams to personalizing products and services to improving the experience in multiplayer video games. We would like to talk about data visualization and its role in the big data movement.
Alerts and notifications play a crucial role in maintaining data quality because they facilitate prompt and efficient responses to any data quality issues that may arise within a dataset. This proactive approach helps mitigate the risk of making decisions based on inaccurate information.
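To make the idea concrete, here is a minimal sketch of a threshold-based alert, assuming a pandas DataFrame and a hypothetical 5% null-rate threshold; the column names and alerting channel are invented for the example:

```python
import logging

import pandas as pd

logging.basicConfig(level=logging.WARNING)
logger = logging.getLogger("dq_alerts")

# Hypothetical threshold: alert when more than 5% of a column's values are null.
NULL_RATE_THRESHOLD = 0.05

def check_null_rates(df: pd.DataFrame) -> list:
    """Return alert messages for columns whose null rate exceeds the threshold."""
    alerts = []
    for column in df.columns:
        null_rate = df[column].isna().mean()
        if null_rate > NULL_RATE_THRESHOLD:
            msg = f"Column '{column}' null rate {null_rate:.1%} exceeds {NULL_RATE_THRESHOLD:.0%}"
            logger.warning(msg)  # in practice this might page a team or post to a chat channel
            alerts.append(msg)
    return alerts

if __name__ == "__main__":
    sample = pd.DataFrame({"customer_id": [1, 2, None, 4], "amount": [10.0, None, None, 7.5]})
    print(check_null_rates(sample))
```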
Content includes reports, documents, articles, presentations, visualizations, video, and audio representations of the insights and knowledge that have been extracted from data. We could further refine our opening statement to say that our business users are too often in a state of being data-rich, but insights-poor, and content-hungry.
In addition to real-time analytics and visualization, the data needs to be shared for long-term data analytics and machine learning applications. The data science and AI teams are able to explore and use new data sources as they become available through Amazon DataZone.
AWS Glue Studio is a graphical interface that makes it easy to create, run, and monitor extract, transform, and load (ETL) jobs in AWS Glue. DataBrew is a visual data preparation tool that enables you to clean and normalize data without writing any code.
Domain ownership recognizes that the teams generating the data have the deepest understanding of it and are therefore best suited to manage, govern, and share it effectively. This principle makes sure data accountability remains close to the source, fostering higher data quality and relevance.
And all of them are asking hard questions: “Can you integrate my data, with my particular format?”, “How well can you scale?”, “How many visualizations do you offer?” Nowadays, data analytics doesn’t exist on its own. You have to take care of data extraction, transformation, and loading, as well as visualization.
In addition to using native managed AWS services that BMS didn’t need to worry about upgrading, BMS was looking to offer an ETL service to non-technical business users that could visually compose data transformation workflows and seamlessly run them on the AWS Glue Apache Spark-based serverless data integration engine.
Selecting the strategies and tools for validating data transformations and data conversions in your data pipelines. Data transformations and data conversions are crucial to ensure that raw data is organized, processed, and ready for useful analysis.
AI is transforming how senior data engineers and data scientists validate data transformations and conversions. Artificial intelligence-based verification approaches aid in the detection of anomalies, the enforcement of data integrity, and the optimization of pipelines for improved efficiency.
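As a concrete, if simplified, illustration of such verification, the sketch below reconciles a source table against its transformed output, assuming a pandas-based pipeline in which the transformation should preserve row counts and key values; the tables and column names are invented:

```python
import pandas as pd

def validate_transformation(source: pd.DataFrame, target: pd.DataFrame, key: str) -> dict:
    """Basic reconciliation checks between a source table and its transformed output."""
    return {
        "row_count_match": len(source) == len(target),          # no rows silently dropped or duplicated
        "no_duplicate_keys": target[key].is_unique,             # transformation did not fan out keys
        "no_lost_keys": set(source[key]) == set(target[key]),   # every key survived
    }

source = pd.DataFrame({"id": [1, 2, 3], "raw_amount": ["10", "20", "30"]})
target = pd.DataFrame({"id": [1, 2, 3], "amount": [10.0, 20.0, 30.0]})
print(validate_transformation(source, target, key="id"))
```

Real pipelines would layer column-level, distribution, and anomaly checks on top of these structural ones.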
These tools range from enterprise service bus (ESB) products and data integration tools to extract, transform, and load (ETL) tools, procedural code, application programming interfaces (APIs), file transfer protocol (FTP) processes, and even business intelligence (BI) reports that further aggregate and transform data.
In this post, we’ll see the fundamental procedures, tools, and techniques that data engineers, data scientists, and QA/testing teams use to ensure high-quality data as soon as it’s deployed. First, we look at how unit and integration tests uncover transformation errors at an early stage. Statistical tests (e.g.,
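As a minimal sketch of the unit-test side, the following pytest example checks a hypothetical normalize_amount transformation; the function and its cases are invented for illustration:

```python
# test_transformations.py
import pytest

def normalize_amount(raw: str) -> float:
    """Convert a raw amount string like '1,234.50' into a float."""
    return float(raw.replace(",", ""))

def test_normalize_amount_strips_thousands_separator():
    assert normalize_amount("1,234.50") == 1234.50

def test_normalize_amount_rejects_garbage():
    # A transformation should fail loudly on malformed input, not pass it through.
    with pytest.raises(ValueError):
        normalize_amount("not-a-number")
```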
AWS Glue is a serverless data integration service that makes it straightforward to discover, prepare, and combine data for analytics, machine learning (ML), and application development. AWS Glue provides both visual and code-based interfaces to make data integration effortless. Choose Create job and Visual ETL.
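For the code-based interface, a minimal Glue PySpark script might look like the sketch below, which reads a cataloged table, drops rows with a missing key, and writes Parquet to S3. It only runs inside a Glue job, and the database, table, key column, and S3 path are assumptions, not values from the post:

```python
import sys

from awsglue.context import GlueContext
from awsglue.dynamicframe import DynamicFrame
from awsglue.job import Job
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext

args = getResolvedOptions(sys.argv, ["JOB_NAME"])
glue_context = GlueContext(SparkContext.getOrCreate())
job = Job(glue_context)
job.init(args["JOB_NAME"], args)

# Read a cataloged table (database and table names are assumptions).
orders = glue_context.create_dynamic_frame.from_catalog(
    database="sales_db", table_name="orders"
)

# Drop rows missing the primary key, then write Parquet back to S3.
cleaned_df = orders.toDF().dropna(subset=["order_id"])
cleaned = DynamicFrame.fromDF(cleaned_df, glue_context, "cleaned")
glue_context.write_dynamic_frame.from_options(
    frame=cleaned,
    connection_type="s3",
    connection_options={"path": "s3://example-bucket/cleaned/orders/"},
    format="parquet",
)
job.commit()
```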
Every data professional knows that ensuring data quality is vital to producing usable query results. Streaming data can be extra challenging in this regard, as it tends to be “dirty,” with new fields that are added without warning and frequent mistakes in the data collection process.
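One defensive pattern for such streams is to validate each record against the fields you require while tolerating, but logging, fields that appear without warning. A minimal sketch, with an assumed schema:

```python
import logging
from typing import Optional

logger = logging.getLogger("stream_dq")
REQUIRED_FIELDS = {"event_id", "timestamp", "user_id"}  # hypothetical schema

def validate_record(record: dict) -> Optional[dict]:
    missing = REQUIRED_FIELDS - record.keys()
    if missing:
        logger.warning("Dropping record, missing fields: %s", missing)
        return None
    unexpected = record.keys() - REQUIRED_FIELDS
    if unexpected:
        # New fields show up without warning in dirty streams; keep them but flag them.
        logger.info("New fields observed: %s", unexpected)
    return record

print(validate_record({"event_id": "e1", "timestamp": "2024-01-01T00:00:00Z",
                       "user_id": "u42", "utm_source": "ad"}))
```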
However, you might face significant challenges when planning for a large-scale data warehouse migration. Data engineers are crucial for schema conversion and data transformation, and DBAs can handle cluster configuration and workload monitoring. Platform architects define a well-architected platform.
It’s common to ingest multiple data sources into Amazon Redshift to perform analytics. Often, each data source will have its own processes of creating and maintaining data, which can lead to data quality challenges within and across sources. Answering even a question as simple as “How many unique customers do we have?” can become surprisingly difficult.
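A toy pandas sketch of why even that question is hard when sources format identifiers differently; the data and column names are invented:

```python
import pandas as pd

# Two sources that maintain customer identifiers differently.
crm = pd.DataFrame({"email": ["A@Example.com", "b@example.com"]})
orders = pd.DataFrame({"email": ["a@example.com ", "c@example.com"]})

combined = pd.concat([crm, orders], ignore_index=True)
# Without normalization, case and whitespace differences inflate the count.
print(combined["email"].nunique())                          # 4
print(combined["email"].str.strip().str.lower().nunique())  # 3
```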
Just as a navigation app provides a detailed map of roads, guiding you from your starting point to your destination while highlighting every turn and intersection, data flow lineage offers a comprehensive view of data movement and transformations throughout its lifecycle.
With Octopai’s support and analysis of Azure Data Factory, enterprises can now view complete end-to-end data lineage from Azure Data Factory all the way through to reporting for the first time ever.
In this post, we share how Showpad used QuickSight to streamline data and insights access across teams and customers. Showpad migrated over 70 dashboards with over 1,000 visuals. “Each of these tools was getting data from a different place, and that’s where it gets difficult,” says Jeroen Minnaert, head of data at Showpad.
Every event in the data source can be relevant, and our customers don’t tolerate data loss, poor data quality, or discrepancies between the source and Tricentis Analytics. While aggregating, summarizing, and aligning to a common information model, all transformations must not affect the integrity of data from its source.
Data Extraction: The process of gathering data from disparate sources, each of which may have its own schema defining the structure and format of the data, and making it available for processing. This can include tasks such as data ingestion, cleansing, filtering, aggregation, or standardization.
The quick and dirty definition of data mapping is the process of connecting different types of data from various data sources. Data mapping is a crucial step in data modeling and can help organizations achieve their business goals by enabling data integration, migration, transformation, and quality.
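As a minimal sketch of the idea, the mapping below renames fields from an invented source schema to an invented target schema, dropping anything unmapped:

```python
# Field-level mapping between a source schema and a target schema (both invented).
FIELD_MAP = {
    "cust_no": "customer_id",
    "fname": "first_name",
    "lname": "last_name",
}

def map_record(source_record: dict) -> dict:
    """Rename source fields to the target schema, dropping unmapped fields."""
    return {
        target: source_record[source]
        for source, target in FIELD_MAP.items()
        if source in source_record
    }

print(map_record({"cust_no": 42, "fname": "Ada", "lname": "Lovelace", "legacy_flag": 1}))
```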
Dynamics data structures are already complex – finance teams navigating Dynamics data frequently require IT department support to complete their routine reporting.
Jet’s interface lets you handle data administration easily, without advanced coding skills. You don’t need technical skills to manage complex data workflows in the Fabric environment. Jet Analytics simplifies the process of documenting data assets and tracking data lineage in Fabric.
Prevent the inclusion of invalid values in categorical data and process data without any data loss. Conduct data quality tests on anonymized data in compliance with data policies. Conduct data quality tests to quickly identify and address data quality issues, maintaining high-quality data at all times.
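One way to satisfy the no-data-loss requirement is to flag invalid categorical values instead of dropping the rows that contain them. A minimal pandas sketch, with an assumed column and category set:

```python
import pandas as pd

ALLOWED_STATUSES = {"active", "churned", "trial"}  # hypothetical category set

def flag_invalid_categories(df: pd.DataFrame, column: str) -> pd.DataFrame:
    """Add a boolean quality flag rather than deleting offending rows."""
    out = df.copy()
    out[f"{column}_is_valid"] = out[column].isin(ALLOWED_STATUSES)
    return out

df = pd.DataFrame({"status": ["active", "ACTIVE", "trial", None]})
print(flag_invalid_categories(df, "status"))  # "ACTIVE" and None are flagged, not dropped
```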
For data management teams, achieving more with fewer resources has become a familiar challenge. While efficiency is a priority, data quality and security remain non-negotiable. Developing and maintaining data transformation pipelines are among the first tasks to be targeted for automation.
It allows organizations to see how data is being used, where it is coming from, its quality, and how it is being transformed. DataOps Observability includes monitoring and testing the data pipeline, data quality, data testing, and alerting. Data lineage does not directly improve data quality.