Announcing DataOps Data Quality TestGen 3.0: Open-Source, Generative Data Quality Software. You don’t have to imagine; start using it today: [link] Introducing Data Quality Scoring in Open Source DataOps Data Quality TestGen 3.0! DataOps just got more intelligent.
1) What Is Data Quality Management? 4) Data Quality Best Practices. 5) How Do You Measure Data Quality? 6) Data Quality Metrics Examples. 7) Data Quality Control: Use Case. 8) The Consequences Of Bad Data Quality. 9) 3 Sources Of Low-Quality Data.
Content includes reports, documents, articles, presentations, visualizations, video, and audio representations of the insights and knowledge that have been extracted from data. We could further refine our opening statement to say that our business users are too often data-rich but insights-poor and content-hungry.
Data quality is crucial in data pipelines because it directly impacts the validity of the business insights derived from the data. Today, many organizations use AWS Glue Data Quality to define and enforce data quality rules on their data at rest and in transit.
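To make that concrete, here is a hedged sketch of what such a rule definition might look like in DQDL (Glue’s Data Quality Definition Language), attached to a Data Catalog table with boto3; the database, table, and thresholds are placeholders, not an example from the article:

```python
# Hedged sketch: define a Glue Data Quality ruleset in DQDL and attach it
# to a Glue Data Catalog table. All names and thresholds are placeholders.
import boto3

ruleset = """
Rules = [
    IsComplete "customer_id",
    Uniqueness "order_id" > 0.99,
    ColumnValues "amount" >= 0,
    RowCount > 0
]
"""

glue = boto3.client("glue")
glue.create_data_quality_ruleset(
    Name="orders-baseline-checks",
    Ruleset=ruleset,
    TargetTable={"DatabaseName": "sales_db", "TableName": "orders"},
)
```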
It addresses many of the shortcomings of traditional data lakes by providing features such as ACID transactions, schema evolution, row-level updates and deletes, and time travel. In this blog post, we’ll discuss how the metadata layer of Apache Iceberg can be used to make data lakes more efficient.
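As an illustration of that metadata layer, here is a minimal PySpark sketch, assuming a Spark session already configured with the Iceberg extensions and a catalog named demo; the table name and snapshot ID are placeholders:

```python
# Minimal sketch of Iceberg's metadata layer in action. Assumes Iceberg's
# Spark extensions are configured and a catalog named "demo" exists.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("iceberg-demo").getOrCreate()

# Inspect the snapshot history that the metadata layer maintains.
spark.sql("SELECT * FROM demo.db.orders.snapshots").show()

# Time travel: query the table as of an earlier snapshot (placeholder ID).
spark.sql("SELECT * FROM demo.db.orders VERSION AS OF 8744736658442914487").show()

# Row-level update, committed as an ACID transaction.
spark.sql("UPDATE demo.db.orders SET status = 'shipped' WHERE id = 42")
```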
There are countless examples of big data transforming many different industries. Its uses range from something as visible as reducing traffic jams to personalizing products and services to improving the experience in multiplayer video games. We would like to talk about data visualization and its role in the big data movement.
Today, we are pleased to announce that Amazon DataZone is now able to present data quality information for data assets. Other organizations monitor the quality of their data through third-party solutions. Additionally, Amazon DataZone now offers APIs for importing data quality scores from external systems.
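A hedged sketch of what such an import might look like with boto3’s DataZone client follows; it assumes the PostTimeSeriesDataPoints API and the data quality form type AWS documents, and every identifier and content field below is a placeholder rather than a verified schema:

```python
# Hedged sketch: push a third-party data quality score into Amazon DataZone.
# Assumes the PostTimeSeriesDataPoints API; identifiers are placeholders and
# the content fields are illustrative, not a verified schema.
import json
from datetime import datetime, timezone

import boto3

datazone = boto3.client("datazone")
datazone.post_time_series_data_points(
    domainIdentifier="dzd_example123",       # placeholder domain ID
    entityIdentifier="asset_example456",     # placeholder asset ID
    entityType="ASSET",
    forms=[{
        "formName": "ExternalDataQualityResults",
        "typeIdentifier": "amazon.datazone.DataQualityResultFormType",
        "timestamp": datetime.now(timezone.utc),
        "content": json.dumps({              # illustrative payload fields
            "evaluationsCount": 10,
            "passingPercentage": 92.5,
        }),
    }],
)
```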
What Is Metadata? Metadata is information about data. A clothing catalog and a dictionary are both examples of metadata repositories. Indeed, a popular online catalog, like Amazon, offers rich metadata around products to guide shoppers: ratings, reviews, and product details are all examples of metadata.
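A toy sketch makes the distinction concrete; the product and its fields are invented for illustration:

```python
# Toy illustration of metadata: the "data" is the product record itself,
# while ratings, reviews, and details describe it. Values are invented.
product = {"sku": "TSHIRT-001", "image": "tshirt.png"}  # the data
product_metadata = {                                    # data about the data
    "title": "Classic Cotton T-Shirt",
    "rating": 4.6,
    "review_count": 1284,
    "details": {"material": "cotton", "fit": "regular"},
}
print(f'{product["sku"]} is described by {len(product_metadata)} metadata fields')
```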
Domain ownership recognizes that the teams generating the data have the deepest understanding of it and are therefore best suited to manage, govern, and share it effectively. This principle makes sure data accountability remains close to the source, fostering higher data quality and relevance.
In addition to real-time analytics and visualization, the data needs to be shared for long-term data analytics and machine learning applications. From here, the metadata is published to Amazon DataZone by using the AWS Glue Data Catalog.
If the data is not easily gathered, managed, and analyzed, it can overwhelm decision-makers and complicate decision-making. Data insight techniques provide a comprehensive set of tools, data analysis, and quality assurance features that allow users to identify errors, enhance data quality, and boost productivity.
In order to figure out why the numbers in the two reports didn’t match, Steve needed to understand everything about the data that made up those reports – when the report was created, who created it, any changes made to it, which system it was created in, etc. Enterprise data governance. Metadata in data governance.
Collaborate and build faster using familiar AWS tools for model development, generative AI, data processing, and SQL analytics with Amazon Q Developer, the most capable generative AI assistant for software development, helping you along the way. Having confidence in your data is key.
Some customers build custom in-house data parity frameworks to validate data during migration. Others use open-source data quality products for data parity use cases. Either way, building and maintaining a data parity framework diverts important person-hours from the actual migration effort.
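For illustration, here is a hypothetical, minimal parity check of the kind such frameworks automate, comparing row counts and order-independent per-column hashes between a source table and its migrated copy:

```python
# Hypothetical minimal data-parity check between a source table and its
# migrated copy: compare row counts and order-independent column hashes.
import hashlib

import pandas as pd

def fingerprint(df: pd.DataFrame) -> dict:
    """Row count plus an order-independent hash for every column."""
    fp = {"row_count": len(df)}
    for col in df.columns:
        joined = "|".join(sorted(df[col].astype(str)))
        fp[col] = hashlib.sha256(joined.encode()).hexdigest()
    return fp

# Stand-in data; in practice these would be reads from source and target.
source = pd.DataFrame({"id": [1, 2, 3], "amount": [10.0, 20.5, 30.0]})
target = pd.DataFrame({"id": [1, 2, 3], "amount": [10.0, 20.5, 30.0]})

src_fp, tgt_fp = fingerprint(source), fingerprint(target)
mismatches = {k: (v, tgt_fp.get(k)) for k, v in src_fp.items() if tgt_fp.get(k) != v}
print("parity OK" if not mismatches else f"parity failed: {mismatches}")
```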
To marry the epidemiological data to the population data will require a tremendous amount of data intelligence about the source of the data, the currency of the data, the quality of the data, and the data lineage to support impact analysis. Unraveling Data Complexities with Metadata Management.
As I recently noted, the term “data intelligence” has been used by multiple providers across analytics and data for several years and is becoming more widespread as software providers respond to the need to provide enterprises with a holistic view of data production and consumption.
If you are not observing and reacting to the data, the model will accept every variant, and it may end up as one of the more than 50% of models that, according to Gartner, never make it to production because there are no clear insights and the results have nothing to do with the original intent of the model.
These layers help teams delineate different stages of data processing, storage, and access, offering a structured approach to data management. In the context of Data in Place, validating data quality automatically with Business Domain Tests is imperative for ensuring the trustworthiness of your data assets.
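As a sketch of what a business-domain test against data in place might look like, here is a hypothetical rule (no order ships before it is placed) asserted directly against a table, using an in-memory SQLite stand-in for the warehouse:

```python
# Hypothetical business-domain test for "data in place": assert a domain
# rule directly against the stored table. SQLite stands in for a warehouse.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INT, placed_at TEXT, shipped_at TEXT)")
conn.execute("INSERT INTO orders VALUES (1, '2024-01-01', '2024-01-03')")

# Domain rule: an order can never ship before it was placed.
violations = conn.execute(
    "SELECT COUNT(*) FROM orders WHERE shipped_at < placed_at"
).fetchone()[0]
assert violations == 0, f"{violations} orders violate the shipping rule"
print("business domain test passed")
```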
The Business Application Research Center (BARC) warns that data governance is a highly complex, ongoing program, not a “big bang initiative,” and it runs the risk of participants losing trust and interest over time. The program must introduce and support standardization of enterprise data.
Everyone has to manage raw data at one point or another; yet, not everybody stores it in a way that’s useful for further analysis or comparison to other data sets. As data sets become bigger, it becomes harder to visualize information. Draw a chart highlighting each endpoint in your data.
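For example, a minimal matplotlib sketch of that last suggestion, with invented sample values:

```python
# Minimal sketch: plot a series and highlight its endpoints, as the excerpt
# suggests for summarizing raw data. Sample values are invented.
import matplotlib.pyplot as plt

values = [3, 7, 4, 9, 6, 11]
plt.plot(range(len(values)), values, marker="o", color="steelblue")
plt.scatter([0, len(values) - 1], [values[0], values[-1]],
            color="crimson", zorder=3, label="endpoints")
plt.legend()
plt.title("Highlighting each endpoint in the data")
plt.show()
```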
In addition to using native managed AWS services that BMS didn’t need to worry about upgrading, BMS was looking to offer an ETL service to non-technical business users that could visually compose data transformation workflows and seamlessly run them on the AWS Glue Apache Spark-based serverless data integration engine.
When we talk about data integrity, we’re referring to the overarching completeness, accuracy, consistency, accessibility, and security of an organization’s data. Together, these factors determine the reliability of the organization’s data. Data quality is essentially the measure of data integrity.
These tools range from enterprise service bus (ESB) products and data integration tools to extract, transform and load (ETL) tools, procedural code, application programming interfaces (APIs), file transfer protocol (FTP) processes, and even business intelligence (BI) reports that further aggregate and transform data. Who are the data owners?
As organizations become data-driven and awash in an overwhelming amount of data from multiple data sources (AI, IoT, ML, etc.), they will find new ways to get a handle on data quality and focus on data management processes and best practices.
And all of them are asking hard questions: “Can you integrate my data with my particular format?”, “How well can you scale?”, “How many visualizations do you offer?”. Nowadays, data analytics doesn’t exist on its own. You have to take care of data extraction, transformation, and loading, and of visualization.
And it exists across these hybrid architectures in different formats: big, unstructured data and traditional structured business data may physically sit in different places. What’s desperately needed is a way to understand the relationships and interconnections between so many entities in data sets in detail. Nine Steps to Data Modeling.
A data catalog benefits organizations in a myriad of ways. With the right data catalog tool, organizations can automate enterprise metadata management, including data cataloging, data mapping, data quality and code generation, for faster time to value and greater accuracy for data movement and/or deployment projects.
We also looked at data preparation, governance and intelligence to see where organizations might be getting stuck and spending lots of time. Data quality and accuracy are recurring themes as well. Because without current and accurate data, a data governance initiative will fall apart.
Metadata Management is the Key to Successful Data Governance. Learn more in the webinar “Metadata Management Automation for the Governance Minded.” BCBS 239 and Automated Metadata Management Tools. You may recognize the common thread running through all of these principles: metadata.
Data fabric is an architecture that enables the end-to-end integration of various data pipelines and cloud environments through the use of intelligent and automated systems. The fabric, especially at the active metadata level, is important, Saibene notes.
However, enterprise data generated from siloed sources combined with the lack of a data integration strategy creates challenges for provisioning the data for generative AI applications. Implement data privacy policies. Implement data quality by data type and source.
BI software uses algorithms to extract actionable insights from a company’s data and guide its strategic decisions. BI users analyze and present data in the form of dashboards and various types of reports to visualize complex information in an easier, more approachable way. 6) Smart and faster reporting.
How much time has your BI team wasted on finding data and creating metadata management reports? BI groups spend more than 50% of their time and effort manually searching for metadata. In fact, BI projects used to take many months to complete and require huge numbers of IT professionals to extract data. Cube to the rescue.
Added data quality capability ready for an AI era. Data quality has never been more important than as we head into this next AI-focused era. erwin Data Quality is the data quality heart of erwin Data Intelligence.
It’s the preferred choice when customers need more control and customization over the data integration process or require complex transformations. This flexibility makes Glue ETL suitable for scenarios where data must be transformed or enriched before analysis.
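A minimal sketch of such a Glue ETL (PySpark) job follows, enriching records before analysis; the database, table, column names, and S3 path are placeholders:

```python
# Minimal sketch of an AWS Glue ETL (PySpark) job that enriches records
# before analysis. Database, table, columns, and paths are placeholders.
import sys

from awsglue.context import GlueContext
from awsglue.dynamicframe import DynamicFrame
from awsglue.job import Job
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext
from pyspark.sql import functions as F

args = getResolvedOptions(sys.argv, ["JOB_NAME"])
glue_context = GlueContext(SparkContext.getOrCreate())
job = Job(glue_context)
job.init(args["JOB_NAME"], args)

# Read from the Glue Data Catalog, derive a column, write Parquet to S3.
dyf = glue_context.create_dynamic_frame.from_catalog(
    database="sales_db", table_name="raw_orders")
enriched = (dyf.toDF()
            .withColumn("net_amount", F.col("amount") - F.col("discount")))
glue_context.write_dynamic_frame.from_options(
    frame=DynamicFrame.fromDF(enriched, glue_context, "enriched"),
    connection_type="s3",
    connection_options={"path": "s3://example-bucket/curated/orders/"},
    format="parquet",
)
job.commit()
```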
BI software helps companies do just that by shepherding the right data into analytical reports and visualizations so that users can make informed decisions. To gain employee buy-in, Stout’s team builds BI dashboards to show them how they can easily connect to and interact with their data, as well as visualize it in a meaningful way.
Solution overview: OneData defines three personas. Publisher: This role includes the organizational and management team of systems that serve as data sources. Responsibilities include loading raw data from the data source system at the appropriate frequency, and providing and keeping up to date the technical metadata for loaded data.
Data governance also relies on business process modeling and analysis to drive improvement, including identifying business practices susceptible to security, compliance or other risks and adding controls to mitigate exposures. The lack of a central metadata repository is a far too common thorn in an organization’s side.
Are you an aspiring data scientist, or do you just want to understand the benefits of integrating data catalogs with visualization tools? In today’s ever-growing world of data, having an easy way to gain insights quickly is essential. What are data visualization tools?
DataOps is an approach to best practices for data management that increases the quantity of data analytics products a data team can develop and deploy in a given time while drastically improving the level of data quality. Automated workflows for data product creation, testing and deployment.
Easily and securely prepare, share, and query data – This session shows how you can use Lake Formation and the AWS Glue Data Catalog to share data without copying, transform and prepare data without coding, and query data. DataZone automatically manages the permissions of your shared data in the DataZone projects.
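As a hedged illustration of granting access without copying data, here is a boto3 Lake Formation sketch; the account ID, role, and table names are placeholders:

```python
# Hedged sketch: share catalog data without copying it by granting SELECT
# on a Glue Data Catalog table via Lake Formation. Names are placeholders.
import boto3

lf = boto3.client("lakeformation")
lf.grant_permissions(
    Principal={
        "DataLakePrincipalIdentifier": "arn:aws:iam::123456789012:role/AnalystRole"
    },
    Resource={"Table": {"DatabaseName": "sales_db", "Name": "orders"}},
    Permissions=["SELECT"],
)
```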
The right self-serve data prep solution can provide easy-to-use yet sophisticated data prep tools that are suitable for your business users and enable data preparation techniques like: connect and mash up; auto-suggested relationships, JOINs, and types; sampling and outliers; exploration, cleaning, and shaping; reducing and combining data; insights (data quality (..)
The Data Fabric paradigm combines design principles and methodologies for building efficient, flexible and reliable data management ecosystems. Knowledge Graphs are the Warp and Weft of a Data Fabric. To implement any Data Fabric approach, it is essential to be able to understand the context of data.
Gartner defines a data fabric as “a design concept that serves as an integrated layer of data and connecting processes.” The data fabric architectural approach can simplify data access in an organization and facilitate self-service data consumption at scale.