Analytics, Data Quality and Data Warehouse

Analytics

Data Quality

Data Warehouse

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

AWS Big Data

DECEMBER 4, 2024

This week on the keynote stages at AWS re:Invent 2024, you heard from Matt Garman, CEO, AWS, and Swami Sivasubramanian, VP of AI and Data, AWS, speak about the next generation of Amazon SageMaker , the center for all of your data, analytics, and AI. The relationship between analytics and AI is rapidly evolving.

Data Analytics

Data Analytics Analytics Data Lake Data Quality

The Ultimate Guide to Modern Data Quality Management (DQM) For An Effective Data Quality Control Driven by The Right Metrics

datapine

SEPTEMBER 29, 2022

1) What Is Data Quality Management? 4) Data Quality Best Practices. 5) How Do You Measure Data Quality? 6) Data Quality Metrics Examples. 7) Data Quality Control: Use Case. 8) The Consequences Of Bad Data Quality. 9) 3 Sources Of Low-Quality Data.

Data Quality

Data Quality Metrics Data-driven Management

Join 42,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Trending Sources

Talend Data Fabric Simplifies Data Life Cycle Management

David Menninger's Analyst Perspectives

NOVEMBER 16, 2021

Talend is a data integration and management software company that offers applications for cloud computing, big data integration, application integration, data quality and master data management.

Management

Management Data Warehouse Data Quality Data Integration

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

SAP Datasphere Powers Business at the Speed of Data

Rocket-Powered Data Science

MARCH 20, 2023

In fact, by putting a single label like AI on all the steps of a data-driven business process, we have effectively not only blurred the process, but we have also blurred the particular characteristics that make each step separately distinct, uniquely critical, and ultimately dependent on specialized, specific technologies at each step.

Data Warehouse

Data Warehouse Metadata Digital Transformation Machine Learning

How I Broke Our SLA and Delighted Our Customer

DataKitchen

MAY 17, 2025

One of our key data warehouse refreshes had failed. No new data. The refresh was long past its deadline, the projects key data engineer was on vacation, and I was playing backup. At the moment, I was flying home from a data quality conference. The value of data quality is often invisible.

Testing

Testing Data Quality Data Warehouse Dashboards

Data’s dark secret: Why poor quality cripples AI and growth

CIO Business Intelligence

APRIL 8, 2025

As technology and business leaders, your strategic initiatives, from AI-powered decision-making to predictive insights and personalized experiences, are all fueled by data. Yet, despite growing investments in advanced analytics and AI, organizations continue to grapple with a persistent and often underestimated challenge: poor data quality.

Data Quality

Data Quality Data-driven Key Performance Indicator Metadata

Perform data parity at scale for data modernization programs using AWS Glue Data Quality

AWS Big Data

OCTOBER 9, 2024

Today, customers are embarking on data modernization programs by migrating on-premises data warehouses and data lakes to the AWS Cloud to take advantage of the scale and advanced analytical capabilities of the cloud. Some customers build custom in-house data parity frameworks to validate data during migration.

Data Quality

Data Quality Data Lake Data Warehouse Metrics

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

AWS Big Data

NOVEMBER 22, 2024

At AWS, we are committed to empowering organizations with tools that streamline data analytics and transformation processes. This integration enables data teams to efficiently transform and manage data using Athena with dbt Cloud’s robust features, enhancing the overall data workflow experience.

Data Lake

Data Lake Data Warehouse Cost-Benefit Data Transformation

Seamless integration of data lake and data warehouse using Amazon Redshift Spectrum and Amazon DataZone

AWS Big Data

AUGUST 15, 2024

Unifying these necessitates additional data processing, requiring each business unit to provision and maintain a separate data warehouse. This burdens business units focused solely on consuming the curated data for analysis and not concerned with data management tasks, cleansing, or comprehensive data processing.

Data Lake

Data Lake Data Warehouse Data Governance Publishing

Looker Simplifies Business Intelligence in the Cloud

David Menninger's Analyst Perspectives

MARCH 17, 2022

Organizations face various challenges with analytics and business intelligence processes, including data curation and modeling across disparate sources and data warehouses, maintaining data quality and ensuring security and governance.

Business Intelligence

Business Intelligence Data Warehouse Data Quality Dashboards

Implement data quality checks on Amazon Redshift data assets and integrate with Amazon DataZone

AWS Big Data

AUGUST 15, 2024

Data quality is crucial in data pipelines because it directly impacts the validity of the business insights derived from the data. Today, many organizations use AWS Glue Data Quality to define and enforce data quality rules on their data at rest and in transit.

Data Quality

Data Quality Visualization Metadata Key Performance Indicator

The DataOps Vendor Landscape, 2021

DataKitchen

APRIL 13, 2021

Testing and Data Observability. Process Analytics. We have also included vendors for the specific use cases of ModelOps, MLOps, DataGovOps and DataSecOps which apply DataOps principles to machine learning, AI, data governance, and data security operations. . Reflow — A system for incremental data processing in the cloud.

Testing

Testing Machine Learning Consulting Data Science

Visualize data quality scores and metrics generated by AWS Glue Data Quality

AWS Big Data

JUNE 6, 2023

AWS Glue Data Quality allows you to measure and monitor the quality of data in your data repositories. It’s important for business users to be able to see quality scores and metrics to make confident business decisions and debug data quality issues. An AWS Glue crawler crawls the results.

Data Quality

Data Quality Metrics Visualization Dashboards

The future of data: A 5-pillar approach to modern data management

CIO Business Intelligence

DECEMBER 11, 2024

This approach is repeatable, minimizes dependence on manual controls, harnesses technology and AI for data management and integrates seamlessly into the digital product development process. They must also select the data processing frameworks such as Spark, Beam or SQL-based processing and choose tools for ML.

Management

Management Data Governance Data Science Reporting

AWS Glue Data Quality is Generally Available

AWS Big Data

JUNE 6, 2023

We are excited to announce the General Availability of AWS Glue Data Quality. Our journey started by working backward from our customers who create, manage, and operate data lakes and data warehouses for analytics and machine learning.

Data Quality

Data Quality Statistics Data Lake Visualization

Database vs. Data Warehouse: What’s the Difference?

Jet Global

MAY 28, 2019

Whether the reporting is being done by an end user, a data science team, or an AI algorithm, the future of your business depends on your ability to use data to drive better quality for your customers at a lower cost. So, when it comes to collecting, storing, and analyzing data, what is the right choice for your enterprise?

Data Warehouse

Data Warehouse Reporting Business Intelligence Sales

Alation 2022.2: Open Data Quality Initiative and Enhanced Data Governance

Alation

MAY 24, 2022

generally available on May 24, Alation introduces the Open Data Quality Initiative for the modern data stack, giving customers the freedom to choose the data quality vendor that’s best for them with the added confidence that those tools will integrate seamlessly with Alation’s Data Catalog and Data Governance application.

Data Quality

Data Quality Data Governance Metadata Metrics

Getting started with AWS Glue Data Quality from the AWS Glue Data Catalog

AWS Big Data

JUNE 6, 2023

AWS Glue is a serverless data integration service that makes it simple to discover, prepare, and combine data for analytics, machine learning (ML), and application development. Hundreds of thousands of customers use data lakes for analytics and ML to make data-driven business decisions.

Data Quality

Data Quality Data-driven Data Lake Metrics

What is a Data Mesh?

DataKitchen

AUGUST 3, 2021

The past decades of enterprise data platform architectures can be summarized in 69 words. First-generation – expensive, proprietary enterprise data warehouse and business intelligence platforms maintained by a specialized team drowning in technical debt. The organizational concepts behind data mesh are summarized as follows.

Data Architecture

Data Architecture Data Lake Cost-Benefit Data Warehouse

Your Data Won’t Speak Unless You Ask It The Right Data Analysis Questions

datapine

JANUARY 24, 2021

Based on your company’s strategy, goals, budget, and target customers you should prepare a set of questions that will smoothly walk you through the online data analysis and help you arrive at relevant insights. This genie (who we’ll call Data Dan) embodies the idea of a perfect data analytics platform through his magic powers.

IT Statistics KPI Data-driven

What is DataOps? Collaborative, cross-functional analytics

CIO Business Intelligence

DECEMBER 22, 2022

DataOps (data operations) is an agile, process-oriented methodology for developing and delivering analytics. It brings together DevOps teams with data engineers and data scientists to provide the tools, processes, and organizational structures to support the data-focused enterprise. What is DataOps?

Analytics

Analytics Machine Learning Data mining Data Quality

How ANZ Institutional Division built a federated data platform to enable their domain teams to build data products to support business outcomes

AWS Big Data

DECEMBER 4, 2024

Domain ownership recognizes that the teams generating the data have the deepest understanding of it and are therefore best suited to manage, govern, and share it effectively. This principle makes sure data accountability remains close to the source, fostering higher data quality and relevance.

Metadata

Metadata Data Governance Data Quality Data-driven

Cloud Data Warehouse Migration 101: Expert Tips

Alation

JULY 28, 2022

There was a time when most CIOs would never consider putting their crown jewels — AKA customer data and associated analytics — into the cloud. But today, there is a magic quadrant for cloud databases and warehouses comprising more than 20 vendors. The cloud is no longer synonymous with risk. What do you migrate, how, and when?

Data Warehouse

Data Warehouse Cost-Benefit Data-driven Data Governance

Set up advanced rules to validate quality of multiple datasets with AWS Glue Data Quality

AWS Big Data

JUNE 6, 2023

Poor-quality data can lead to incorrect insights, bad decisions, and lost opportunities. AWS Glue Data Quality measures and monitors the quality of your dataset. It supports both data quality at rest and data quality in AWS Glue extract, transform, and load (ETL) pipelines.

Data Quality

Data Quality Data Lake Visualization Data-driven

How EUROGATE established a data mesh architecture using Amazon DataZone

AWS Big Data

JANUARY 15, 2025

For container terminal operators, data-driven decision-making and efficient data sharing are vital to optimizing operations and boosting supply chain efficiency. Enhance agility by localizing changes within business domains and clear data contracts. Eliminate centralized bottlenecks and complex data pipelines.

IoT

IoT Machine Learning Metadata Data-driven

Why Good Data Management Is Essential to Data Analytics

Jet Global

JANUARY 20, 2021

As the volume of available information continues to grow, data management will become an increasingly important factor in effective business management. Lack of proactive data management, on the other hand, can result in incompatible or inconsistent sources of information, as well as data quality problems.

Data Analytics

Data Analytics Management Analytics Data Warehouse

Your 5-Step Journey from Analytics to AI

CIO Business Intelligence

MARCH 22, 2022

How do you introduce AI into your data and analytics infrastructure? To companies entrenched in decades-old business and IT processes, data fiefdoms, and legacy systems, the task may seem insurmountable. Another option is a data warehouse, which stores processed and refined data. Outcomes you can expect.

Analytics

Analytics Key Performance Indicator Data Warehouse Data-driven

The Reason Many AI and Analytics Projects Fail—and How to Make Sure Yours Doesn’t

CIO Business Intelligence

JANUARY 20, 2023

Topping the list of executive priorities for 2023—a year heralded by escalating economic woes and climate risks—is the need for data driven insights to propel efficiency, resiliency, and other key initiatives. Many companies have been experimenting with advanced analytics and artificial intelligence (AI) to fill this need.

Analytics

Analytics Key Performance Indicator Unstructured Data Deep Learning

Become More Data-Driven by Evolving Analytics Workloads

CIO Business Intelligence

MARCH 9, 2022

Organizations are increasingly trying to grow revenue by mining their data to quickly show insights and provide value. In the past, one option was to use open-source data analytics platforms to analyze data using on-premises infrastructure. Cloudera and Dell Technologies for More Data Insights.

Data-driven

Data-driven Analytics Machine Learning Deep Learning

What is a data engineer? An analytics role in high demand

CIO Business Intelligence

AUGUST 9, 2022

What is a data engineer? Data engineers design, build, and optimize systems for data collection, storage, access, and analytics at scale. They create data pipelines used by data scientists, data-centric applications, and other data consumers. Data engineer job description.

Analytics

Analytics Data Science Statistics Data mining

Building a Beautiful Data Lakehouse

CIO Business Intelligence

MARCH 9, 2022

Applying artificial intelligence (AI) to data analytics for deeper, better insights and automation is a growing enterprise IT priority. But the data repository options that have been around for a while tend to fall short in their ability to serve as the foundation for big data analytics powered by AI.

Data Lake

Data Lake Unstructured Data Data Warehouse Big Data

Take Your SQL Skills To The Next Level With These Popular SQL Books

datapine

SEPTEMBER 27, 2022

Business leaders, developers, data heads, and tech enthusiasts – it’s time to make some room on your business intelligence bookshelf because once again, datapine has new books for you to add. We have already given you our top data visualization books , top business intelligence books , and best data analytics books.

Business Intelligence

Business Intelligence Data Warehouse Data Processing Data mining

Unlock scalability, cost-efficiency, and faster insights with large-scale data migration to Amazon Redshift

AWS Big Data

AUGUST 1, 2024

Large-scale data warehouse migration to the cloud is a complex and challenging endeavor that many organizations undertake to modernize their data infrastructure, enhance data management capabilities, and unlock new business opportunities. This makes sure the new data platform can meet current and future business goals.

Data Warehouse

Data Warehouse KPI Optimization Cost-Benefit

How gaming companies can use Amazon Redshift Serverless to build scalable analytical applications faster and easier

AWS Big Data

MARCH 7, 2023

This post provides guidance on how to build scalable analytical solutions for gaming industry use cases using Amazon Redshift Serverless. Flexible and easy to use – The solutions should provide less restrictive, easy-to-access, and ready-to-use data. A data warehouse is one of the components in a data hub.

Analytics

Analytics Data Warehouse Data Lake Metadata

Informatica’s new data management clouds target health, finance services

CIO Business Intelligence

MAY 24, 2022

Some of the accelerators included as part of the new platform are integrations with Salesforce, NPI data, National Patient Account Services, Workday, Oracle Fusion HCM Cloud, Orange HRM, Salesforce Health Cloud, MedPro, healthcare-focused cloud company Veeva, and HR vendor UltiPro. Analytics for faster decision making.

Finance

Finance Management Metadata Machine Learning

Data architecture strategy for data quality

IBM Big Data Hub

JANUARY 5, 2023

Poor data quality is one of the top barriers faced by organizations aspiring to be more data-driven. Ill-timed business decisions and misinformed business processes, missed revenue opportunities, failed business initiatives and complex data systems can all stem from data quality issues.

Data Architecture

Data Architecture Data Quality Strategy Data Lake

The top 15 big data and data analytics certifications

CIO Business Intelligence

JUNE 14, 2023

Data and big data analytics are the lifeblood of any successful business. Getting the technology right can be challenging but building the right team with the right skills to undertake data initiatives can be even harder — a challenge reflected in the rising demand for big data and analytics skills and certifications.

Big Data

Big Data Data Analytics Analytics Predictive Modeling

Bridging the Gap: How ‘Data in Place’ and ‘Data in Use’ Define Complete Data Observability

DataKitchen

SEPTEMBER 21, 2023

In the contemporary data landscape, data teams commonly utilize data warehouses or lakes to arrange their data into L1, L2, and L3 layers. These layers help teams delineate different stages of data processing, storage, and access, offering a structured approach to data management. What is Data in Use?

Testing

Testing Data Quality Predictive Modeling Metrics

How HPE Aruba Supply Chain optimized cost and performance by migrating to an AWS modern data architecture

AWS Big Data

SEPTEMBER 11, 2024

Through a commitment to cutting-edge technologies and a relentless pursuit of quality, HPE Aruba designed this next-generation solution as a cloud-based cross-functional supply chain workflow and analytics tool. In addition, they use AWS Glue jobs for orchestrating validation jobs and moving data through the data warehouse.

Data Architecture

Data Architecture Optimization Data Warehouse Metadata

Power of ETL: Transforming Business Decision Making with Data Insights

Smart Data Collective

JULY 9, 2023

ETL (Extract, Transform, Load) is a crucial process in the world of data analytics and business intelligence. By understanding the power of ETL, organisations can harness the potential of their data and gain valuable insights that drive informed choices. Both approaches aim to improve data quality and enable accurate analysis.

Data Quality

Data Quality Data Warehouse Data-driven Reporting

Breaking down data silos for digital success

CIO Business Intelligence

NOVEMBER 7, 2023

Centralized reporting boosts data value For more than a decade, pediatric health system Phoenix Children’s has operated a data warehouse containing more than 120 separate data systems, providing the ability to connect data from disparate systems. Companies should also incorporate data discovery, Higginson says.

Data Warehouse

Data Warehouse Digital Transformation Data-driven Reporting

Accelerate analytics on Amazon OpenSearch Service with AWS Glue through its native connector

AWS Big Data

DECEMBER 21, 2023

As the volume and complexity of analytics workloads continue to grow, customers are looking for more efficient and cost-effective ways to ingest and analyse data. OpenSearch Service is used for multiple purposes, such as observability, search analytics, consolidation, cost savings, compliance, and integration.

Analytics

Analytics IT Data Lake Visualization

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

AWS Big Data

DECEMBER 4, 2024

With this new functionality, customers can create up-to-date replicas of their data from applications such as Salesforce, ServiceNow, and Zendesk in an Amazon SageMaker Lakehouse and Amazon Redshift. SageMaker Lakehouse gives you the flexibility to access and query your data in-place with all Apache Iceberg compatible tools and engines.

Data Integration

Data Integration Data Lake Statistics Data-driven

Data governance in the age of generative AI

AWS Big Data

FEBRUARY 29, 2024

First, many LLM use cases rely on enterprise knowledge that needs to be drawn from unstructured data such as documents, transcripts, and images, in addition to structured data from data warehouses. Implement data privacy policies. Implement data quality by data type and source.

Data Governance

Data Governance Unstructured Data Metadata Data Lake

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

The Ultimate Guide to Modern Data Quality Management (DQM) For An Effective Data Quality Control Driven by The Right Metrics

Webinars

Trending Sources

Talend Data Fabric Simplifies Data Life Cycle Management

Webinars

SAP Datasphere Powers Business at the Speed of Data

How I Broke Our SLA and Delighted Our Customer

Data’s dark secret: Why poor quality cripples AI and growth

Perform data parity at scale for data modernization programs using AWS Glue Data Quality

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

Seamless integration of data lake and data warehouse using Amazon Redshift Spectrum and Amazon DataZone

Looker Simplifies Business Intelligence in the Cloud

Implement data quality checks on Amazon Redshift data assets and integrate with Amazon DataZone

The DataOps Vendor Landscape, 2021

Visualize data quality scores and metrics generated by AWS Glue Data Quality

The future of data: A 5-pillar approach to modern data management

AWS Glue Data Quality is Generally Available

Database vs. Data Warehouse: What’s the Difference?

Alation 2022.2: Open Data Quality Initiative and Enhanced Data Governance

Getting started with AWS Glue Data Quality from the AWS Glue Data Catalog

What is a Data Mesh?

Your Data Won’t Speak Unless You Ask It The Right Data Analysis Questions

What is DataOps? Collaborative, cross-functional analytics

How ANZ Institutional Division built a federated data platform to enable their domain teams to build data products to support business outcomes

Cloud Data Warehouse Migration 101: Expert Tips

Set up advanced rules to validate quality of multiple datasets with AWS Glue Data Quality

How EUROGATE established a data mesh architecture using Amazon DataZone

Why Good Data Management Is Essential to Data Analytics

Your 5-Step Journey from Analytics to AI

The Reason Many AI and Analytics Projects Fail—and How to Make Sure Yours Doesn’t

Become More Data-Driven by Evolving Analytics Workloads

What is a data engineer? An analytics role in high demand

Building a Beautiful Data Lakehouse

Take Your SQL Skills To The Next Level With These Popular SQL Books

Unlock scalability, cost-efficiency, and faster insights with large-scale data migration to Amazon Redshift

How gaming companies can use Amazon Redshift Serverless to build scalable analytical applications faster and easier

Informatica’s new data management clouds target health, finance services

Data architecture strategy for data quality

The top 15 big data and data analytics certifications

Bridging the Gap: How ‘Data in Place’ and ‘Data in Use’ Define Complete Data Observability

How HPE Aruba Supply Chain optimized cost and performance by migrating to an AWS modern data architecture

Power of ETL: Transforming Business Decision Making with Data Insights

Breaking down data silos for digital success

Accelerate analytics on Amazon OpenSearch Service with AWS Glue through its native connector

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

Data governance in the age of generative AI

Stay Connected