Digital transformation has created a digital presence for nearly everything we do, and advances in artificial intelligence (AI) and machine learning (ML) over the past decade have dramatically altered the data landscape. Implementing ML capabilities can help find the right thresholds.
Data debt that undermines decision-making: In Digital Trailblazer, I share a story of a private company that reported a profitable year to the board, only to return after the holiday to find that data quality issues and calculation mistakes turned it into an unprofitable one.
In addition to real-time analytics and visualization, the data needs to be shared for long-term data analytics and machine learning applications. To achieve this, EUROGATE designed an architecture that uses Amazon DataZone to publish specific digital twin data sets, enabling access to them with SageMaker in a separate AWS account.
Many enterprises encounter bottlenecks related to data quality, model deployment, and infrastructure requirements that hinder scaling efforts. Effortless model deployment with Cloudera AI Inference: The Cloudera AI Inference service offers a powerful, production-grade environment for deploying AI models at scale.
They struggle with ensuring consistency, accuracy, and relevance in their product information, which is critical for delivering exceptional shopping experiences, training reliable AI models, and building trust with their customers. The platform offers tailored solutions for different market segments.
Fortunately, Teradata offers integrations with many modular tools that facilitate routine processes, allowing data engineers to focus on high-value tasks such as governance, data quality, and efficiency. The data pipeline is composed of multiple stages, beginning with the ingestion of raw data through Airbyte.
They are often unable to handle large, diverse data sets from multiple sources. Another issue is ensuring dataquality through cleansing processes to remove errors and standardize formats. Staffing teams with skilled data scientists and AI specialists is difficult, given the severe global shortage of talent.
That means creating custom models, fine-tuning existing models, or using retrieval-augmented generation (RAG) embeddings to give gen AI systems access to up-to-date and accurate corporate information. “We didn’t want our data going into a public model,” says Matt Bostrom, Spirent’s VP of enterprise technology and strategy.
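For readers unfamiliar with the RAG pattern mentioned above, here is a minimal sketch in Python. The corpus, question, and word-overlap scorer are all illustrative assumptions; a production system would use a real embedding model and vector store, but the flow — retrieve private context, then prepend it to the prompt — is the same.

```python
# Minimal RAG sketch: retrieve the most relevant internal document and prepend
# it to the prompt, so a general-purpose model can answer from private data it
# was never trained on. The corpus and scorer are toy stand-ins.
corpus = [
    "Spirent's Q3 network test revenue grew 12% year over year.",
    "The employee handbook covers the remote work policy.",
    "Support contracts renew automatically after 12 months.",
]

def score(query: str, doc: str) -> float:
    """Jaccard word overlap as a crude relevance proxy (a real system embeds text)."""
    q, d = set(query.lower().split()), set(doc.lower().split())
    return len(q & d) / len(q | d)

def retrieve(query: str) -> str:
    return max(corpus, key=lambda doc: score(query, doc))

question = "How did network test revenue change in Q3?"
context = retrieve(question)
prompt = (
    "Answer using only the context below.\n"
    f"Context: {context}\n"
    f"Question: {question}"
)
print(prompt)  # this assembled prompt would be sent to the LLM
```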
The solution: data strategy as well as AI. It is tempting to think that the answer to successful AI is to implement more AI! Instead, organizations must prioritize cohesive data governance, rigorous data quality controls, and effective integration frameworks. A strong data strategy is the foundation of sustainable AI success.
Data has become an invaluable asset for businesses, offering critical insights to drive strategic decision-making and operational optimization. Each service is hosted in a dedicated AWS account and is built and maintained by a product owner and a development team, as illustrated in the following figure.
This approach showed limitations as the data complexity increased, data volumes grew, and demand for quick, business-driven insights rose. These challenges are encountered by financial institutions worldwide, leading to a reassessment of traditional data management practices.
SageMaker brings together AWS artificial intelligence and machine learning (AI/ML) and analytics capabilities and delivers an integrated experience for analytics and AI with unified access to data. SageMaker’s compatibility with OpenLineage can help simplify governance of your data assets and increase trust in your data.
Domain ownership recognizes that the teams generating the data have the deepest understanding of it and are therefore best suited to manage, govern, and share it effectively. This principle ensures that data accountability remains close to the source, fostering higher data quality and relevance.
Data domains: In the data mesh architecture, data domains represent distinct business areas or use cases within an organization. Each data domain is owned and managed by a dedicated team responsible for its data quality, governance, and accessibility.
Though vector embedding and high-dimensional mapping into the vector space model (VSM) have recently gained prominence, mindshare, and usage with the advent of GenAI, VSM has been used as a key information retrieval technique for over two decades. Popular full-text search engines have been leveraging VSM for years.
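As a concrete illustration of the classic VSM retrieval the passage describes, here is a brief TF-IDF sketch. It assumes scikit-learn is available, and the documents and query are invented for the example.

```python
# TF-IDF vector space model: represent documents as weighted term vectors and
# rank them against a query by cosine similarity -- the decades-old IR technique
# underlying full-text search, long before GenAI embeddings.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

docs = [
    "graph databases store nodes and edges",
    "full-text search engines rank documents by relevance",
    "vector embeddings map text into high-dimensional space",
]
vectorizer = TfidfVectorizer()
doc_matrix = vectorizer.fit_transform(docs)           # one TF-IDF vector per document

query_vec = vectorizer.transform(["rank documents in a search engine"])
scores = cosine_similarity(query_vec, doc_matrix)[0]  # similarity to each document

best = scores.argmax()
print(f"best match (score {scores[best]:.2f}): {docs[best]}")
```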
My journey started by looking at the AI opportunity landscape in terms of business and technology maturity models, patterns, risk, reward, and the path to business value. Start with data as an AI foundation: Data quality is the first and most critical investment priority for any viable enterprise AI strategy.
1) What Is Data Quality Management?
4) Data Quality Best Practices.
5) How Do You Measure Data Quality?
6) Data Quality Metrics Examples.
7) Data Quality Control: Use Case.
8) The Consequences Of Bad Data Quality.
9) 3 Sources Of Low-Quality Data.
Not least is the broadening realization that ML models can fail. And that’s why model debugging, the art and science of understanding and fixing problems in ML models, is so critical to the future of ML. Because all ML models make mistakes, everyone who cares about ML should also care about model debugging. [1]
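As one illustration of what model debugging can look like in practice, the sketch below slices a synthetic dataset by segment to expose a failure that a single aggregate metric would hide. The data, segments, and model choice are all assumptions for demonstration.

```python
# Model debugging via error analysis by data slice: a model can look fine in
# aggregate while quietly failing on a minority segment. All data is synthetic.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 2))
segment = (rng.random(1000) < 0.2).astype(int)  # minority slice, e.g., new users
# Labels follow feature 0 for the majority slice but feature 1 for the minority,
# so a single global model fits the majority and fails on the minority.
y = np.where(segment == 0, X[:, 0] > 0, X[:, 1] > 0).astype(int)

model = LogisticRegression().fit(X, y)

print(f"overall accuracy: {model.score(X, y):.2f}")
for s in (0, 1):
    mask = segment == s
    print(f"segment {s} accuracy: {model.score(X[mask], y[mask]):.2f}")
```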
DataOps needs a directed graph-based workflow that contains all the data access, integration, model, and visualization steps in the data analytic production process. It orchestrates complex pipelines, toolchains, and tests across teams, locations, and data centers. OwlDQ — Predictive data quality.
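A directed graph of pipeline steps like this can be sketched with nothing more than the Python standard library; the step names below are illustrative assumptions, and a real DataOps orchestrator would attach execution, tests, and monitoring to each node.

```python
# A DataOps-style directed graph of pipeline steps, resolved into a valid run
# order with the standard library's topological sorter.
from graphlib import TopologicalSorter

# Each step maps to the set of steps it depends on.
pipeline = {
    "ingest": set(),
    "validate": {"ingest"},            # data quality checks gate downstream steps
    "integrate": {"validate"},
    "train_model": {"integrate"},
    "visualize": {"integrate"},
    "publish": {"train_model", "visualize"},
}

for step in TopologicalSorter(pipeline).static_order():
    print(f"run: {step}")  # an orchestrator would execute and test each step here
```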
Several weeks ago (prior to the Omicron wave), I got to attend my first conference in roughly two years: Dataversity’s Data Quality and Information Quality Conference. Ryan Doupe, Chief Data Officer of American Fidelity, held a thought-provoking session that resonated with me. Step 2: Data Definitions.
Poor-quality data can lead to incorrect insights, bad decisions, and lost opportunities. AWS Glue Data Quality measures and monitors the quality of your dataset. It supports both data quality at rest and data quality in AWS Glue extract, transform, and load (ETL) pipelines.
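As a rough sketch of how such checks might be defined programmatically, the snippet below registers a ruleset with AWS Glue Data Quality via boto3, using the service's Data Quality Definition Language (DQDL). The rule contents and the orders/sales table names are assumptions for illustration.

```python
# Registering an AWS Glue Data Quality ruleset with boto3. The DQDL rules check
# completeness, near-uniqueness, and a value range on an assumed catalog table.
import boto3

glue = boto3.client("glue")

ruleset = """
Rules = [
    IsComplete "order_id",
    Uniqueness "order_id" > 0.99,
    ColumnValues "price" > 0
]
"""

glue.create_data_quality_ruleset(
    Name="orders_basic_checks",  # hypothetical ruleset name
    Description="Completeness, uniqueness, and range checks for orders",
    Ruleset=ruleset,
    TargetTable={"TableName": "orders", "DatabaseName": "sales"},  # assumed catalog entries
)
```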
But there’s a host of new challenges when it comes to managing AI projects: more unknowns, non-deterministic outcomes, new infrastructures, new processes and new tools. For machine learning systems used in consumer internet companies, models are often continuously retrained many times a day using billions of entirely new input-output pairs.
Some of these ‘structures’ involve organizing related information into tables; for instance, a table about cars might include columns for make, model, year of manufacture, and color. With a MySQL dashboard builder, for example, you can connect all the data with a few clicks.
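To make the cars example concrete, here is a small sketch using Python's standard-library sqlite3 module; the schema mirrors the columns described above, and the same SQL would look nearly identical in MySQL.

```python
# The "cars" table structure described above, built in an in-memory SQLite
# database so the example runs with no external dependencies.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE cars (
        id    INTEGER PRIMARY KEY,
        make  TEXT NOT NULL,
        model TEXT NOT NULL,
        year  INTEGER,       -- year of manufacture
        color TEXT
    )
""")
conn.execute(
    "INSERT INTO cars (make, model, year, color) VALUES (?, ?, ?, ?)",
    ("Toyota", "Corolla", 2021, "blue"),
)
for row in conn.execute("SELECT make, model, year, color FROM cars"):
    print(row)
```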
Added data quality capability ready for an AI era: Data quality has never been more important than as we head into this next AI-focused era. erwin Data Quality is the data quality heart of erwin Data Intelligence.
Over the past decade, deep learning arose from a seismic collision of data availability and sheer compute power, enabling a host of impressive AI capabilities. Data must be laboriously collected, curated, and labeled with task-specific annotations to train AI models. We stand on the frontier of an AI revolution.
Business intelligence is moving away from the traditional engineering model: analysis, design, construction, testing, and implementation. In the traditional model, communication between developers and business users is not a priority. You need to determine whether you are going with an on-premises or cloud-hosted strategy.
Instead of requiring you to install software on your own servers, SaaS companies let you rent software they host, typically for a monthly or yearly subscription fee. More and more CRM, marketing, and finance-related tools use SaaS business intelligence and technology, and even Adobe’s Creative Suite has adopted the model.
Data governance is best defined as the strategic, ongoing and collaborative processes involved in managing data’s access, availability, usability, quality and security in line with established internal policies and relevant data regulations.
I recently participated in a web seminar on the Art and Science of FP&A Storytelling, hosted by Larysa Melnychuk, founder and CEO of FP&A Research, along with guests Pasquale della Puca, part of the global finance team at Beckman Coulter, and Angelica Ancira, Global Digital Planning Lead at PepsiCo. Here are the key takeaways.
Four-layered data lake and data warehouse architecture – The architecture comprises four layers, including the analytical layer, which houses purpose-built fact and dimension datasets that are hosted in Amazon Redshift. This enables data-driven decision-making across the organization.
So, we aggregated all this data, applied some machine learning algorithms on top of it, and then fed it into large language models (LLMs); we now use generative AI (genAI), which outputs these care plans. We created our data model in a way that satisfied the requirements of our vision.
Adam Wood, director of data governance and data quality at a financial services institution (FSI). Sam Charrington, founder and host of the TWIML AI Podcast. Common data governance challenges for global enterprises: Setting up a multidisciplinary data team.
“Oracle Cloud Infrastructure is now capable of hosting a full range of traditional and modern IT workloads, and for many enterprise customers, Oracle is a proven vendor,” says David Wright, vice president of research for cloud infrastructure strategies at research firm Gartner. The inherent risk is trust.
Data governance is a key enabler for teams adopting a data-driven culture and operational model to drive innovation with data. This post explains how you can extend the governance capabilities of Amazon DataZone to data assets hosted in relational databases based on MySQL, PostgreSQL, Oracle or SQL Server engines.
Companies still often accept the risk of using internal data when exploring large language models (LLMs) because this contextual data is what enables LLMs to change from general-purpose to domain-specific knowledge. In the generative AI or traditional AI development cycle, data ingestion serves as the entry point.
Last time, we discussed the steps that a modeler must pay attention to when building ML models for use within a financial institution. In summary, to ensure that they have built a robust model, modelers must make certain that they have designed the model in a way that is backed by research and industry-adopted practices.
We recently hosted a roundtable focused on optimizing risk and exposure management with data insights. Across the industry, the pandemic caused a huge breakdown in model performance due to the change in macroeconomic conditions and government stimulus packages. Mental health of employees is a critical area to monitor.
The mission also sets a target for 50% of high-priority data quality issues to be resolved within a period defined by a cross-government framework. These systems will also be hosted – or are planned to be hosted – in appropriate environments aligned to the cross-government cloud and technology infrastructure strategy.
However, getting into the more difficult types of implementations — the fine-tuned models, vector databases to provide context and up-to-date information to the AI systems, and APIs to integrate gen AI into workflows — is where problems might crop up. That’s fine, but language models are great for language. They need stability.
It’s embedded in the applications we use every day, and the security model overall is pretty airtight. Microsoft has also made investments beyond OpenAI, for example in Mistral and Meta’s Llama models, in its own small language models like Phi, and by partnering with providers like Cohere, Hugging Face, and Nvidia. “That’s risky.”
If you’re part of a growing SaaS company and are looking to accelerate your success, leveraging the power of data is the way to gain a real competitive edge. A SaaS dashboard is a powerful business intelligence tool that offers a host of benefits for ambitious tech businesses. That’s where SaaS dashboards come into play.
National Grid is a big Microsoft Azure cloud customer due to its secure, proprietary nature, says Karaboutis, and is using a bevy of leading-edge tools, from Snowflake, Azure, and Matillion ETL for data tooling, Informatica for data quality, Reltio for master data management, and Blue Prism for RPA, to name a few.
HPC5’s performance level enables sophisticated in-house algorithms to process subsoil data, as well as geophysical and seismic information from around the world. For optimizing existing resources, Eni uses HPC5 to model, study, and ultimately improve refining operations.
It culminates with a capstone project that requires creating a machine learning model. Data Science Dojo: Due to the short nature of the course, it’s tailored to those already in the industry who want to learn more about data science or brush up on the latest skills. Switchup rating: 5.0 (out of 5). Cost: $1,099.