For all the excitement about machine learning (ML), there are serious impediments to its widespread adoption. Security vulnerabilities: adversarial actors can compromise the confidentiality, integrity, or availability of an ML model or the data associated with the model, creating a host of undesirable outcomes.
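As a toy illustration of the integrity risk (not taken from the article), a small, targeted perturbation can flip a linear classifier's decision. The weights and input below are invented for the sketch:

```python
# Hypothetical linear classifier: weights, bias, and input are illustrative only.
w = [0.9, -0.4, 0.2]
b = 0.1
x = [0.5, 0.8, 0.3]  # a legitimate input

def score(features):
    """Signed decision score; positive means class 1."""
    return sum(wi * xi for wi, xi in zip(w, features)) + b

# FGSM-style perturbation: nudge each feature against the sign of its weight.
eps = 0.4
x_adv = [xi - eps * (1 if wi > 0 else -1) for wi, xi in zip(w, x)]

print(score(x) > 0)      # original input classified positive
print(score(x_adv) > 0)  # perturbed input flips to negative
```

With a per-feature budget of only 0.4, the score moves from +0.29 to −0.31 and the prediction flips, which is the kind of outcome adversarial-robustness work tries to prevent.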
We have also included vendors for the specific use cases of ModelOps, MLOps, DataGovOps and DataSecOps, which apply DataOps principles to machine learning, AI, data governance, and data security operations. Dagster / ElementL — a data orchestrator for machine learning, analytics, and ETL.
If you’re already a software product manager (PM), you have a head start on becoming a PM for artificial intelligence (AI) or machine learning (ML). But there’s a host of new challenges when it comes to managing AI projects: more unknowns, non-deterministic outcomes, new infrastructures, new processes and new tools.
The following requirements were essential to the decision to adopt a modern data mesh architecture: Domain-oriented ownership and data-as-a-product: EUROGATE aims to enable scalable and straightforward data sharing across organizational boundaries and to eliminate centralized bottlenecks and complex data pipelines.
It was not alive because the business knowledge required to turn data into value was confined to individuals’ minds, Excel sheets, or lost in analog signals. We are now deciphering rules from patterns in data, embedding business knowledge into ML models, and soon, AI agents will leverage this data to make decisions on behalf of companies.
Several weeks ago (prior to the Omicron wave), I got to attend my first conference in roughly two years: Dataversity’s Data Quality and Information Quality Conference. Ryan Doupe, Chief Data Officer of American Fidelity, held a thought-provoking session that resonated with me. Step 2: Data Definitions.
Domain ownership recognizes that the teams generating the data have the deepest understanding of it and are therefore best suited to manage, govern, and share it effectively. This principle keeps data accountability close to the source, fostering higher data quality and relevance.
With a MySQL dashboard builder, for example, you can connect all the data with a few clicks. A host of notable brands and retailers with colossal inventories and multiple site pages use SQL to enhance their site’s structure, functionality, and MySQL reporting processes. Viescas, Douglas J. Steele, and Ben J.
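As a minimal sketch of the kind of reporting query such a dashboard tool generates (using Python's built-in SQLite in place of MySQL so the example is self-contained; the table and column names are invented):

```python
import sqlite3

# In-memory SQLite database stands in for a MySQL inventory store.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (product TEXT, qty INTEGER, price REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [("widget", 3, 9.99), ("gadget", 1, 24.50), ("widget", 2, 9.99)],
)

# A typical dashboard query: revenue per product, largest first.
rows = conn.execute(
    "SELECT product, SUM(qty * price) AS revenue "
    "FROM orders GROUP BY product ORDER BY revenue DESC"
).fetchall()
print(rows)
```

The same `GROUP BY`/aggregate pattern is what most point-and-click MySQL reporting layers emit under the hood.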
The data science path you ultimately choose will depend on your skillset and interests, but each career path will require some level of programming, data visualization, statistics, and machine learning knowledge and skills. It culminates with a capstone project that requires creating a machine learning model.
Reporting in business intelligence is a seamless process since historical data is also provided within an online reporting tool that can process and generate all the business information needed. Another crucial factor to consider is the ability to utilize real-time data. Enhanced data quality.
Four-layered data lake and data warehouse architecture – The architecture comprises four layers, including the analytical layer, which houses purpose-built facts and dimension datasets that are hosted in Amazon Redshift. AWS services like AWS Lake Formation in conjunction with Atlan help govern data access and policies.
“Oracle Cloud Infrastructure is now capable of hosting a full range of traditional and modern IT workloads, and for many enterprise customers, Oracle is a proven vendor,” says David Wright, vice president of research for cloud infrastructure strategies at research firm Gartner.
These companies face a unique set of data governance challenges regarding infrastructure and compliance on local, national, and international levels. Some organizations are choosing to confront these challenges with the help of tools like machine learning (ML) and artificial intelligence (AI) to automate, streamline, and scale compliance.
This year’s Data Impact Awards were like none other that we’ve ever hosted. While all our winners are doing phenomenal work, one of the most exciting awards of the night was The Data for Enterprise AI category. The company employs approximately 17,000 people and aggregates information on over one billion people and businesses.
So, we aggregated all this data, applied some machine learning algorithms on top of it and then fed it into large language models (LLMs), and now use generative AI (genAI), which gives us an output of these care plans. But the biggest point is data governance. Data governance was the biggest piece that we took care of.
As you experience the benefits of consolidating your data governance strategy on top of Amazon DataZone, you may want to extend its coverage to new, diverse data repositories (either self-managed or as managed services) including relational databases, third-party data warehouses, analytic platforms and more.
On top of a double-digit population growth rate over the past decade, the city hosts more than 40 million visitors in a typical year. The smart city solution incorporates video and sound data inputs from the area, integrated with publicly available, historical data sources, such as crime, weather and social media data.
I recently participated in a web seminar on the Art and Science of FP&A Storytelling, hosted by the founder and CEO of FP&A Research Larysa Melnychuk along with other guests Pasquale della Puca, part of the global finance team at Beckman Coulter, and Angelica Ancira, Global Digital Planning Lead at PepsiCo. The key takeaways.
National Grid is a big Microsoft Azure cloud customer due to its secure, proprietary nature, says Karaboutis, and is using a bevy of leading-edge tools, from Snowflake, Azure, and Matillion ETL for data tooling, Informatica for data quality, Reltio for master data management, and Blue Prism for RPA, to name a few.
We recently hosted a roundtable focused on optimizing risk and exposure management with data insights. To enhance the risk models, one participant identified the need to look at additional data sources, how to do data exchange, and how to link data sets to best manage the models.
SPE wanted to combine their rich reservoirs of data into a single, readily accessible, insights-driven platform that would provide a single source of truth, improving efficiency while reducing cost of ownership and removing redundancies. Doubling down on risky business. The Strategy – ESOAR lets Sony roar.
At the core, digital at Dow is about changing how we work, which includes how we interact with systems, data, and each other to be more productive and to grow. Data is at the heart of everything we do today, from AI to machine learning or generative AI. That’s what we’re running our AI and our machine learning against.
One key component that plays a central role in modern data architectures is the data lake, which allows organizations to store and analyze large amounts of data in a cost-effective manner and run advanced analytics and machine learning (ML) at scale.
For the past 5 years, BMS has used a custom framework called Enterprise Data Lake Services (EDLS) to create ETL jobs for business users. BMS’s EDLS platform hosts over 5,000 jobs and is growing at 15% YoY (year over year). Shovan Kanjilal is a Senior Data Lake Architect working with strategic accounts in AWS Professional Services.
Cloud migrations have been on the rise in recent years for a host of business reasons, but CIOs serious about sustainability are pulling out all the stops. On-prem data centers have an outsized impact on carbon emissions and waste. So, too, are business leaders.
This second part will dive deeper into DataRobot’s Machine Learning Operations capability, and its transformative effect on the machine learning lifecycle. DataRobot’s MLOps product offers a host of features designed to transform organizations’ user experience, firstly through its model-monitoring agents.
A retail company, for example, might have a 360-degree view of customers, which is all fed into analytics engines, machine learning, and other traditional AI to calculate the next best action. So by using the company’s data, a general-purpose language model becomes a useful business tool. “They’re not great for knowledge.”
It is an edge-to-AI suite of capabilities, including edge analytics, data staging, data quality control, data visualization tools, and machine learning. It comprises data applications and transformation functions, and maintains relationships between public cloud and on-premises assets.
The data gathered from cameras and sensors as part of a computer vision system, along with machine learning, make it easier to find missing persons and to identify people who are not allowed to be in a venue. Ready to evolve your analytics strategy or improve your data quality? Just starting out with analytics?
Migrating to Amazon Redshift offers organizations the potential for improved price-performance, enhanced data processing, faster query response times, and better integration with technologies such as machine learning (ML) and artificial intelligence (AI).
Precisely Data Integration, Change Data Capture, and Data Quality tools support CDP Public Cloud as well as CDP Private Cloud. Data pipelines that are bursty in nature can leverage the public cloud CDE service, while longer-running persistent loads can run on-prem.
This calls for the organization to also make important decisions regarding data, talent and technology: A well-crafted strategy will provide a clear plan for managing, analyzing and leveraging data for AI initiatives. Algorithms: Algorithms are the rules or instructions that enable machines to learn, analyze data and make decisions.
The way to manage this is by embedding data integration, data quality monitoring, and other capabilities into the data platform itself, allowing financial firms to streamline these processes and freeing them to focus on operationalizing AI solutions while promoting access to data, maintaining data quality, and ensuring compliance.
A Gartner Marketing survey found only 14% of organizations have successfully implemented a C360 solution, due to lack of consensus on what a 360-degree view means, challenges with dataquality, and lack of cross-functional governance structure for customer data.
He outlined the challenges of working effectively with AI and machine learning, where knowledge graphs are a differentiator. According to him, “failing to ensure data quality in capturing and structuring knowledge turns any knowledge graph into a piece of abstract art”.
Recently members of our community came together for a roundtable discussion, hosted by Dell Technologies, about trends, trials, and all the excitement around what’s next. Advances in Artificial Intelligence and Machine Learning (AI/ML): AI/ML will continue growing as an important workload in HPC.
Modern data governance relies on automation, which reduces costs. Automated tools make data governance processes very cost-effective. Machine learning plays a key role, as it can increase the speed and accuracy of metadata capture and categorization. This empowers leaders to see and refine human processes around data.
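A toy sketch of automated metadata categorization: a keyword heuristic stands in here for the ML classifiers such governance tools actually train, and the column names and categories are invented:

```python
# Simplistic stand-in for ML-driven metadata categorization: tag columns by
# keyword. Real tools learn these mappings from labeled metadata instead.
RULES = {
    "pii": ["email", "ssn", "phone", "name"],
    "financial": ["price", "revenue", "salary"],
    "temporal": ["date", "created_at", "timestamp"],
}

def categorize(column_name: str) -> str:
    """Return the first category whose keywords match the column name."""
    lowered = column_name.lower()
    for category, keywords in RULES.items():
        if any(kw in lowered for kw in keywords):
            return category
    return "uncategorized"

tags = {c: categorize(c) for c in ["customer_email", "order_date", "unit_price", "notes"]}
print(tags)
```

Automating even this crude tagging at catalog scale is what makes metadata capture fast; the ML versions simply replace the hand-written rules with learned ones.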
However it’s defined, data governance is among the hottest topics in data management. Organizations are already governing data, just informally. Organizations don’t need to spend a lot of money to get data governance. What organizations actually govern is data-consumer behavior, not the data itself.
This usually involved gathering market and property information, socio-economic data about a city on a zip code level and information regarding access to amenities (e.g., DataRobot combines these datasets and data types into one training dataset used to build machine learning models. Rapid Modeling with DataRobot AutoML.
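A minimal sketch of combining such sources into a single training table by joining on zip code; the field names and values below are invented for illustration:

```python
# Invented sample records keyed by zip code, standing in for the market and
# socio-economic sources; a real pipeline would pull these from databases.
market = {"10001": {"median_rent": 3200}, "94103": {"median_rent": 2900}}
socio = {"10001": {"median_income": 72000}, "94103": {"median_income": 81000}}

# Inner join on zip code: one merged row per zip present in both sources.
training_rows = [
    {"zip": z, **market[z], **socio[z]}
    for z in market.keys() & socio.keys()
]
print(sorted(training_rows, key=lambda r: r["zip"]))
```

Each merged row then carries all features for one zip code, which is the shape a tabular modeling tool expects as training input.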
Validating Modern Machine Learning (ML) Methods Prior to Productionization. Validating Machine Learning Models. Last time, we discussed the steps that a modeler must pay attention to when building out ML models to be utilized within the financial institution.
Amazon Redshift is a popular cloud data warehouse, offering a fully managed cloud-based service that seamlessly integrates with an organization’s Amazon Simple Storage Service (Amazon S3) data lake, real-time streams, machine learning (ML) workflows, transactional workflows, and much more—all while providing up to 7.9x
I had the pleasure of chatting with John Furrier of theCUBE about how our recent round of funding will fuel innovation within the Alation Data Catalog. I’m John Furrier, co-host of theCUBE. Now you have data challenges around, okay, where’s the data deployed? Where am I using the data?
Recently, Spark set a new record by processing 100 terabytes of data in just 23 minutes, surpassing Hadoop’s previous world record of 71 minutes. This is why big tech companies are switching to Spark, as it is highly suitable for machine learning and artificial intelligence.
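Those record figures imply roughly a 3x throughput gain, which a quick back-of-the-envelope calculation makes concrete (using 1 TB = 10^12 bytes):

```python
# Back-of-the-envelope throughput from the sort-record figures in the text.
DATA_BYTES = 100e12            # 100 TB, using 1 TB = 10**12 bytes
spark_secs = 23 * 60
hadoop_secs = 71 * 60

spark_gbps = DATA_BYTES / spark_secs / 1e9    # ~72.5 GB/s
hadoop_gbps = DATA_BYTES / hadoop_secs / 1e9  # ~23.5 GB/s
print(round(spark_gbps, 1), round(hadoop_gbps, 1), round(spark_gbps / hadoop_gbps, 2))
```

The ratio reduces to 71/23 ≈ 3.09, so the cited times amount to about a threefold aggregate throughput difference.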