Without clarity in metrics, it’s impossible to do meaningful experimentation. AI PMs must ensure that experimentation occurs during three phases of the product lifecycle. Phase 1: Concept. During the concept phase, it’s important to determine whether it’s even possible for an AI product “intervention” to move an upstream business metric.
AI PMs should enter feature development and experimentation phases only after deciding what problem they want to solve as precisely as possible, and placing the problem into one of these categories. Experimentation: It’s just not possible to create a product by building, evaluating, and deploying a single model.
This post is a primer on the delightful world of testing and experimentation (A/B, Multivariate, and a new term from me: Experience Testing). Experimentation and testing help us figure out when we are wrong, quickly and repeatedly, and if you think about it, that is a great thing for our customers and for our employers. Counter claims?
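As a concrete sketch of the A/B half of that world, here is a minimal two-proportion z-test in Python. The conversion counts are entirely hypothetical, and this is one common way to analyze an A/B test, not the specific method any post here prescribes.

```python
# Minimal two-proportion z-test for an A/B experiment.
# The counts below are made up for illustration.
from math import sqrt, erf

def ab_z_test(conv_a, n_a, conv_b, n_b):
    """Return the z-score and two-sided p-value for two conversion rates."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    p_pool = (conv_a + conv_b) / (n_a + n_b)           # pooled rate under H0
    se = sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    # Normal CDF via erf; two-sided p-value.
    p_value = 2 * (1 - 0.5 * (1 + erf(abs(z) / sqrt(2))))
    return z, p_value

z, p = ab_z_test(conv_a=200, n_a=10_000, conv_b=260, n_b=10_000)
print(f"z = {z:.2f}, p = {p:.4f}")
```

With these made-up numbers the lift is statistically significant; in practice you would also pre-register the sample size and metric before peeking.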
Let’s start by considering the job of a non-ML software engineer: writing traditional software deals with well-defined, narrowly-scoped inputs, which the engineer can exhaustively and cleanly model in the code. Not only is data larger, but models—deep learning models in particular—are much larger than before.
Instead of writing code with hard-coded algorithms and rules that always behave in a predictable manner, ML engineers collect a large number of examples of input and output pairs and use them as training data for their models. The model is produced by code, but it isn’t code; it’s an artifact of the code and the training data.
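A toy illustration of that idea, with made-up (input, output) pairs: the “model” that training produces is an artifact (here, just a slope and an intercept computed by closed-form least squares), not hand-written rules.

```python
# Made-up (x, y) training examples; the "model" is derived from them.
pairs = [(1, 3.1), (2, 4.9), (3, 7.2), (4, 8.8)]

n = len(pairs)
sx = sum(x for x, _ in pairs)
sy = sum(y for _, y in pairs)
sxx = sum(x * x for x, _ in pairs)
sxy = sum(x * y for x, y in pairs)

# Closed-form least squares: the learned artifact is just two numbers.
slope = (n * sxy - sx * sy) / (n * sxx - sx * sx)
intercept = (sy - slope * sx) / n

def predict(x):
    """The trained model: behavior comes from the data, not coded rules."""
    return slope * x + intercept

print(round(predict(5), 2))
```

Change the training pairs and the model changes, with no edits to the code: exactly the code-versus-artifact distinction the excerpt draws.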
While generative AI has been around for several years , the arrival of ChatGPT (a conversational AI tool for all business occasions, built and trained from large language models) has been like a brilliant torch brought into a dark room, illuminating many previously unseen opportunities.
By the time you are done with this post you'll have complete knowledge of what's ugly and bad when it comes to attribution modeling. You'll know how to use the good model, even if it is far from perfect. Multi-Channel Attribution Models. Linear Attribution Model.
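The Linear Attribution Model named above can be sketched in a few lines: every channel touched on a conversion path receives an equal share of that conversion's credit. The paths and values below are hypothetical.

```python
# Linear attribution: split each conversion's value equally across
# the channels on its path. Paths and values are hypothetical.
from collections import defaultdict

def linear_attribution(paths):
    """paths: list of (channels_touched, conversion_value) tuples."""
    credit = defaultdict(float)
    for channels, value in paths:
        share = value / len(channels)   # equal split across touchpoints
        for ch in channels:
            credit[ch] += share
    return dict(credit)

paths = [
    (["search", "email", "display"], 90.0),
    (["search", "display"], 40.0),
]
credits = linear_attribution(paths)
print(credits)  # search: 30+20=50, email: 30, display: 30+20=50
```

The equal split is exactly what makes this model simple and also what makes it "far from perfect": it ignores position and influence on the path.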
How do you get over the frustration of having done attribution modeling and realizing that it is not even remotely the solution to your challenge of using multiple media channels? We'll measure Revenue, Profit (the money we make less cost of goods sold), Expense (cost of campaign), and Net (bottom-line impact). Then ask for a raise.
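Those metric definitions reduce to simple arithmetic; the numbers below are purely illustrative.

```python
# The metric definitions above, as arithmetic (illustrative numbers only).
revenue = 100_000.0          # campaign-attributed revenue
cogs = 60_000.0              # cost of goods sold
expense = 15_000.0           # cost of the campaign
profit = revenue - cogs      # "money we make less cost of goods sold"
net = profit - expense       # bottom-line impact
print(profit, net)           # 40000.0 25000.0
```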
Centralizing analytics helps the organization standardize enterprise-wide measurements and metrics. Central DataOps process measurement function with reports. The center of excellence (COE) model leverages the DataOps team to solve real-world challenges. DataOps Center of Excellence.
In my book, I introduce the Technical Maturity Model: I define technical maturity as a combination of three factors at a given point of time. Technical sophistication: Sophistication measures a team’s ability to use advanced tools and techniques (e.g., PyTorch, TensorFlow, reinforcement learning, self-supervised learning).
DataOps needs a directed graph-based workflow that contains all the data access, integration, model and visualization steps in the data analytic production process. DataOps requires that teams measure their analytic processes in order to see how they are improving over time. Datatron — Automates deployment and monitoring of AI models.
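A minimal sketch of such a directed, graph-based workflow using Python's standard-library graphlib; the step names follow the sentence above (access, integration, model, visualization), but the pipeline itself is hypothetical.

```python
# Directed analytics workflow as a DAG: each step lists the steps
# it depends on. Step names are hypothetical.
from graphlib import TopologicalSorter

workflow = {
    "access":    [],
    "integrate": ["access"],
    "model":     ["integrate"],
    "visualize": ["model"],
}

# static_order() yields steps so dependencies always run first.
order = list(TopologicalSorter(workflow).static_order())
print(order)
```

Real DataOps tooling adds scheduling, retries, and per-step measurement on top of this ordering, which is where the "measure your analytic processes" requirement comes in.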
Yehoshua Coren: Best ways to measure user behavior in a multi-touch, multi-device digital world. Yehoshua I've covered this topic in detail in this blog post: Multi-Channel Attribution: Definitions, Models and a Reality Check. What's possible to measure. What's not possible to measure. Let's do this!
A medical, insurance, or financial large language model (LLM) AI, built from scratch, can cost up to $20 million. Still, a 30% failure rate represents a huge amount of time and money, given how widespread AI experimentation is today. While Gersch recommends tying AI projects to business goals, she also encourages experimentation.
Similarly, in “Building Machine Learning Powered Applications: Going from Idea to Product,” Emmanuel Ameisen states: “Indeed, exposing a model to users in production comes with a set of challenges that mirrors the ones that come with debugging a model.”
Beyond that, we recommend setting up the appropriate data management and engineering framework including infrastructure, harmonization, governance, toolset strategy, automation, and operating model. It is also important to have a strong test and learn culture to encourage rapid experimentation.
Experiments, Parameters and Models: At YouTube, the relationships between system parameters and metrics often seem simple; straight-line models sometimes fit our data well. Here $X$ is a vector of system parameters (e.g., the weight given to Likes in our video recommendation algorithm), while $Y$ is a vector of outcome measures such as different metrics of user experience.
Our mental models of what constitutes a high-performance team have evolved considerably over the past five years. Post-pandemic, high-performance teams excelled at remote and hybrid working models, were more empathetic to individual needs, and leveraged automation to reduce manual work.
Let's listen in as Alistair discusses the lean analytics model… The Lean Analytics Cycle is a simple, four-step process that shows you how to improve a part of your business. Another way to find the metric you want to change is to look at your business model. The business model also tells you what the metric should be.
Cloud maturity models are a useful tool for addressing these concerns, grounding organizational cloud strategy and proceeding confidently in cloud adoption with a plan. Cloud maturity models (or CMMs) are frameworks for evaluating an organization’s cloud adoption readiness on both a macro and individual service level.
Measure everything Looking for ROI too soon is often a product of poor planning, says Rowan Curran, an AI and data science analyst at Forrester. Organizations rolling out AI tools first need to set reasonable expectations and establish key metrics to measure the value of the deployment , he says.
Shift AI experimentation to real-world value Generative AI dominated the headlines in 2024, as organizations launched widespread experiments with the technology to assess its ability to enhance efficiency and deliver new services. Most of all, the following 10 priorities should be at the top of your 2025 to-do list.
Generative AI (GenAI) is rapidly emerging as a game changer for enterprises, but turning its potential into measurable value remains a significant challenge. This stark contrast between experimentation and execution underscores the difficulties in harnessing AI’s transformative power. Of those, just three are considered successful.
By Henning Hohnhold, Deirdre O'Brien, and Diane Tang. In this post we discuss the challenges in measuring and modeling the long-term effect of ads on user behavior. We describe experiment designs which have proven effective for us and discuss the subtleties of trying to generalize the results via modeling.
Building models requires a lot of time and effort. Data scientists can spend weeks just trying to find, capture and transform data into decent features for models, not to mention many cycles of training, tuning, and tweaking models so they’re performant. This means many projects get stuck in endless research and experimentation.
Autonomous Vehicles: self-driving (guided without a human), informed by data streaming from many sensors (cameras, radar, LIDAR), making decisions and taking actions based on computer vision algorithms (ML and AI models for people, things, traffic signs, …). Examples: cars, trucks, taxis. See [link].
AI projects without clear goals or measurable outcomes are unlikely to deliver real value. “To ensure AI is aligned with strategic goals and poised to deliver measurable impact to customers and stakeholders, executives and boards need to prioritize education around AI,” she says. “This trend is concerning,” he says.
Gen AI takes us from single-use models of machine learning (ML) to AI tools that promise to be a platform with uses in many areas, but you still need to validate they’re appropriate for the problems you want solved, and that your users know how to use gen AI effectively. Pilots can offer value beyond just experimentation, of course.
It’s important to understand that ChatGPT is not actually a language model. It’s a convenient user interface built around one specific language model, GPT-3.5, with specialized training. GPT-3.5 is one of a class of language models that are sometimes called “large language models” (LLMs), though that term isn’t very helpful.
Proof that even the most rigid of organizations are willing to explore generative AI arrived this week when the US Department of the Air Force (DAF) launched an experimental initiative aimed at Guardians, Airmen, civilian employees, and contractors. It is not training the model, nor are responses refined based on any user inputs.
Unmonitored AI tools can lead to decisions or actions that undermine regulatory and corporate compliance measures, particularly in sectors where data handling and processing are tightly regulated, such as finance and healthcare. Generative AI models can perpetuate and amplify biases in training data when constructing output.
Key To Your Digital Success: Web Analytics Measurement Model. Measuring Incrementality: Controlled Experiments to the Rescue! Barriers To An Effective Web Measurement Strategy [+ Solutions!]. Web Data Quality: A 6 Step Process To Evolve Your Mental Model. How Do I Measure Success? What's The Fix?
Model interpretability continues to spark public discourse across the industry. We have covered model interpretability previously, including a proposed definition of machine learning (ML) interpretability. Yet there are tradeoffs to consider when selecting a model. Errors like these may occur when the model is being constructed.
In this example, the machine learning (ML) model struggles to differentiate between a chihuahua and a muffin. Will the model correctly determine it is a muffin, or get confused and think it is a chihuahua? The extent to which we can predict how the model will classify an image given a change in input. Model Visibility.
Certifications measure your knowledge and skills against industry- and vendor-specific benchmarks to prove to employers that you have the right skillset. They can also transform the data, create data models, visualize data, and share assets by using Power BI. The number of data analytics certs is expanding rapidly.
The only requirement is that your mental model (and indeed, company culture) should be solidly rooted in permission marketing. You just have to have the right mental model (see Seth Godin above) and you have to… wait for it… wait for it… measure everything you do! Just to ensure you are executing against your right mental model.
When we do planning sessions with our clients, two thirds of the solutions they need don’t necessarily fit the generative AI model. “However, foundational models will always have a place as the core backbone for the industry.” “There are a lot of cool AI solutions that are cheaper than generative AI,” Stephenson says.
Prioritising and measuring is key Generative AI represents a welcome shot in the arm for a sector in desperate need of efficiency and productivity gains. In the short term, healthcare CIOs need to focus on prioritising their use cases and ensuring they have a robust measuring framework in place to assess the results of trial deployment.
Instead, we focus on the case where an experimenter has decided to run a full traffic ramp-up experiment and wants to use the data from all of the epochs in the analysis. When there are changing assignment weights and time-based confounders, this complication must be considered either in the analysis or the experimental design.
“There’s value in that kind of tinkering and experimentation on the employee level, but you want to do it safely,” says Nick van der Meulen, a research scientist at MIT CISR. A large language model (LLM) used by a contact center to process the content and tone of conversations and provide real-time coaching to agents is a prime example.
Deloitte’s State of Generative AI in the Enterprise reports nearly 70% have moved 30% or fewer of their gen AI experiments into production, and 41% of organizations have struggled to define and measure the impacts of their gen AI efforts. Even this breakdown leaves out data management, engineering, and security functions.
Notable examples of AI safety incidents include: Trading algorithms causing market “flash crashes” ; Facial recognition systems leading to wrongful arrests ; Autonomous vehicle accidents ; AI models providing harmful or misleading information through social media channels.
“We’ve seen an ongoing iteration of experimentation with a number of promising pilots in production,” he says. “The power of AI and gen AI comes from the ability to share context with the model, so the model can understand your environment and be fine-tuned to give you better answers,” Franchetti says.
Data scientists at Bayer have developed several proofs of concept of generative AI models on the new platform that remain in the discovery and evaluation phase for “efficacy,” McQueen says, adding that the models won’t be in production until 2025. “The R&D pipeline is pretty highly confidential at this point,” he says. “It’s additive.”
“It’s embedded in the applications we use every day and the security model overall is pretty airtight.” Microsoft has also made investments beyond OpenAI, for example in Mistral and Meta’s Llama models, in its own small language models like Phi, and by partnering with providers like Cohere, Hugging Face, and Nvidia. “That’s risky.”