The first step in building an AI solution is identifying the problem you want to solve, which includes defining the metrics that will demonstrate whether you’ve succeeded. It sounds simplistic to state that AI product managers should develop and ship products that improve metrics the business cares about, yet agreeing on those metrics is where the work starts.
AI PMs should enter feature development and experimentation phases only after deciding what problem they want to solve as precisely as possible, and placing the problem into one of these categories. Experimentation: It’s just not possible to create a product by building, evaluating, and deploying a single model.
To win in business you need to follow this process: Metrics > Hypothesis > Experiment > Act. We are far too enamored with data collection and with reporting the standard metrics we love, because others love them, because someone else said they were nice so many years ago. This should not be news to you. But it is not routine.
Instead of writing code with hard-coded algorithms and rules that always behave in a predictable manner, ML engineers collect a large number of examples of input and output pairs and use them as training data for their models. The model is produced by code, but it isn’t code; it’s an artifact of the code and the training data.
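A minimal sketch of that contrast, using scikit-learn and an invented toy dataset (the tooling and data are illustrative, not from the article): the model's behavior comes from the example pairs, not from hand-written rules.

```python
# Illustrative only: behavior is learned from input/output examples,
# not written as explicit rules. Data and features are invented.
from sklearn.tree import DecisionTreeClassifier

# Toy training pairs: (hour_of_day, is_weekend) -> user_clicked
X_train = [[9, 0], [14, 0], [20, 1], [22, 1], [8, 1], [13, 0]]
y_train = [0, 0, 1, 1, 0, 0]

model = DecisionTreeClassifier().fit(X_train, y_train)

# The trained model is an artifact of code plus data: retraining on
# different examples changes its behavior with no code change.
print(model.predict([[21, 1]]))
```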
The only requirement is that your mental model (and indeed, your company culture) should be solidly rooted in permission marketing. You just have to have the right mental model (see Seth Godin above) and you have to… wait for it… wait for it… measure everything you do, just to ensure you are executing against that mental model.
This post is a primer on the delightful world of testing and experimentation (A/B, multivariate, and a new term from me: Experience Testing). Experimentation and testing help us figure out where we are wrong, quickly and repeatedly, and if you think about it, that is a great thing for our customers and for our employers. Counter claims?
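For readers who want to see the mechanics, here is a minimal sketch of how a basic A/B comparison can be scored with a two-proportion z-test; the conversion counts are invented, and this is one common approach rather than the post's prescribed method.

```python
# Minimal two-variant (A/B) comparison using a two-proportion z-test.
# Conversion counts below are invented for illustration.
from math import sqrt
from statistics import NormalDist

def ab_test(conv_a, n_a, conv_b, n_b):
    p_a, p_b = conv_a / n_a, conv_b / n_b
    # Pooled conversion rate under the null hypothesis (no difference)
    p_pool = (conv_a + conv_b) / (n_a + n_b)
    se = sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    # Two-sided p-value from the standard normal distribution
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return p_a, p_b, z, p_value

print(ab_test(conv_a=120, n_a=2400, conv_b=150, n_b=2400))
```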
Centralizing analytics helps the organization standardize enterprise-wide measurements and metrics. With a standard metric supported by a centralized technical team, the organization maintains consistency in analytics. The center of excellence (COE) model leverages the DataOps team to solve real-world challenges.
Few things in analytics are harder than multi-channel attribution modeling. By the time you are done with this post you'll have complete knowledge of what's ugly and bad when it comes to attribution modeling. You'll know how to use the good model, even if it is far from perfect. Multi-Channel Attribution Models: the Linear Attribution Model.
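To make the linear model concrete, here is a minimal sketch of its bookkeeping, assuming invented path data: every channel touched on a converting path receives an equal share of the credit.

```python
# Linear attribution: split each conversion's value equally across
# every channel on the converting path. Path data is invented.
from collections import defaultdict

def linear_attribution(paths):
    """paths: list of (channels_touched, conversion_value) tuples."""
    credit = defaultdict(float)
    for channels, value in paths:
        share = value / len(channels)
        for channel in channels:
            credit[channel] += share
    return dict(credit)

paths = [
    (["organic search", "email", "paid search"], 90.0),
    (["social", "email"], 40.0),
]
print(linear_attribution(paths))
# {'organic search': 30.0, 'email': 50.0, 'paid search': 30.0, 'social': 20.0}
```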
They will need two different implementations, and it is quite likely that you will end up with two sets of metrics (more people-focused for mobile apps, more visit-focused for sites). Media-mix modeling/experimentation. Mobile content consumption and behavior along key metrics (time, bounces, etc.). First, a quick techie lesson.
Similarly, in “Building Machine Learning Powered Applications: Going from Idea to Product,” Emmanuel Ameisen states: “Indeed, exposing a model to users in production comes with a set of challenges that mirrors the ones that come with debugging a model.”
DataOps needs a directed graph-based workflow that contains all the data access, integration, model and visualization steps in the data analytic production process. A complete DataOps program will have a unified, system-wide view of process metrics using a common data store. Datatron — Automates deployment and monitoring of AI models.
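One minimal way to picture such a directed-graph workflow, using illustrative step names rather than any particular DataOps product's API:

```python
# Sketch of a directed-graph pipeline: each step declares the steps
# it depends on, and a topological sort yields a valid execution order.
from graphlib import TopologicalSorter  # Python 3.9+

pipeline = {
    "access":    [],              # pull raw data
    "integrate": ["access"],      # join and clean sources
    "model":     ["integrate"],   # train or score models
    "visualize": ["model"],       # publish dashboards
}

ts = TopologicalSorter({step: set(deps) for step, deps in pipeline.items()})
for step in ts.static_order():
    print("running:", step)
```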
MLOps takes the modeling, algorithms, and data wrangling out of the experimental “one-off” phase and moves the best models into a deployed, sustained operational phase, including the monitoring of very important operational ML characteristics: data drift, concept drift, and model security.
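As one illustration of what drift monitoring can look like, here is a minimal sketch that flags data drift by comparing a feature's training distribution to its live distribution with a two-sample Kolmogorov-Smirnov test; the data and threshold are invented, and production MLOps stacks typically wrap this kind of check in scheduled jobs and alerting.

```python
# Minimal data-drift check via a two-sample Kolmogorov-Smirnov test.
import numpy as np
from scipy.stats import ks_2samp

rng = np.random.default_rng(0)
train_feature = rng.normal(loc=0.0, scale=1.0, size=5000)  # training-time data
live_feature = rng.normal(loc=0.3, scale=1.0, size=5000)   # shifted: drift

stat, p_value = ks_2samp(train_feature, live_feature)
if p_value < 0.01:  # illustrative threshold
    print(f"drift detected (KS={stat:.3f}, p={p_value:.2e})")
```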
Yehoshua, I've covered this topic in detail in this blog post: Multi-Channel Attribution: Definitions, Models and a Reality Check. I explain three different models (Online to Store, Across Multiple Devices, Across Digital Channels), and for each I've highlighted the key considerations. That means: all of these metrics are off.
Other organizations are just discovering how to apply AI to accelerate experimentation time frames and find the best models to produce results. Taking a Multi-Tiered Approach to Model Risk Management. Learn how to leverage Google BigQuery large datasets for large scale Time Series forecasting models in the DataRobot AI platform.
Relatively few respondents are using version control for data and models. Tools for versioning data and models are still immature, but they’re critical for making AI results reproducible and reliable. The biggest skills gaps were ML modelers and data scientists (52%), understanding business use cases (49%), and data engineering (42%).
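While mature tooling is still emerging, the core idea of data versioning can be sketched in a few lines: identify each dataset snapshot by a content hash, so a model's results can be tied to the exact bytes that trained it. The path below is hypothetical, and this is a bare-bones illustration rather than any specific tool's design.

```python
# Content-addressed data versioning in miniature: any change to the
# data yields a new, reproducible version ID.
import hashlib
from pathlib import Path

def dataset_version(path: str) -> str:
    digest = hashlib.sha256(Path(path).read_bytes()).hexdigest()
    return digest[:12]  # short ID, similar in spirit to a git commit

# Usage (the path is hypothetical):
# print(dataset_version("data/train.csv"))
```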
Structure your metrics. As with any report you might need to create, structuring and implementing metrics that will tell an interesting and educational data story is crucial in our digital age; that way you can choose the best possible metrics for your case. Regularly monitor your data. 1) Marketing CMO report.
A medical, insurance, or financial large language model (LLM) AI, built from scratch, can cost up to $20 million. Still, a 30% failure rate represents a huge amount of time and money, given how widespread AI experimentation is today. If a project isn’t hitting the metrics, the teams can decide whether to dump it or give it more time.
Experiments, Parameters and Models. At YouTube, the relationships between system parameters and metrics often seem simple; straight-line models sometimes fit our data well. Here $X$ is a vector of system parameters (e.g., the weight given to Likes in our video recommendation algorithm) while $Y$ is a vector of outcome measures, such as different metrics of user experience.
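A minimal sketch of such a straight-line fit for a single parameter-metric pair; the parameter name and all values are invented, not from the post.

```python
# Fit a straight line between one system parameter and one metric,
# in the spirit of "straight-line models sometimes fit our data well".
import numpy as np

like_weight = np.array([0.5, 0.75, 1.0, 1.25, 1.5])    # experiment arms
watch_time = np.array([10.1, 10.4, 10.8, 11.1, 11.5])  # observed metric

slope, intercept = np.polyfit(like_weight, watch_time, deg=1)
print(f"metric ~ {intercept:.2f} + {slope:.2f} * parameter")
```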
Cloud maturity models are a useful tool for addressing these concerns, grounding organizational cloud strategy and proceeding confidently in cloud adoption with a plan. Cloud maturity models (or CMMs) are frameworks for evaluating an organization’s cloud adoption readiness on both a macro and individual service level.
The excerpt covers how to create word vectors and utilize them as an input into a deep learning model. While the field of computational linguistics, or Natural Language Processing (NLP), has been around for decades, the increased interest in and use of deep learning models has also propelled applications of NLP forward within industry.
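A minimal sketch of the word-vector idea, with a toy vocabulary and random stand-ins for trained embeddings; real applications would load vectors from word2vec, GloVe, or a learned embedding layer.

```python
# Turn words into vectors and pool them into a fixed-size model input.
import numpy as np

vocab = {"the": 0, "model": 1, "reads": 2, "words": 3}
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(len(vocab), 8))  # 8-dim toy vectors

def sentence_vector(tokens):
    # Average the word vectors: a simple fixed-size input for a model.
    idx = [vocab[t] for t in tokens if t in vocab]
    return embeddings[idx].mean(axis=0)

print(sentence_vector(["the", "model", "reads", "words"]).shape)  # (8,)
```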
In recent years, we have witnessed a tidal wave of progress and excitement around large language models (LLMs) such as ChatGPT and GPT-4. In short, providers must demonstrate that their models are safe and effective.
Building models requires a lot of time and effort. Data scientists can spend weeks just trying to find, capture and transform data into decent features for models, not to mention many cycles of training, tuning, and tweaking models so they’re performant. This means many projects get stuck in endless research and experimentation.
During the summer of 2023, at the height of the first wave of interest in generative AI, LinkedIn began to wonder whether matching candidates with employers and making feeds more useful would be better served with the help of large language models (LLMs).
Beyond that, we recommend setting up the appropriate data management and engineering framework including infrastructure, harmonization, governance, toolset strategy, automation, and operating model. It is also important to have a strong test and learn culture to encourage rapid experimentation.
" ~ Web Metrics: "What is a KPI? " + Standard Metrics Revisited Series. Key To Your Digital Success: Web Analytics Measurement Model. Web Data Quality: A 6 Step Process To Evolve Your Mental Model. "Engagement" Is Not A Metric, It's An Excuse. Experimentation and Testing: A Primer.
Gen AI takes us from single-use models of machine learning (ML) to AI tools that promise to be a platform with uses in many areas, but you still need to validate they’re appropriate for the problems you want solved, and that your users know how to use gen AI effectively. Pilots can offer value beyond just experimentation, of course.
Most, if not all, machine learning (ML) models running in production today were born in notebooks. Data science teams of all sizes need a productive, collaborative method for rapid AI experimentation. Capabilities Beyond Classic Jupyter for End-to-end Experimentation. Auto-scale compute.
A virtual assistant may save employees time when searching for old documents or composing emails, but most organizations have no idea how much time those tasks have taken historically, having never tracked such metrics before, she says. However, foundational models will always have a place as the core backbone for the industry.”
In this example, the machine learning (ML) model struggles to differentiate between a chihuahua and a muffin. Will the model correctly determine it is a muffin, or get confused and think it is a chihuahua? The extent to which we can predict how the model will classify an image given a change in input is a question of model visibility.
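As a toy illustration of that question (not the article's actual model), here is a sketch showing how a small shift in a classifier's logits, standing in for a small change to the input image, can flip the predicted label; all numbers are invented.

```python
# A classifier outputs a probability per class; near the decision
# boundary, a small input change can flip the predicted label.
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

classes = ["chihuahua", "muffin"]
logits_original = np.array([2.1, 1.9])   # a close call
logits_perturbed = np.array([1.8, 2.2])  # small change flips the label

for name, logits in [("original", logits_original),
                     ("perturbed", logits_perturbed)]:
    probs = softmax(logits)
    print(name, classes[int(np.argmax(probs))], np.round(probs, 3))
```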
In the context of Retrieval-Augmented Generation (RAG), knowledge retrieval plays a crucial role, because the effectiveness of retrieval directly impacts the maximum potential of large language model (LLM) generation. The reported gains run up to ~20% higher NDCG@10 over document-only and bi-encoder baselines, comparable to the TAS-B dense vector model.
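For reference, NDCG@10 (the metric cited above) can be computed as follows; this is the standard textbook formulation with invented relevance grades, not code from the post.

```python
# NDCG@k: discounted gain of the returned ranking, normalized by the
# gain of the ideal (best possible) ranking of the same items.
import numpy as np

def ndcg_at_k(relevances, k=10):
    rel = np.asarray(relevances, dtype=float)[:k]
    discounts = 1.0 / np.log2(np.arange(2, rel.size + 2))
    dcg = float((rel * discounts).sum())
    ideal = np.sort(np.asarray(relevances, dtype=float))[::-1][:k]
    idcg = float((ideal * discounts[:ideal.size]).sum())
    return dcg / idcg if idcg > 0 else 0.0

# Ranked relevance grades returned by a retriever for one query:
print(round(ndcg_at_k([3, 2, 0, 1, 0, 0, 0, 0, 0, 0]), 3))
```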
There is a tendency to think experimentation and testing is optional. So as my tiny gift for you here are five experimentation and testing ideas for you. You can of course test different pretty images, why not try to reinvent your business model using testing? Rather than create prediction models (with faulty assumptions!)
Healthcare Domain Expertise: It cannot be said enough that anyone developing AI-driven models for healthcare needs to understand the unique use cases and stringent data security and privacy requirements – and the detailed nuances of how this information will be used – in the specific healthcare setting where the technology will be deployed.
By HENNING HOHNHOLD, DEIRDRE O'BRIEN, and DIANE TANG. In this post we discuss the challenges in measuring and modeling the long-term effect of ads on user behavior. We describe experiment designs which have proven effective for us and discuss the subtleties of trying to generalize the results via modeling.
Organizations rolling out AI tools first need to set reasonable expectations and establish key metrics to measure the value of the deployment, he says. Sax has deployed AI on several internal projects, including its help desk functions, with the company training and customizing AI models itself.
Skomoroch proposes that managing ML projects is challenging for organizations because shipping ML projects requires an experimental culture that fundamentally changes how many companies approach building and shipping software. Another pattern that I’ve seen in good PMs is that they’re very metric-driven.
I've gone through the five stages in the Kübler-Ross model. [Bonus: for more on next steps and attribution modeling, please see Multi-Channel Attribution Modeling: The Good, Bad and Ugly Models.] And of course our Acquisition, Behavior, Outcome metrics. Controlled experimentation. See Page Value there?
So, to maximize the ROI of gen AI efforts and investments, it’s important to move from ad-hoc experimentation to a more purposeful strategy and systematic approach to implementation. Define which strategic themes relate to your business model, processes, products, and services. This may impact some of your vendor selections as well.
For example, top researchers at Florida State University are now developing innovative large language models (LLMs) to help advance research in areas like material science and healthcare, going beyond gen AI used by the general public. “We developed a model to predict student outcomes based on metrics from historical evidence,” he says.
Because every tool uses its own sweet metrics definitions, cookie rules, session start and end rules, and so much more. If you don't kill 25% of your metrics each year, you are doing something wrong. Why do you think introducing a completely different set of numbers (success measures) is going to make your life easier? Likely not.
Why model-driven AI falls short of delivering value: teams that focus solely on model performance, using model-centric and data-centric ML, risk missing the big-picture business context. We are also thrilled to share the innovations and capabilities that we have developed at DataRobot to meet and exceed those requirements.
Frameworks, because if I can teach someone a new mental model, a different way of thinking, they can be incredibly successful. How people across the company are organized and incentivized (as in, what metrics determine their bonus). Make sure your executive dashboards obsess about acquisition, behavior, and outcome metrics. So fix that.
“With traditional OCR and AI models, you might get 60% straight-through processing, 70% if you’re lucky, but now generative AI solves all of the edge cases, and your processing rates go up to 99%,” Beckley says. In 2024, departments and teams experimented with gen AI tools tied to their workflows and operating metrics.
After transforming their organization’s operating model, realigning teams to products rather than to projects , CIOs we consult arrive at an inevitable question: “What next?” Splitting these responsibilities without a clear vision and careful plan, however, can spell disaster, reversing the progress begotten by a new operating model.
How do you track the integrity of a machine learning model in production? Model Observability can help. By tracking service, drift, prediction data, training data, and custom metrics, you can keep your models and predictions relevant in a fast-changing world. Model Observability Features.
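As one sketch of a custom drift metric teams often track, here is a minimal population stability index (PSI) computation; the bins, thresholds, and score distributions are invented, and this is a generic illustration rather than any product's feature.

```python
# Population stability index (PSI): compares the distribution of live
# prediction scores against the training-time distribution.
import numpy as np

def psi(expected, actual, bins=10):
    # Bin edges from quantiles of the expected (training) distribution
    cuts = np.quantile(expected, np.linspace(0, 1, bins + 1))
    cuts[0], cuts[-1] = -np.inf, np.inf
    e_pct = np.histogram(expected, cuts)[0] / len(expected) + 1e-6
    a_pct = np.histogram(actual, cuts)[0] / len(actual) + 1e-6
    return float(((a_pct - e_pct) * np.log(a_pct / e_pct)).sum())

rng = np.random.default_rng(1)
train_scores = rng.beta(2, 5, 10_000)    # scores at training time
live_scores = rng.beta(2.5, 5, 10_000)   # scores in production
print(f"PSI = {psi(train_scores, live_scores):.3f}")  # > 0.2 often flags drift
```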