AI PMs should enter feature development and experimentation phases only after deciding what problem they want to solve as precisely as possible, and placing the problem into one of these categories. Experimentation: It’s just not possible to create a product by building, evaluating, and deploying a single model.
Piperr.io — Pre-built data pipelines across enterprise stakeholders, from IT to analytics, tech, data science, and LoBs. Prefect Technologies — Open-source data engineering platform that builds, tests, and runs data workflows. Genie — Distributed big data orchestration service by Netflix.
For container terminal operators, data-driven decision-making and efficient data sharing are vital to optimizing operations and boosting supply chain efficiency. Two use cases illustrate how this can be applied for business intelligence (BI) and data science applications, using AWS services such as Amazon Redshift and Amazon SageMaker.
Customers maintain multiple MWAA environments to separate development stages, optimize resources, manage versions, enhance security, ensure redundancy, customize settings, improve scalability, and facilitate experimentation. If you run a smaller environment class such as mw1.micro, remember to monitor its performance using the recommended metrics to maintain optimal operation.
Ideally, AI PMs would steer development teams to incorporate I/O validation into the initial build of the production system, along with the instrumentation needed to monitor model accuracy and other technical performance metrics. But in practice, it is common for model I/O validation steps to be added later, when scaling an AI product.
2) MLOps became the expected norm in machine learning and data science projects. MLOps takes the modeling, algorithms, and data wrangling out of the experimental "one-off" phase and moves the best models into a deployed, sustained operational phase. And the goodness doesn't stop there.
Why should CIOs bet on unifying their data and AI practices? In 2024, departments and teams experimented with gen AI tools tied to their workflows and operating metrics. This created fragmented practices in the interest of experimentation, rapid learning, and widespread adoption, and it paid back productivity dividends in many areas.
Savvy data scientists are already applying artificial intelligence and machine learning to accelerate the scope and scale of data-driven decisions in strategic organizations. These data science teams are seeing tremendous results—millions of dollars saved, new customers acquired, and new innovations that create a competitive advantage.
Today, we announced the latest release of Domino's data science platform, which represents a big step forward for enterprise data science teams. Domino's best-in-class Workbench is now even more powerful for data scientists.
Data science is an incredibly complex field. Framing data science projects within the four steps of the data science lifecycle (DSLC) makes it much easier to manage limited resources and control timelines, while ensuring projects meet or exceed the business requirements they were designed for.
Experiments, Parameters and Models at YouTube: the relationships between system parameters and metrics often seem simple — straight-line models sometimes fit our data well. The parameters are system inputs (e.g., the weight given to Likes in our video recommendation algorithm), while $Y$ is a vector of outcome measures such as different metrics of user experience.
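As a minimal sketch of what such a straight-line fit can look like (the parameter values and metric readings below are invented for illustration, not YouTube data):

```python
import numpy as np

# Hypothetical illustration (all values invented): fit a straight-line
# model relating one system parameter x (e.g., the weight given to Likes)
# to an observed outcome metric y, measured across experiment arms.
x = np.array([0.50, 0.75, 1.00, 1.25, 1.50])        # parameter values tested
y = np.array([0.112, 0.118, 0.121, 0.126, 0.130])   # metric observed per arm

slope, intercept = np.polyfit(x, y, deg=1)          # least-squares line
print(f"metric ~= {intercept:.4f} + {slope:.4f} * parameter")
```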
ML model builders spend a ton of time running multiple experiments in a data science notebook environment before moving the well-tested and robust models from those experiments to a secure, production-grade environment for general consumption. 42% of data scientists are solo practitioners or on teams of five or fewer people.
What is a data scientist? Data scientists are analytical data experts who use data science to discover insights from massive amounts of structured and unstructured data to help shape or meet specific business needs and goals.
This article presents a case study of how DataRobot was able to achieve high accuracy and low cost by applying techniques learned through data science competitions in the process of solving a DataRobot customer's problem.
Develop citizen data science and self-service capabilities: CIOs have embraced citizen data science because data visualization tools and other self-service business intelligence platforms are easy for business people to use and reduce the reporting and querying work IT departments used to support.
Model Observability – the ability to track key health and service metrics for models in production – remains a top priority for AI-enabled organizations. These accelerators are specifically designed to help organizations accelerate from data to results.
It’s all about using data to get a clearer understanding of reality so that your company can make more strategically sound decisions (instead of relying only on gut instinct or corporate inertia). Ultimately, business intelligence and analytics are about much more than the technology used to gather and analyze data.
In especially high demand are IT pros with software development, data science, and machine learning skills. While crucial, environmental metrics alone are not enough: organizations that monitor only them are missing critical pieces of a comprehensive environmental, social, and governance (ESG) program and are unable to fully understand their impacts.
XaaS models offer organizations greater predictability and transparency in cost management by providing detailed billing metrics and usage analytics. Accessing specialized expertise: implementing AI initiatives often requires specialized skills and expertise in areas such as data science, machine learning, and AI development.
Skomoroch proposes that managing ML projects is challenging for organizations because shipping ML projects requires an experimental culture that fundamentally changes how many companies approach building and shipping software. Another pattern that I’ve seen in good PMs is that they’re very metric-driven.
Joanne Friedman, PhD, CEO, and principal of smart manufacturing at Connektedminds, says orchestrating success in digital transformation requires a symphony of integration across disciplines: “CIOs face the challenge of harmonizing diverse disciplines like design thinking, product management, agile methodologies, and data science experimentation.”
To effectively leverage their predictive capabilities and maximize time-to-value, these companies need an ML infrastructure that allows them to quickly move models from data pipelines to experimentation and into the business. Finally, CML has built-in model security and governance thanks to Cloudera Shared Data Experience (SDX).
Nevertheless, A/B testing has challenges and blind spots, such as: the difficulty of identifying suitable metrics that give "works well" a measurable meaning; and accounting for effects "orthogonal" to the randomization used in experimentation.
A seamless user experience when deploying and monitoring DataRobot models to Snowflake; monitoring service health, drift, and accuracy of DataRobot models in Snowflake. “Organizations are looking for mature data science platforms that can scale to the size of their entire business.” Learn more about DataRobot hosted notebooks.
This list includes: Rachik Laouar, Head of Data Science for the Adecco Group. Rachik is working to transform that company’s products through data analytics and AI and will be speaking on the topic Executive Track: Turning an Industry Upside Down. Eric Weber is Head of Experimentation and Metrics for Yelp.
Approximating the region under the graph of $C(t)$ as a series of trapezoids and calculating the sum of their areas (in the case of non-uniformly distributed data points) is given by
$$\mathrm{AUC} = \sum_{i=1}^{n-1} \frac{(C_i + C_{i+1})\,(t_{i+1} - t_i)}{2}.$$
Having calculated AUC/AUMC, we can further derive a number of useful metrics, like: total clearance of the drug from plasma, mean residence time, and many others.
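A minimal sketch of this calculation, with invented concentration-time values:

```python
import numpy as np

# Hypothetical concentration-time data (non-uniform sampling), invented for illustration.
t = np.array([0.0, 0.5, 1.0, 2.0, 4.0, 8.0, 12.0])   # time points (h)
c = np.array([0.0, 4.2, 6.1, 5.3, 3.2, 1.1, 0.4])    # plasma concentration (mg/L)

# Trapezoidal rule, written out to mirror the formula above.
auc = np.sum((c[:-1] + c[1:]) * np.diff(t) / 2)

# AUMC applies the same rule to the first-moment curve t*C(t).
aumc = np.sum((t[:-1] * c[:-1] + t[1:] * c[1:]) * np.diff(t) / 2)

print(f"AUC  = {auc:.2f} mg*h/L")
print(f"AUMC = {aumc:.2f} mg*h^2/L")
print(f"Mean residence time ~ {aumc / auc:.2f} h")
```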
One of the simplest ways to start exploring your data is to aggregate the metrics you are interested in by their relevant dimensions. To better understand when the data should be grouped, you should be familiar with causal inference. How common is Simpson’s paradox? See “How likely is Simpson’s paradox?”
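A small, hypothetical example of why grouping matters: aggregated conversion rates can favor one variant while every per-dimension comparison favors the other (Simpson’s paradox). All numbers below are invented:

```python
import pandas as pd

# Invented experiment data: conversions by variant and platform.
df = pd.DataFrame({
    "variant":     ["A", "A", "B", "B"],
    "platform":    ["mobile", "desktop", "mobile", "desktop"],
    "users":       [800, 200, 200, 800],
    "conversions": [80,  40,  18,  140],
})

# Aggregated across platforms, B looks better overall (15.8% vs 12%)...
overall = df.groupby("variant")[["users", "conversions"]].sum()
print(overall["conversions"] / overall["users"])

# ...but within each platform, A converts better (10% vs 9% on mobile,
# 20% vs 17.5% on desktop): group by the relevant dimension before comparing.
by_dim = df.groupby(["variant", "platform"])[["users", "conversions"]].sum()
print(by_dim["conversions"] / by_dim["users"])
```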
DataRobot on Azure accelerates the machine learning lifecycle with advanced capabilities for rapid experimentation across new data sources and multiple problem types. The capability to rapidly build an AI-powered organization with industry-specific solutions and expertise.
It showcases the potential of Graviton2 in delivering enhanced price-performance ratios, making it an attractive choice for organizations seeking to optimize their big data workloads. GoDaddy benchmark: during our initial experimentation, we observed that arm64 on EMR Serverless consistently outperformed or performed on par with x86_64.
In the case of CDP Public Cloud, this includes virtual networking constructs and the data lake as provided by a combination of a Cloudera Shared Data Experience (SDX) and the underlying cloud storage. Each project consists of a declarative series of steps or operations that define the data science workflow.
Experimentation on networks: A/B testing is a standard method of measuring the effect of changes by randomizing samples into different treatment groups. However, the downside of using a larger unit of randomization is that we lose experimental power. Consider the case where experiment metrics are evaluated at the per-user level.
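As a rough sketch of the power loss, the standard design-effect formula 1 + (m - 1) * ICC inflates the variance of cluster-randomized estimates; the cluster size, ICC, and counts below are invented for illustration:

```python
# Sketch of the design effect under cluster randomization, assuming
# equal cluster sizes; all numbers are invented for illustration.
n_users = 100_000        # total users in the experiment
cluster_size = 50        # average users per cluster (e.g., a social group)
icc = 0.05               # intra-cluster correlation of the per-user metric

design_effect = 1 + (cluster_size - 1) * icc
effective_n = n_users / design_effect
print(f"Design effect: {design_effect:.2f}")
print(f"Effective sample size: {effective_n:,.0f} of {n_users:,} users")
```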
Data scientists and researchers require an extensive array of techniques, packages, and tools to accelerate core workflow tasks, including prepping, processing, and analyzing data. Utilizing NLP helps researchers and data scientists complete core tasks faster.
But what if users don't immediately uptake the new experimental version? Background: at Google, experimentation is an invaluable tool for making decisions and inferences about new products and features. Naturally, this issue is of particular concern to us in the Play Data Science team.
To help data scientists experiment faster, DataRobot has added Composable ML to automated machine learning. This allows data science teams to incorporate any machine learning algorithm or feature engineering method and seamlessly combine them with hundreds of built-in methods. So let’s dig in!
To figure this out, let's consider an appropriate experimental design. In other words, the teacher is our second kind of unit: the unit of experimentation. This type of experimental design is known as a group-randomized or cluster-randomized trial.
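A minimal sketch of analyzing such a cluster-randomized trial, aggregating student scores to teacher-level means before testing (all data simulated for illustration):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Invented data: 20 teachers (clusters), 25 students each; treatment is
# assigned at the teacher level, outcomes observed per student.
n_teachers, n_students = 20, 25
treated = np.repeat([0, 1], n_teachers // 2)
teacher_effect = rng.normal(0, 0.5, n_teachers)          # cluster-level noise
scores = (70 + 2.0 * treated[:, None]                    # simulated effect: +2 points
          + teacher_effect[:, None]
          + rng.normal(0, 5, (n_teachers, n_students)))  # student-level noise

# Analyze at the unit of randomization: compare teacher-level means,
# not pooled student scores, to get valid standard errors.
cluster_means = scores.mean(axis=1)
t, p = stats.ttest_ind(cluster_means[treated == 1], cluster_means[treated == 0])
print(f"t = {t:.2f}, p = {p:.3f}")
```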
Uber chose Presto for the flexibility it provides, with compute separated from data storage. As a result, they continue to expand their use cases to include ETL, data science, data exploration, online analytical processing (OLAP), data lake analytics, and federated queries.
Our post describes how we arrived at recent changes to design principles for the Google search page, and thus highlights aspects of a data scientist’s role which involve practicing the scientific method. There has been debate as to whether the term “data science” is necessary. Some don’t see the point.
According to Gartner, companies need to adopt these practices: build a culture of collaboration and experimentation, and start with a three-way partnership among the executives leading the digital initiative, line of business, and IT. Also, loyalty leaders infuse analytics into CX programs, including machine learning, data science, and data integration.
by AMIR NAJMI & MUKUND SUNDARARAJAN. Data science is about decision-making under uncertainty. This blog post introduces the notions of representational uncertainty and interventional uncertainty to paint a fuller picture of what the practicing data scientist is up against. Vignette: Data Science at fluff.ai
The first step in building an AI solution is identifying the problem you want to solve, which includes defining the metrics that will demonstrate whether you’ve succeeded. It sounds simplistic to state that AI product managers should develop and ship products that improve metrics the business cares about. Agreeing on metrics.
A geo experiment is an experiment where the experimental units are defined by geographic regions. Geos are non-overlapping, geo-targetable regions, which means it is possible to specify exactly in which geos an ad campaign will be served (or withheld, by turning campaigns off) – and to observe the ad spend and the response metric at the geo level.
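A minimal sketch of the geo-level bookkeeping this enables, with invented geos and numbers; real geo experiments use regression-based methods, so the simple ratio below is only illustrative:

```python
import pandas as pd

# Invented geo-level data: each row is one non-overlapping geo.
df = pd.DataFrame({
    "geo":      ["G1", "G2", "G3", "G4", "G5", "G6"],
    "group":    ["treatment", "control"] * 3,            # campaign on vs. off
    "spend":    [12.0, 0.0, 9.5, 0.0, 11.2, 0.0],        # ad spend ($k)
    "response": [105.0, 88.0, 97.0, 90.0, 110.0, 86.0],  # sales ($k)
})

# Both spend and response are observed per geo, so we can compare groups.
summary = df.groupby("group")[["spend", "response"]].mean()
delta = summary.loc["treatment"] - summary.loc["control"]
print(summary)
print(f"Incremental response per unit spend ~ {delta['response'] / delta['spend']:.2f}")
```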
By IVAN DIAZ & JOSEPH KELLY. Determining the causal effects of an action—which we call treatment—on an outcome of interest is at the heart of many data analysis efforts. In an ideal world, experimentation through randomization of the treatment assignment allows the identification and consistent estimation of causal effects.
Yet despite all this hard work, few models ever make it into production (VentureBeat AI concluded that just 13% of data science projects make it into production) and, in terms of delivering value to the business, Gartner predicts that only 20% of analytics projects will deliver business outcomes that improve performance.
by MICHAEL FORTE. Large-scale live experimentation is a big part of online product development. This means a small and growing product has to use experimentation differently and very carefully. This blog post is about experimentation in this regime. Such decisions involve an actual hypothesis test on specific metrics.
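A minimal sketch of such a hypothesis test on a per-user metric, with simulated data standing in for a small product's experiment:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)

# Invented per-user metric values; a small product has few users per arm.
control   = rng.normal(10.0, 4.0, 500)   # e.g., minutes of engagement
treatment = rng.normal(10.4, 4.0, 500)

# Two-sample t-test on the metric: with few users, even real effects may
# not reach significance, which is why small products must design and
# interpret experiments carefully.
t, p = stats.ttest_ind(treatment, control)
print(f"lift = {treatment.mean() - control.mean():.2f}, t = {t:.2f}, p = {p:.3f}")
```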