Data Processing, Experimentation and Testing

Experimentation and Testing: A Primer

Occam's Razor

MAY 22, 2006

This post is a primer on the delightful world of testing and experimentation (A/B, Multivariate, and a new term from me: Experience Testing). Experimentation and testing help us figure out we are wrong, quickly and repeatedly and if you think about it that is a great thing for our customers, and for our employers.

Experimentation

Experimentation Testing Optimization Measurement

The DataOps Vendor Landscape, 2021

DataKitchen

APRIL 13, 2021

Testing and Data Observability. It orchestrates complex pipelines, toolchains, and tests across teams, locations, and data centers. Prefect Technologies — Open-source data engineering platform that builds, tests, and runs data workflows. Testing and Data Observability. Production Monitoring and Development Testing.

Testing

Testing Machine Learning Consulting Data Quality

US Air Force seeks generative AI test pilots

CIO Business Intelligence

JUNE 13, 2024

Proof that even the most rigid of organizations are willing to explore generative AI arrived this week when the US Department of the Air Force (DAF) launched an experimental initiative aimed at Guardians, Airmen, civilian employees, and contractors. For now, AFRL is experimenting with self-hosted open-source LLMs in a controlled environment.

Testing

Testing Experimentation Data Processing Modeling

Webinars

How to Achieve High-Accuracy Results When Using LLMs

MORE WEBINARS

Apply Modern CRM Dashboards & Reports Into Your Business – Examples & Templates

datapine

MAY 20, 2020

At its core, CRM dashboard software is a smart vessel for data analytics and business intelligence – digital innovation that hosts a wealth of insightful CRM reports. This most value-driven CRM dashboard and a powerful piece of CRM reporting software host a cohesive mix of visual KPIs. Test, tweak, evolve. Sales Activity.

Dashboards

Dashboards Reporting KPI Visualization

What you need to know about product management for AI

O'Reilly on Data

MARCH 31, 2020

But there’s a host of new challenges when it comes to managing AI projects: more unknowns, non-deterministic outcomes, new infrastructures, new processes and new tools. This has serious implications for software testing, versioning, deployment, and other core development processes.

Management

Management Machine Learning Experimentation Metrics

The mainframe is dying: Long live the mainframe application!

CIO Business Intelligence

MARCH 10, 2022

Instead, it’s targeting test and development functions, with the goal of making it easier for enterprises to set up such environments whenever they need them, without having to leave costly excess mainframe capacity sitting idle the rest of the time.

Sales

Sales Testing Experimentation Data Processing

Changing assignment weights with time-based confounders

The Unofficial Google Data Science Blog

JULY 22, 2020

Another reason to use ramp-up is to test if a website's infrastructure can handle deploying a new arm to all of its users. For example, consider a smaller website that is considering adding a video hosting feature to increase engagement on the site. Here, day-of-week is a time-based confounder.

Experimentation

Experimentation Statistics Testing Knowledge Discovery

3 steps to eliminate shadow AI

CIO Business Intelligence

SEPTEMBER 12, 2024

These same decision-makers identify a host of challenges in implementing generative AI, so chances are that a significant portion of use is “unsanctioned.” If the code isn’t appropriately tested and validated, the software in which it’s embedded may be unstable or error-prone, presenting long-term maintenance issues and costs.

Experimentation

Experimentation Risk Cost-Benefit Strategy

DataRobot Notebooks: Enhanced Code-First Experience for Rapid AI Experimentation

DataRobot Blog

JANUARY 10, 2023

ML model builders spend a ton of time running multiple experiments in a data science notebook environment before moving the well-tested and robust models from those experiments to a secure, production-grade environment for general consumption. A host of open-source libraries. Deep Dive into DataRobot Notebooks. Auto-scale compute.

Experimentation

Experimentation Machine Learning Data Science Modeling

6 best practices to develop a corporate use policy for generative AI

CIO Business Intelligence

APRIL 14, 2023

In fact, it’s likely your organization has a large number of employees currently experimenting with generative AI, and as this activity moves from experimentation to real-life deployment, it’s important to be proactive before unintended consequences happen. This may include developing training videos and hosting live sessions.

Risk

Risk Experimentation Marketing Risk Management

Try semantic search with the Amazon OpenSearch Service vector engine

AWS Big Data

AUGUST 21, 2023

For the demo, we’re using the Amazon Titan foundation model hosted on Amazon Bedrock for embeddings, with no fine tuning. Amazon OpenSearch Service has long supported both lexical and vector search, since the introduction of its kNN plugin in 2020. With OpenSearch’s Search Comparison Tool , you can compare the different approaches.

Data Processing

Data Processing Visualization Experimentation Metrics

What’s new with Amazon MWAA support for Apache Airflow version 2.4.3

AWS Big Data

MAY 2, 2023

The workflow steps are as follows: The producer DAG makes an API call to a publicly hosted API to retrieve data. Test the feature To test this feature, run the producer DAG. Removal of experimental Smart Sensors. Test the feature Upload the four sample text files from the local data folder to an S3 bucket data folder.

Testing

Testing Experimentation Management Metadata

Rebranding IT for the modernized IT mission

CIO Business Intelligence

JUNE 11, 2024

At a recent Coffee with Digital Trailblazers event that I host on Fridays at 11 am ET, we debated not if but when and how top CIOs should rebrand and recast IT’s mission. Recasting the mission requires a steadfast commitment to retaining top talent, fostering transformational leadership, and nurturing the careers of digital trailblazers.”

IT

IT Digital Transformation Broadcasting Business Objectives

Advancing AI Cloud with Release 7.2

DataRobot

SEPTEMBER 14, 2021

And for those that do make it past the experimental stage, it typically takes over 18 months for the value to be realized. Even models that are tested for bias during the development can become biased once in production. DataRobot also now has an integrated and cloud-hosted notebook solution from our recent acquisition of Zepl.

Machine Learning

Machine Learning Experimentation Modeling Visualization

Keys to AI Success for IT Staff

DataRobot Blog

FEBRUARY 9, 2022

This approach gives freedom to move its AI artifacts around, regardless of whether they are hosted on a major cloud platform or its own on-premise infrastructure. Machine learning operations (MLOps) solutions allow all models to be monitored from a central location, regardless of where they are hosted or deployed.

IT

IT Experimentation Machine Learning Data Processing

Deploy and Scale AI Applications With Cloudera AI Inference Service

Cloudera

OCTOBER 8, 2024

By 2023, the focus shifted towards experimentation. Typically, organizations approach generative AI POCs in one of two ways: by using third-party services, which are easy to implement but require sharing private data externally, or by developing self-hosted solutions using a mix of open-source and commercial tools.

Optimization

Optimization Experimentation Metrics Enterprise

Who Owns Web Analytics? A Framework For Critical Thinking.

Occam's Razor

DECEMBER 9, 2009

The four pronged real world tested probing and loaded with politics framework to find a home for Web Analytics: 1. Who owns the power to make changes to the site (not who owns updating pages or hosting the site)? I hope it is of value to you all (and now you don't have to pay me large sums of money to do this for you!).

Analytics

Analytics Data-driven Marketing Sales

Themes and Conferences per Pacoid, Episode 9

Domino Data Lab

MAY 8, 2019

At CMU I joined a panel hosted by Zachary Lipton where someone in the audience asked a question about machine learning model interpretation. They also require advanced skills in statistics, experimental design, causal inference, and so on – more than most data science teams will have. Let’s look through some antidotes.

Machine Learning

Machine Learning Data Science Modeling Visualization

How Swisscom automated Amazon Redshift as part of their One Data Platform solution using AWS CDK – Part 1

AWS Big Data

JUNE 12, 2024

This module is experimental and under active development and may have changes that aren’t backward compatible. This module provides higher-level constructs (specifically, Layer 2 constructs ), including convenience and helper methods, as well as sensible default values. cluster = aws_redshift_alpha.Cluster( scope, cluster_identifier, #.

Data Architecture

Data Architecture Cost-Benefit Data-driven Experimentation

Six Nudges: Creating A Sense Of Urgency For Higher Conversion Rates!

Occam's Razor

JUNE 4, 2018

Social cues (/proof) can help create a sense of urgency for a whole host of companies. Such is the case with A/B testing. 800 plusses on Google+. Ok, so maybe not Google+ (I was genuinely excited about it, I am sad it died). But you get the idea. Yet, I bet you’ve rarely seen the use of this aggregated information to deliver nudges.

Strategy

Strategy Cost-Benefit Testing Sales

Make Better Data-Driven Decisions with DataRobot AI Platform Single-Tenant SaaS on Microsoft Azure

DataRobot Blog

MARCH 7, 2023

DataRobot on Azure accelerates the machine learning lifecycle with advanced capabilities for rapid experimentation across new data sources and multiple problem types. Models trained in DataRobot can also be easily deployed to Azure Machine Learning, allowing users to host models easier in a secure way.

Data-driven

Data-driven Machine Learning Experimentation Data Lake

Amazon OpenSearch Service search enhancements: 2023 roundup

AWS Big Data

JANUARY 9, 2024

This functionality was initially released as experimental in OpenSearch Service version 2.4, For instance, you can connect to external ML models hosted on Amazon SageMaker , which provides comprehensive capabilities to manage models successfully in production. and is now generally available with version 2.9.

Visualization

Visualization Cost-Benefit Modeling Machine Learning

Getting ready for artificial general intelligence with examples

IBM Big Data Hub

APRIL 18, 2024

While leaders have some reservations about the benefits of current AI, organizations are actively investing in gen AI deployment, significantly increasing budgets, expanding use cases, and transitioning projects from experimentation to production. This personalized approach might lead to more effective therapies with fewer side effects.

Cost-Benefit

Cost-Benefit Manufacturing Modeling Interactive

Improving Multi-tenancy with Virtual Private Clusters

Cloudera

JUNE 6, 2019

The typical Cloudera Enterprise Data Hub Cluster starts with a few dozen nodes in the customer’s datacenter hosting a variety of distributed services. We sometimes refer to this as splitting “dev/test” from “production” workloads, but we can generalize the approach by referring to the overall priority of the workload for the business.

Metadata

Metadata Data Lake Optimization Strategy

New DataRobot and Snowflake Integrations: Seamless Data Prep, Model Deployment, and Monitoring

DataRobot Blog

MARCH 16, 2023

We’ve tightened the loop between ML data prep , experimentation and testing all the way through to putting models into production. Secure, Seamless, and Scalable ML Data Preparation and Experimentation Now DataRobot and Snowflake customers can maximize their return on investment in AI and their cloud data platform.

Modeling

Modeling Experimentation Machine Learning Data-driven

Teaching AI to Smell by Using DataRobot

DataRobot

JUNE 10, 2021

Traditionally, experimentation and observation was the only way to understand the physical-chemical properties of the molecule. To foster innovation in this area, AICrowd hosted a competition to predict the olfactory properties of a molecule. Below are the per-label metrics provided by DataRobot for model evaluation purposes.

Metrics

Metrics Machine Learning Visualization Experimentation

Cloud Native vs. Cloud Enabled: What’s the Difference?

Alation

DECEMBER 14, 2021

These systems offer numerous web-centric features that bolster customer service and engagement, provide server scalability during periods of fluctuating traffic, and allow easy experimentation with new technologies and promotional strategies. Cloud testing. What is cloud-hosted? Optimized business continuity. Cloud performance.

Cost-Benefit

Cost-Benefit Data Processing Optimization Consulting

Themes and Conferences per Pacoid, Episode 11

Domino Data Lab

JULY 2, 2019

several aspects of that earlier U Washington project seem remarkably similar, including the experimental design, train/test data source, and even the slides. OSCON , Jul 15-18 in Portland – come to the “ML Ops: Managing the end-to-end ML lifecycle” track that I’ll be hosting on Jul 16! PyBay , Aug 15-18, SF.

Metadata

Metadata Data Science Machine Learning Data-driven

Introducing the vector engine for Amazon OpenSearch Serverless, now in preview

AWS Big Data

JULY 26, 2023

You can choose to host your collection on a public endpoint or within a VPC. From preview to GA and beyond Today, we are excited to announce the preview of the vector engine, making it available for you to begin testing it out immediately. All the data in the vector engine is encrypted in transit and at rest by default.

Metadata

Metadata Cost-Benefit Testing Metrics

How generative AI impacts your digital transformation priorities

CIO Business Intelligence

AUGUST 1, 2023

Define a game-changing LLM strategy At a recent Coffee with Digital Trailblazers I hosted, we discussed how generative AI and LLMs will impact every industry. Mitigate risks by communicating an LLM governance model The generative AI landscape has more than 100 tools covering test, image, video, code, speech, and other categories.

Digital Transformation

Digital Transformation Unstructured Data Strategy Data Science

Bridging the AI Learning Gap

O'Reilly on Data

FEBRUARY 13, 2025

Designing an effective AI learning path that worked with the Head First methodwhich engages readers through active learning and interactive puzzles, exercises, and other elementstook months of intense research and experimentation. In fact, I realized that I could test my exercises by pasting them verbatim into an AI.

Testing

Testing Experimentation Data Processing Interactive

Under increasing pressure, how can CIOs convince CFOs to invest in AI?

CIO Business Intelligence

FEBRUARY 5, 2025

How to know what to prioritize AI has made remarkable strides over the past year, but its adoption has also uncovered a host of shortcomings like dangerous hallucinations and expensive implementation. Companies need to focus on goals, testing, and people in their effort to determine if an AI project is viable.

Cost-Benefit

Cost-Benefit ROI Metrics Measurement

Digital Analytics + Marketing Career Advice: Your Now, Next, Long Plan

Occam's Razor

OCTOBER 12, 2017

The tiny downside of this is that our parents likely never had to invest as much in constant education, experimentation and self-driven investment in core skills. When you go to the interview, the hiring company will proceed to ask questions that test your competency in the listed job requirements. This is normal.

Marketing

Marketing Analytics Machine Learning Strategy

Data Leaders Brief

Experimentation and Testing: A Primer

The DataOps Vendor Landscape, 2021

Webinars

Trending Sources

US Air Force seeks generative AI test pilots

Webinars

Apply Modern CRM Dashboards & Reports Into Your Business – Examples & Templates

What you need to know about product management for AI

The mainframe is dying: Long live the mainframe application!

Changing assignment weights with time-based confounders

3 steps to eliminate shadow AI

DataRobot Notebooks: Enhanced Code-First Experience for Rapid AI Experimentation

6 best practices to develop a corporate use policy for generative AI

Try semantic search with the Amazon OpenSearch Service vector engine

What’s new with Amazon MWAA support for Apache Airflow version 2.4.3

Rebranding IT for the modernized IT mission

Advancing AI Cloud with Release 7.2

Keys to AI Success for IT Staff

Deploy and Scale AI Applications With Cloudera AI Inference Service

Who Owns Web Analytics? A Framework For Critical Thinking.

Themes and Conferences per Pacoid, Episode 9

How Swisscom automated Amazon Redshift as part of their One Data Platform solution using AWS CDK – Part 1

Six Nudges: Creating A Sense Of Urgency For Higher Conversion Rates!

Make Better Data-Driven Decisions with DataRobot AI Platform Single-Tenant SaaS on Microsoft Azure

Amazon OpenSearch Service search enhancements: 2023 roundup

Getting ready for artificial general intelligence with examples

Improving Multi-tenancy with Virtual Private Clusters

New DataRobot and Snowflake Integrations: Seamless Data Prep, Model Deployment, and Monitoring

Teaching AI to Smell by Using DataRobot

Cloud Native vs. Cloud Enabled: What’s the Difference?

Themes and Conferences per Pacoid, Episode 11

Introducing the vector engine for Amazon OpenSearch Serverless, now in preview

How generative AI impacts your digital transformation priorities

Bridging the AI Learning Gap

Under increasing pressure, how can CIOs convince CFOs to invest in AI?

Digital Analytics + Marketing Career Advice: Your Now, Next, Long Plan

Stay Connected