ML apps needed to be developed through cycles of experimentation (as we're no longer able to reason about how they'll behave based on software specs alone). The skillset and background of the people building the applications were realigned: people who were at home with data and experimentation got involved. How will you measure success?
Deloitte's State of Generative AI in the Enterprise reports that nearly 70% of organizations have moved 30% or fewer of their gen AI experiments into production, and 41% of organizations have struggled to define and measure the impact of their gen AI efforts.
Documentation and diagrams transform abstract discussions into something tangible. By articulating fitness functions (automated tests tied to specific quality attributes like reliability, security, or performance), teams can visualize and measure system qualities that align with business goals.
Since ChatGPT is built from large language models that are trained against massive data sets (mostly business documents, internal text repositories, and similar resources) within your organization, attention must be given to the stability, accessibility, and reliability of those resources. Test early and often.
This article goes behind the scenes on what's fueling Block's investment in developer experience, key initiatives including the role of an engineering intelligence platform, and how the company measures and drives success. These select choices can then be of high quality: well-supported, documented, maintained, secure, and reliable.
Pilots can offer value beyond just experimentation, of course. McKinsey reports that industrial design teams using LLM-powered summaries of user research and AI-generated images for ideation and experimentation sometimes see reductions of up to 70% in product development cycle times. What are you measuring?
In an incident management blog post, Atlassian defines SLOs as: “the individual promises you’re making to that customer… SLOs are what set customer expectations and tell IT and DevOps teams what goals they need to hit and measure themselves against.” While useful, these constructs are not beyond criticism.
While the focus at these three levels differs, CIOs should provide a consistent definition of high performance and how it’s measured. One way to do this is to ensure all digital transformation initiatives have documented vision statements and clearly defined business and end-user objectives when scheduling major deployments.
Early use cases include code generation and documentation, test case generation and test automation, as well as code optimization and refactoring, among others. “The maturity of any development organization can easily be measured in terms of the size and type of investment made in QA,” he says.
By documenting cases where automated systems misbehave, glitch or jeopardize users, we can better discern problematic patterns and mitigate risks. Lastly, CLTR said, capacity to monitor, investigate, and respond to incidents needs to be enhanced through measures such as the establishment of a pilot AI incident database.
“We’ve seen an ongoing iteration of experimentation, with a number of promising pilots in production,” he says. Samsara employees are applying these general-purpose assistants to a variety of use cases, like writing documentation and job descriptions, debugging code, or writing API endpoints.
First, you figure out what you want to improve; then you create an experiment; then you run the experiment; then you measure the results and decide what to do. For each of them, write down the KPI you're measuring, and what that KPI should be for you to consider your efforts a success. Measure and decide what to do.
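The improve-experiment-measure-decide loop above can be sketched in code. This is a minimal illustration, not a real framework; the KPI names and targets are hypothetical examples.

```python
# Minimal sketch of the experiment loop: for each KPI, record what you're
# measuring and the target it must hit for the effort to count as a success,
# then compare measured results against those targets and decide.
def evaluate_experiment(kpis: dict, targets: dict) -> dict:
    """Return a per-KPI decision: 'ship' if the target was met, else 'iterate'."""
    decisions = {}
    for name, target in targets.items():
        measured = kpis.get(name)
        decisions[name] = "ship" if measured is not None and measured >= target else "iterate"
    return decisions

# Hypothetical experiment: targets written down before the run, results after.
results = evaluate_experiment(
    kpis={"conversion_rate": 0.034, "avg_session_min": 4.2},
    targets={"conversion_rate": 0.03, "avg_session_min": 5.0},
)
```

Writing the target down before running the experiment is the point: it keeps the success criterion from being adjusted to fit whatever result comes back.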
A virtual assistant may save employees time when searching for old documents or composing emails, but most organizations have no idea how much time those tasks have taken historically, having never tracked such metrics before, she says.
For many enterprises, Microsoft provides not just document and email storage, but also the root of enterprise identity for those data sources, as Vadim Vladimirskiy, CEO of software developer Nerdio, points out. “If you pull your data from a document with no permission set on it, then there’s no information to be had,” he adds.
It comes in two modes: document-only and bi-encoder. For more details about these two terms, see Improving document retrieval with sparse semantic encoders. Simply put, in document-only mode, term expansion is performed only during document ingestion. Bi-encoder mode improves performance but may cause more latency.
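The two modes can be illustrated with a toy sketch of sparse scoring. This is not the OpenSearch implementation; the term weights and expansion terms below are invented for illustration. Both modes ultimately score documents by a dot product over weighted terms; they differ in where term expansion happens.

```python
# Toy sparse scoring: sum of query-term weight times matching document-term
# weight (a sparse dot product). Weights here are made up for illustration.
def sparse_score(query_terms: dict, doc_terms: dict) -> float:
    return sum(w * doc_terms.get(t, 0.0) for t, w in query_terms.items())

# The document was expanded at ingestion: "automobile" was added alongside
# "car" with a learned weight, so an "automobile" query matches either way.
doc = {"car": 1.2, "automobile": 0.8, "engine": 1.0}

# Document-only mode: the query uses raw tokens with uniform weight.
doc_only = sparse_score({"automobile": 1.0}, doc)

# Bi-encoder mode: the query is also run through the encoder at search time,
# adding related weighted terms -- better matching, at the cost of running
# the model per query (hence the extra latency).
bi_encoder = sparse_score({"automobile": 1.0, "car": 0.7, "vehicle": 0.5}, doc)
```

The sketch shows why bi-encoder mode can improve relevance (the expanded query matches more of the document's weighted terms) while document-only mode keeps query-time cost at plain tokenization.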
Like most enterprises, Bayer’s agricultural division will initially use AWS-based generative AI tools out-of-the-box to automate basic business processes, such as the production of internal technical documentation, McQueen says. Making that available across the division will spur more robust experimentation and innovation, he notes.
Lexical search looks for words in the documents that appear in the queries. Background A search engine is a special kind of database, allowing you to store documents and data and then run queries to retrieve the most relevant ones. OpenSearch Service supports a variety of search and relevance ranking techniques.
The early days of the pandemic taught organizations like Avery Dennison the power of agility and experimentation. The team was helped with live augmented reality annotations to document each step. “We are just starting to come back into the office, but in six months we’ll have a much better measure” of efficiencies gained.
“It wasn’t just a single measurement of particulates,” says Chris Mattmann, NASA JPL’s former chief technology and innovation officer. “It was many measurements the agents collectively decided was either too many contaminants or not.” They also had extreme measurement sensitivity.
Lexical search In lexical search, the search engine compares the words in the search query to the words in the documents, matching word for word. Semantic search doesn’t match individual query terms—it finds documents whose vector embedding is near the query’s embedding in the vector space and therefore semantically similar to the query.
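The contrast between the two approaches can be sketched directly. This is a toy illustration, not a search engine: the two-dimensional "embeddings" below are invented, whereas real systems use learned models with hundreds of dimensions.

```python
import math

# Lexical search: count exact word overlap between query and document.
def lexical_match(query: str, doc: str) -> int:
    return len(set(query.lower().split()) & set(doc.lower().split()))

# Semantic search: compare vector embeddings by cosine similarity.
def cosine(a, b) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

# "automobile" shares no tokens with "car engine repair", so lexical
# matching scores zero...
lex = lexical_match("automobile", "car engine repair")

# ...but hypothetical embeddings place the two terms close together in
# vector space, so semantic similarity is high.
embedding = {"automobile": [0.9, 0.1], "car": [0.85, 0.2]}
sem = cosine(embedding["automobile"], embedding["car"])
```

The example shows the failure mode lexical search has with synonyms, and why embedding proximity recovers documents that share meaning but not vocabulary with the query.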
Tokens: ChatGPT’s sense of “context”—the amount of text that it considers when it’s in conversation—is measured in “tokens,” which are also used for billing. Be very careful about documents that require any sort of precision: I would want a human lawyer to review anything it produced, since legal documents demand exactness.
“We’re seeing lots and lots of pilots,” says Gartner AI analyst Arun Chandrasekaran, who notes content creation, document summarization, sentiment analysis, and enterprise search as chief among the initial use cases. A recent survey of nearly 1,000 IT decision-makers conducted by Foundry underscores this.
By 2023, the focus shifted towards experimentation. Detailed data and model lineage tracking ensures comprehensive tracking and documentation of data transformations and model lifecycle events, enhancing reproducibility and auditability. These innovations pushed the boundaries of what generative AI could achieve.
“Since the middle of last year, we’ve been analyzing the potential impact, opportunities, and risks of the speed of innovation in this area, as well as introduced policies and implemented measures to minimize risks,” he says. But overall, we see this as a huge opportunity.
The process of doing data science is about learning from experimentation failures, but inadvertent errors can create enormous risks in model implementation. A Model Risk Management framework combines sound governance principles with end-to-end documentation in the design, development, validation, and deployment of new models in the business.
Automated development: With AutoAI , beginners can quickly get started and more advanced data scientists can accelerate experimentation in AI development. Data extraction: Platform capabilities help sort through complex details and quickly pull the necessary information from large documents.
This article will focus on the AI Research (AIR) team’s effort, specifically an experimental combination of Sisense BloX (actionable embedded analytics) and Quest (an advanced analytics add-on for Sisense), which we called the SEIR app. Dozens of Sisensers took part in project SiCo to create this COVID hub. Envisioning the SEIR app.
Experimentation with a use-case-driven approach. At least for now, they seem focused on use cases that improve productivity, with compelling opportunities in the areas of sales and marketing, code generation, and document generation. By that measure, you will indeed have done better than you thought. Looking forward.
Domino Lab supports both interactive and batch experimentation with all popular IDEs and notebooks (Jupyter, RStudio, SAS, Zeppelin, etc.). In this tutorial we will use JupyterLab. TIME: time points of measured pain score and plasma concentration (in hours).
For example, it includes patient samples (blood, blood pressure, temperature, and more), patient information (age, gender, where they have lived, family situation, and other details), and treatment history, most of which is currently found only in paper documents.
Bonus: Interactive CD containing six podcasts, one video, two web analytics metrics definitions documents, and five insightful PowerPoint presentations. Experimentation & Testing (A/B, Multivariate, you name it). In 480 pages the book goes from beginner's basics to advanced analytics concepts. Clicks and outcomes.
This shift from a relational to a graph approach has been well documented by Gartner, which advises that “using graph techniques at scale will form the foundation of modern data and analytics” and that “graph technologies will be used in 80% of data and analytics innovations by 2025.”
The vector engine supports a wide range of use cases across various domains, including image search, document search, music retrieval, product recommendation, video search, location-based search, fraud detection, and anomaly detection. One OCU can handle 4 million vectors for 128 dimensions or 500K for 768 dimensions at 99% recall rate.
While leaders have some reservations about the benefits of current AI, organizations are actively investing in gen AI deployment, significantly increasing budgets, expanding use cases, and transitioning projects from experimentation to production. Code (62%): Gen AI helps developers write code more efficiently and with fewer errors.
Without clarity in metrics, it’s impossible to do meaningful experimentation. AI PMs must ensure that experimentation occurs during three phases of the product lifecycle: Phase 1: Concept During the concept phase, it’s important to determine if it’s even possible for an AI product “ intervention ” to move an upstream business metric.
In an ideal world, experimentation through randomization of the treatment assignment allows the identification and consistent estimation of causal effects. This fact is well documented by Kang & Schafer (2007). In observational studies treatment is assigned by nature, therefore its mechanism is unknown and needs to be estimated.
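Under randomized treatment assignment, the simplest consistent estimator of the average treatment effect is the difference in group means. The sketch below illustrates that estimator on made-up outcome data; the numbers are hypothetical.

```python
# Difference-in-means estimator: with randomized assignment, the treated
# and control groups are comparable on average, so the gap between their
# mean outcomes consistently estimates the average treatment effect (ATE).
def difference_in_means(outcomes, treated):
    t = [y for y, d in zip(outcomes, treated) if d]
    c = [y for y, d in zip(outcomes, treated) if not d]
    return sum(t) / len(t) - sum(c) / len(c)

# Hypothetical data: first two units treated, last two control.
ate = difference_in_means([5.0, 6.0, 2.0, 3.0], [1, 1, 0, 0])
```

In observational data the same formula is generally biased, because treatment correlates with other determinants of the outcome; that is exactly why the assignment mechanism must be modeled, as the passage notes.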
Another study used smartphone geolocation data to measure face-to-face interactions among workers at various Silicon Valley firms. The study documents “substantial returns to face-to-face meetings … (and) returns to serendipity.” As a means of control, budgets measure performance against planned targets, influencing employee behavior.
Key To Your Digital Success: Web Analytics Measurement Model. " Measuring Incrementality: Controlled Experiments to the Rescue! Barriers To An Effective Web Measurement Strategy [+ Solutions!]. Measuring Online Engagement: What Role Does Web Analytics Play? "Engagement" How Do I Measure Success?
For example, P&C insurance strives to understand its customers and households better through data, to provide better customer service and anticipate insurance needs, as well as accurately measure risks. Life insurance needs accurate data on consumer health, age and other metrics of risk.
To support the iterative and experimental nature of industry work, Domino reached out to Addison-Wesley Professional (AWP) for permission to excerpt “Tuning Hyperparameters and Pipelines” from the book Machine Learning with Python for Everyone by Mark E. Beyond this, we also get a measure of the variability.
To collect these genre tags and other metadata, I took advantage of the well-documented Goodreads API. One way that we can get around this is to use the proportion of tags that fall into a given class as a measure of our degree of confidence in that class association. Unfortunately, that API does not permit collection of synopses.
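The tag-proportion idea can be sketched in a few lines: confidence in a class label is simply the share of an item's tags that fall into that class. The tag names below are hypothetical.

```python
from collections import Counter

# Confidence via tag proportions: if three of a book's four shelf tags say
# "fantasy", we treat 0.75 as our degree of confidence in that genre.
def genre_confidence(tags: list) -> dict:
    counts = Counter(tags)
    total = sum(counts.values())
    return {genre: n / total for genre, n in counts.items()}

# Hypothetical tag list collected for one book.
conf = genre_confidence(["fantasy", "fantasy", "romance", "fantasy"])
```

This turns a hard classification into a soft one, which is useful downstream: ambiguous items (confidence near 0.5) can be filtered out or weighted down rather than committed to a single class.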
For example, a retrieval-augmented generation (RAG) AI document search project can cost up to $1 million to deploy, with recurring per-user costs of up to $11,000 a year, according to Gartner. Still, a 30% failure rate represents a huge amount of time and money, given how widespread AI experimentation is today.
“There’s value in that kind of tinkering and experimentation on the employee level, but you want to do it safely,” says Nick van der Meulen, a research scientist at MIT CISR. “In both cases, organizations should measure the success by both immediate impact and how well these tools and solutions align with the long-term business goals.”
If 2023 was the year of experimentation with gen AI, 2024 was when companies zeroed in on use cases and started putting pilot projects into production. The tools are used to extract information from large documents, to help create presentations, to summarize lengthy reports, and to compare documents to find discrepancies.