Throughout this article, we'll explore real-world examples of LLM application development and then consolidate what we've learned into a set of first principles, covering areas like nondeterminism, evaluation approaches, and iteration cycles, that can guide your work regardless of which models or frameworks you choose.
It is important to be careful when deploying an AI application, but it's also important to realize that all AI is experimental. It would have been very difficult to develop the expertise to build and train a model in-house; it was much more effective to work with a company that already has that expertise. What are your specific use cases?
Nate Melby, CIO of Dairyland Power Cooperative, says the Midwestern utility has been churning out large language models (LLMs) that not only automate document summarization but also help manage power grids during storms, for example. Only 13% plan to build a model from scratch.
Build toward intelligent document management: Most enterprises have document management systems to extract information from PDFs, word processing files, and scanned paper documents, where document structure and the required information aren't complex.
While generative AI has been around for several years, the arrival of ChatGPT (a conversational AI tool for all business occasions, built and trained from large language models) has been like a brilliant torch brought into a dark room, illuminating many previously unseen opportunities.
Documentation and diagrams transform abstract discussions into something tangible. They achieve this through models, patterns, and peer review, taking complex challenges and breaking them down into understandable components that stakeholders can grasp and discuss.
Similarly, in “Building Machine Learning Powered Applications: Going from Idea to Product,” Emmanuel Ameisen states: “Indeed, exposing a model to users in production comes with a set of challenges that mirrors the ones that come with debugging a model.”
It’s important to understand that ChatGPT is not actually a language model. It’s a convenient user interface built, with specialized training, around one specific language model, GPT-3.5. GPT-3.5 is one of a class of language models that are sometimes called “large language models” (LLMs), though that term isn’t very helpful.
Generative AI (GenAI) models, such as GPT-4, offer a promising solution, potentially reducing the dependency on labor-intensive annotation. Through iterative experimentation, we incrementally added new modules, refining the prompts.

BioRED performance:
Prompt           Model    P    R    F1    Price    Latency
Generic prompt   GPT-4o   72   35   47.8
In the context of comprehensive data governance, Amazon DataZone offers organization-wide data lineage visualization using Amazon Web Services (AWS) services, while dbt provides project-level lineage through model analysis and supports cross-project integration between data lakes and warehouses.
And it enables research teams to analyze legislation and policy documents in record time, delivering plans for proposed changes to these critical agencies in a day rather than weeks. By June 2024, MITREChatGPT offered document analysis and reasoning on thousands of documents and provided an enterprise prompt library.
I first described the overall AI landscape and made sure they realized we've been doing AI for quite a while in the form of machine learning and other deterministic models. This reinforces the need for good data governance, as AI models will surface incorrect data more frequently, and most likely at a greater cost to the business.
It’s embedded in the applications we use every day, and the security model overall is pretty airtight. Microsoft has also made investments beyond OpenAI, for example in Mistral and Meta’s Llama models, in its own small language models like Phi, and by partnering with providers like Cohere, Hugging Face, and Nvidia.
Even as it designs 3D generative AI models for future customer deployment, CAD/CAM design giant Autodesk is “leaning” into generative AI for its customer service operations, deploying Salesforce’s Einstein for Service with plans to use Agentforce in the future, CIO Prakash Kota says.
Our mental models of what constitutes a high-performance team have evolved considerably over the past five years. Post-pandemic, high-performance teams excelled at remote and hybrid working models, were more empathetic to individual needs, and leveraged automation to reduce manual work.
Sandeep Davé knows the value of experimentation as well as anyone. Davé and his team’s achievements in AI are due in large part to creating opportunities for experimentation — and ensuring those experiments align with CBRE’s business strategy. Let’s start with the models. And those experiments have paid off.
We build models to test our understanding, but these models are not “one and done.” In ML, the learning cycle is driven by backpropagation, where the errors (inaccurate predictions) of our models are fed back into adjusting the model’s parameters in a way that aims to improve the output accuracy.
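That feedback loop — predict, measure the error, nudge the parameters, repeat — can be sketched numerically. The single-parameter linear model and toy data below are invented purely for illustration:

```python
# Minimal gradient-descent loop: prediction errors are fed back
# to adjust the model's parameter, improving accuracy each step.

def train(xs, ys, w=0.0, lr=0.1, steps=100):
    for _ in range(steps):
        # mean gradient of squared error for predictions w * x
        grad = sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)
        w -= lr * grad  # move the parameter against the gradient
    return w

# data generated by y = 3x; training should recover w close to 3
w = train([1.0, 2.0, 3.0], [3.0, 6.0, 9.0])
print(round(w, 2))
```

Real backpropagation applies this same idea layer by layer through a network via the chain rule; the loop above is the one-parameter special case.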
Early use cases include code generation and documentation, test case generation and test automation, as well as code optimization and refactoring, among others. “Junior developers are reporting the biggest productivity boosts, but this remains an area of active research and experimentation,” Tandon says.
Experimentation drives momentum: How do we maximize the value of a given technology? Via experimentation. This can be as simple as a Google Sheet or sharing examples at weekly all-hands meetings. Many enterprises do “blameless postmortems” to encourage experimentation without fear of mistakes or reprisal.
Gen AI takes us from single-use machine learning (ML) models to AI tools that promise to be a platform with uses in many areas, but you still need to validate that they’re appropriate for the problems you want solved, and that your users know how to use gen AI effectively. Pilots can offer value beyond just experimentation, of course.
In the context of Retrieval-Augmented Generation (RAG), knowledge retrieval plays a crucial role, because the effectiveness of retrieval directly impacts the maximum potential of large language model (LLM) generation. It comes in two modes: document-only and bi-encoder. You can get its model ID from the response.
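As a generic illustration of the bi-encoder idea (not the specific service's API), a retriever embeds the query and the documents separately and ranks documents by vector similarity; the toy three-dimensional embeddings below are invented, standing in for the output of a real encoder model:

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

# Toy pre-computed document embeddings (a real system would get these
# from an embedding model and store them in a vector index).
doc_vecs = {
    "doc_grid": [0.9, 0.1, 0.0],
    "doc_cook": [0.1, 0.8, 0.2],
}
query_vec = [0.8, 0.2, 0.1]  # embedding of the user's query

# Rank documents by similarity to the query embedding.
ranked = sorted(doc_vecs, key=lambda d: cosine(query_vec, doc_vecs[d]),
                reverse=True)
print(ranked[0])
```

Because documents are embedded ahead of time, only the query needs encoding at search time, which is what makes the bi-encoder pattern fast enough for first-stage retrieval in RAG.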
The excerpt covers how to create word vectors and utilize them as an input into a deep learning model. While the field of computational linguistics, or Natural Language Processing (NLP), has been around for decades, the increased interest in and use of deep learning models has also propelled applications of NLP forward within industry.
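A common minimal version of that pipeline averages per-word vectors into one fixed-size sentence vector that a downstream model can consume; the tiny vocabulary and 3-d vectors here are invented for illustration (real embeddings such as word2vec or GloVe have hundreds of dimensions):

```python
# Toy word-vector table; in practice this is loaded from a
# pretrained embedding file.
embeddings = {
    "deep":     [0.2, 0.7, 0.1],
    "learning": [0.3, 0.6, 0.2],
    "rocks":    [0.9, 0.1, 0.5],
}

def sentence_vector(text):
    """Average the vectors of known words into a fixed-size input."""
    vecs = [embeddings[w] for w in text.lower().split() if w in embeddings]
    if not vecs:
        return [0.0] * 3
    return [sum(col) / len(vecs) for col in zip(*vecs)]

vec = sentence_vector("Deep learning rocks")
print([round(v, 3) for v in vec])
```

The fixed-size output is what lets a dense layer (or any classic classifier) accept variable-length text as input.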
Notable examples of AI safety incidents include: trading algorithms causing market “flash crashes”; facial recognition systems leading to wrongful arrests; autonomous vehicle accidents; and AI models providing harmful or misleading information through social media channels.
Data scientists at Bayer have developed several proofs of concept of generative AI models on the new platform that remain in the discovery and evaluation phase for “efficacy,” McQueen says, adding that the models won’t be in production until 2025. “The R&D pipeline is pretty highly confidential at this point,” he says.
Most, if not all, machine learning (ML) models in production today were born in notebooks before they were put into production. Data science teams of all sizes need a productive, collaborative method for rapid AI experimentation.
A virtual assistant may save employees time when searching for old documents or composing emails, but most organizations have no idea how much time those tasks have taken historically, having never tracked such metrics before, she says. However, foundational models will always have a place as the core backbone for the industry.”
This scenario is not science fiction but a glimpse into the capabilities of Multimodal Large Language Models (M-LLMs), where the convergence of various modalities extends the landscape of AI. But instead, a machine seamlessly identifies the scene and its location, provides a detailed description, and even suggests nearby attractions.
Set parameters and emphasize collaboration To address one root cause of shadow IT, CIOs must also establish a governance and delivery model for evaluating, procuring, and implementing department technology solutions.
by Charlotte DeKeyrel, Expert Decision Modeler, Decision Management Solutions. When decisions are properly documented and developed with rigor, everyone gets smarter by understanding the complexities and flow of decision-making. You know who they are—the members of the brain trust in your organizations.
Vince Kellen understands the well-documented limitations of ChatGPT, DALL-E and other generative AI technologies — that answers may not be truthful, generated images may lack compositional integrity, and outputs may be biased — but he’s moving ahead anyway. And, he says, using generative AI for coding has worked well.
Why model-driven AI falls short of delivering value: Teams that focus only on model performance using model-centric and data-centric ML risk missing the big-picture business context. We are also thrilled to share the innovations and capabilities that we have developed at DataRobot to meet and exceed those requirements.
Many technology investments are merely transitionary, taking something done today and upgrading it to a better capability without necessarily transforming the business or operating model. If CIOs don’t improve conversions from pilot to production, they may find their investors losing patience in the process and culture of experimentation.
Other use cases may involve returning the most appropriate answer to a question, finding the most relevant documents for a query or classifying the input document itself. A good NLP library will make it easy to both train your own NLP models and integrate with the downstream ML or DL pipeline. Training domain-specific models.
Lexical search: In lexical search, the search engine compares the words in the search query to the words in the documents, matching word for word. Traditional lexical search, based on term-frequency models like BM25, is widely used and effective for many search applications. Only items that contain the words the user typed match the query.
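Word-for-word matching can be sketched as a term-overlap scorer. Real engines weight matches by term frequency and document length (as BM25 does); the toy scorer below, with invented documents, just counts overlapping terms to show the matching behavior:

```python
def lexical_score(query, doc):
    """Count how many query terms literally appear in the document."""
    q_terms = set(query.lower().split())
    d_terms = set(doc.lower().split())
    return len(q_terms & d_terms)

docs = {
    "a": "storm response plan for the power grid",
    "b": "quarterly financial summary",
}
query = "power grid storm"

# Documents containing none of the user's words score zero,
# i.e. they do not match the query at all.
scores = {name: lexical_score(query, text) for name, text in docs.items()}
print(scores)
```

This is exactly the limitation semantic search addresses: a document phrased with synonyms of the query scores zero here, however relevant it is.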
for model serving (experimental), are implemented with Ray internally for its scalable, distributed computing and state management benefits, while providing a domain-specific API for the purposes they serve. Motivations for Ray: training a reinforcement learning (RL) model; handling stateful computation with an actor model.
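The core of the actor idea — mutable state reached only through method calls, with access serialized so concurrent callers never corrupt it — can be sketched in plain Python. This is an illustration of the pattern, not Ray's implementation (Ray actors are remote classes whose method calls return futures):

```python
import threading

class CounterActor:
    """Minimal actor-style wrapper: state is mutated only through
    methods, and a lock serializes concurrent callers."""

    def __init__(self):
        self._lock = threading.Lock()
        self._count = 0

    def increment(self):
        with self._lock:
            self._count += 1
            return self._count

actor = CounterActor()
threads = [threading.Thread(target=actor.increment) for _ in range(10)]
for t in threads:
    t.start()
for t in threads:
    t.join()

final = actor.increment()  # 11th call; state survived all callers
print(final)
```

In Ray the same shape is distributed: the actor lives in its own process, and method calls are dispatched to it one at a time, which is what makes stateful computation safe at cluster scale.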
Lexical search looks for words in the documents that appear in the queries. For the demo, we’re using the Amazon Titan foundation model hosted on Amazon Bedrock for embeddings, with no fine-tuning.
Model Risk Management is about reducing the harmful consequences of decisions caused by trusting incorrect or misused model outputs. Systematically enabling model development and production deployment at scale entails the use of an Enterprise MLOps platform, which addresses the full lifecycle, including Model Risk Management.
In fact, it’s likely your organization has a large number of employees currently experimenting with generative AI, and as this activity moves from experimentation to real-life deployment, it’s important to be proactive before unintended consequences happen. For example, will this cover all forms of AI or just generative AI?
But the rise of large language models (LLMs) is starting to make true knowledge management (KM) a reality. These models can extract meaning from digital data at scale and speed beyond the capabilities of human analysts. Data exists in ever larger silos, but real knowledge still resides in employees.
Midjourney, ChatGPT, Bing AI Chat, and other AI tools that make generative AI accessible have unleashed a flood of ideas, experimentation and creativity. That turns generic documentation into conversational programming where the AI can take your data and show you how to write a query, for example.
But to find ways it can help grow a company’s bottom line, CIOs have to do more to understand a company’s business model and identify opportunities where gen AI can change the playing field. “We have a HITRUST-certified health care environment and we bring in publicly available models. And there are audit trails for everything.”
According to Gartner, an agent doesn’t have to be an AI model. Starting in 2018, the agency used agents, in the form of Raspberry Pi computers running biologically inspired neural networks and time series models, as the foundation of a cooperative network of sensors. And, yes, enterprises are already deploying them.
“We’ve seen an ongoing iteration of experimentation with a number of promising pilots in production,” he says. Samsara employees are applying these general-purpose assistants to a variety of use cases, like writing documentation and job descriptions, debugging code, or writing API endpoints.
Joanne Friedman, PhD, CEO, and principal of smart manufacturing at Connektedminds, says orchestrating success in digital transformation requires a symphony of integration across disciplines: “CIOs face the challenge of harmonizing diverse disciplines like design thinking, product management, agile methodologies, and data science experimentation.”