Document, Experimentation and Testing

Escaping POC Purgatory: Evaluation-Driven Development for AI Systems

O'Reilly on Data

MARCH 25, 2025

Weve seen this across dozens of companies, and the teams that break out of this trap all adopt some version of Evaluation-Driven Development (EDD), where testing, monitoring, and evaluation drive every decision from the start. What breaks your app in production isnt always what you tested for in dev! The way out?

Testing

Testing Data-driven Software Measurement

Business Strategies for Deploying Disruptive Tech: Generative AI and ChatGPT

Rocket-Powered Data Science

FEBRUARY 15, 2023

Since ChatGPT is built from large language models that are trained against massive data sets (mostly business documents, internal text repositories, and similar resources) within your organization, consequently attention must be given to the stability, accessibility, and reliability of those resources. Test early and often.

Strategy

Strategy Experimentation Uncertainty Machine Learning

From project to product: Architecting the future of enterprise technology

CIO Business Intelligence

JANUARY 14, 2025

Documentation and diagrams transform abstract discussions into something tangible. By articulating fitness functions automated tests tied to specific quality attributes like reliability, security or performance teams can visualize and measure system qualities that align with business goals.

Enterprise

Enterprise Technology Metrics Measurement

Webinars

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

The early returns on gen AI for software development

CIO Business Intelligence

MARCH 12, 2024

Early use cases include code generation and documentation, test case generation and test automation, as well as code optimization and refactoring, among others. Gen AI is also reducing the time needed to complete testing, via automation, Ramakrishnan says. One example is with document search and summarization.

Software

Software Experimentation Testing Cost-Benefit

AI Product Management After Deployment

O'Reilly on Data

OCTOBER 13, 2020

In Bringing an AI Product to Market , we distinguished the debugging phase of product development from pre-deployment evaluation and testing. During testing and evaluation, application performance is important, but not critical to success. require not only disclosure, but also monitored testing. Debugging AI Products.

Management

Management Machine Learning Metrics Modeling

AI poised to replace entry-level positions at large financial institutions

CIO Business Intelligence

APRIL 12, 2024

Large banking firms are quietly testing AI tools under code names such as as Socrates that could one day make the need to hire thousands of college graduates at these firms obsolete, according to the report.

Experimentation

Experimentation Reporting Testing Statistics

Modernizing bp’s application landscape with AI

CIO Business Intelligence

JANUARY 24, 2025

Engagement with leadership and upskilling for personnel help develop the conditions for AI innovation and experimentation to take place, she says. Like many companies, bp is also using genAI to extract information from documents, summarize meetings, and so on, freeing up office workers time for more strategic activities.

Digital Transformation

Digital Transformation B2B Testing Experimentation

5 ways CIOs can help gen AI achieve its lightbulb moment

CIO Business Intelligence

FEBRUARY 9, 2024

You’ll want to make the policy a living document and update it on a suitable cadence as needed. Inside your organization, whether within the IT department or business units, be sure to emphasize and allow considerable time for testing and experimentation before going live.

IT

IT Experimentation Risk Testing

Expectations vs. reality: A real-world check on generative AI

CIO Business Intelligence

MAY 1, 2024

Pilots can offer value beyond just experimentation, of course. McKinsey reports that industrial design teams using LLM-powered summaries of user research and AI-generated images for ideation and experimentation sometimes see a reduction upward of 70% in product development cycle times.

Cost-Benefit

Cost-Benefit Metrics Insurance Measurement

Integrate sparse and dense vectors to enhance knowledge retrieval in RAG using Amazon OpenSearch Service

AWS Big Data

SEPTEMBER 5, 2024

It comes in two modes: document-only and bi-encoder. For more details about these two terms, see Improving document retrieval with sparse semantic encoders. Simply put, in document-only mode, term expansion is performed only during document ingestion. Bi-encoder mode improves performance but may cause more latency.

Metrics

Metrics Testing Experimentation Modeling

Disrupting the enterprise: How AI is redefining people, process, and productivity

CIO Business Intelligence

MAY 9, 2024

Experimentation drives momentum: How do we maximize the value of a given technology? Via experimentation. This can be as simple as a Google Sheet or sharing examples at weekly all-hands meetings Many enterprises do “blameless postmortems” to encourage experimentation without fear of making mistakes and reprisal.

Enterprise

Enterprise Experimentation Sales Manufacturing

What Are ChatGPT and Its Friends?

O'Reilly on Data

MARCH 23, 2023

It’s by far the most convincing example of a conversation with a machine; it has certainly passed the Turing test. Be very careful about documents that require any sort of precision. Still, I would want a human lawyer to review anything it produced; legal documents require precision. But it is an amazing analytic engine.”

IT

IT Modeling Testing Risk

How Far We Can Go with GenAI as an Information Extraction Tool

Ontotext

JANUARY 10, 2025

Our goal is to test whether GenAI can handle diverse domains effectively and determine if its a viable tool for domain-specific graph-building tasks. Through iterative experimentation, we incrementally added new modules refining the prompts. Prompting The quality of GenAI outputs is heavily influenced by how prompts are formulated.

Informatics

Informatics Modeling Metadata Experimentation

Bayer Crop Science blends gen AI and data science for innovative edge

CIO Business Intelligence

AUGUST 23, 2024

Like most enterprises, Bayer’s agricultural division will initially use AWS-based generative AI tools out-of-the-box to automate basic business processes, such as the production of internal technical documentation, McQueen says. Making that available across the division will spur more robust experimentation and innovation, he notes.

Data Science

Data Science Experimentation Testing Modeling

CBRE’s Sandeep Davé on accelerating your AI ambitions

CIO Business Intelligence

OCTOBER 5, 2023

Sandeep Davé knows the value of experimentation as well as anyone. As chief digital and technology officer at CBRE, Davé recognized early that the commercial real estate industry was ripe for AI and machine learning enhancements, and he and his team have tested countless use cases across the enterprise ever since.

Experimentation

Experimentation Strategy Machine Learning Interactive

How We Teach The Leaders of Tomorrow To Be Curious, Ask Questions and Not Be Afraid To Fail Fast To Learn Fast

Rocket-Powered Data Science

OCTOBER 16, 2020

We build models to test our understanding, but these models are not “one and done.” Images, text, documents, audio, video and all the apps on your phone, all the things you search for on the internet? They are part of a cycle of learning. What you see with your eyes? That’s data. What you hear with your ears? That’s all data.

Digital Transformation

Digital Transformation Experimentation Data Science Data Strategy

AI incident reporting shortcomings leave regulatory safety hole

CIO Business Intelligence

JULY 1, 2024

By documenting cases where automated systems misbehave, glitch or jeopardize users, we can better discern problematic patterns and mitigate risks. It’s then important to regularly test and validate AI systems to help identify potential issues proactively.”

Reporting

Reporting Risk Management Experimentation Risk

What high-performance IT teams look like today — and how to build one

CIO Business Intelligence

AUGUST 20, 2024

One way to do this is to ensure all digital transformation initiatives have documented vision statements and clearly defined business and end-user objectives when scheduling major deployments. CIOs should consider stepping into the conversation as facilitators when teams and stakeholders are at a standstill negotiating timelines and scope.

IT

IT Digital Transformation Experimentation Risk

Digital transformation’s fundamental change management mistake

CIO Business Intelligence

APRIL 2, 2024

Joanne Friedman, PhD, CEO, and principal of smart manufacturing at Connektedminds, says orchestrating success in digital transformation requires a symphony of integration across disciplines : “CIOs face the challenge of harmonizing diverse disciplines like design thinking, product management, agile methodologies, and data science experimentation.

Management

Management Digital Transformation Testing Experimentation

DataRobot Notebooks: Enhanced Code-First Experience for Rapid AI Experimentation

DataRobot Blog

JANUARY 10, 2023

ML model builders spend a ton of time running multiple experiments in a data science notebook environment before moving the well-tested and robust models from those experiments to a secure, production-grade environment for general consumption. Capabilities Beyond Classic Jupyter for End-to-end Experimentation. Auto-scale compute.

Experimentation

Experimentation Machine Learning Data Science Modeling

The Lean Analytics Cycle: Metrics > Hypothesis > Experiment > Act

Occam's Razor

APRIL 8, 2013

Sometimes, we escape the clutches of this sub optimal existence and do pick good metrics or engage in simple A/B testing. Testing out a new feature. If you have access to existing data, take some time to document what the current performance looks like. Identify, hypothesize, test, react. But it is not routine.

Metrics

Metrics KPI Analytics Key Performance Indicator

3 key digital transformation priorities for 2024

CIO Business Intelligence

DECEMBER 19, 2023

Create these six generative AI workstreams CIOs should document their AI strategy for delivering short-term productivity improvements while planning visionary impacts. If CIOs don’t improve conversions from pilot to production, they may find their investors losing patience in the process and culture of experimentation.

Digital Transformation

Digital Transformation Unstructured Data Machine Learning Risk Management

6 best practices to develop a corporate use policy for generative AI

CIO Business Intelligence

APRIL 14, 2023

In fact, it’s likely your organization has a large number of employees currently experimenting with generative AI, and as this activity moves from experimentation to real-life deployment, it’s important to be proactive before unintended consequences happen.

Risk

Risk Experimentation Marketing Risk Management

Enterprise IT moves forward — cautiously — with generative AI

CIO Business Intelligence

MARCH 7, 2023

Vince Kellen understands the well-documented limitations of ChatGPT, DALL-E and other generative AI technologies — that answers may not be truthful, generated images may lack compositional integrity, and outputs may be biased — but he’s moving ahead anyway. Michal Cenkl, director of innovation and experimentation, Mitre Corp.

Enterprise

Enterprise IT Unstructured Data Experimentation

Demystifying Multimodal LLMs

Dataiku

MARCH 25, 2024

One limitation observed while testing the LENS approach, particularly in VQA, is its heavy reliance on the output of the first modules, namely CLIP and BLIP captions. Here is a figure showing a comparison of extraction from the same web document: Comparison between image-text pairs (left) and interleaved image-text documents (right).

Visualization

Visualization Modeling Experimentation Testing

Generative AI copilots: What’s hype and where to drive results

CIO Business Intelligence

MARCH 5, 2024

Many other platforms, such as Coveo’s Relative Generative Answering , Quickbase AI , and LaunchDarkly’s Product Experimentation , have embedded virtual assistant capabilities but don’t brand them copilots. Today, top AI-assistant capabilities delivering results include generating code, test cases, and documentation.

Advertising

Advertising Cost-Benefit Experimentation Reporting

How the DataRobot AI Platform Is Delivering Value-Driven AI

DataRobot Blog

MARCH 16, 2023

This means they need the tools that can help with testing and documenting the model, automation across the entire pipeline and they need to be able to seamlessly integrate the model into business critical applications or workflows. Assured Compliance and Governance – DataRobot has always been strong on ensuring governance.

Experimentation

Experimentation Data-driven Modeling Metadata

5 methods to adopt responsible generative AI practice at work

CIO Business Intelligence

APRIL 5, 2023

Midjourney, ChatGPT, Bing AI Chat, and other AI tools that make generative AI accessible have unleashed a flood of ideas, experimentation and creativity. That turns generic documentation into conversational programming where the AI can take your data and show you how to write a query, for example.

Modeling

Modeling Testing Cost-Benefit Experimentation

Try semantic search with the Amazon OpenSearch Service vector engine

AWS Big Data

AUGUST 21, 2023

Lexical search looks for words in the documents that appear in the queries. Background A search engine is a special kind of database, allowing you to store documents and data and then run queries to retrieve the most relevant ones. OpenSearch Service supports a variety of search and relevance ranking techniques.

Data Processing

Data Processing Visualization Experimentation Metrics

Accelerating Cost Reduction: AI Making an Impact on Financial Services

Cloudera

OCTOBER 18, 2023

Document translation: When collaborating with multinational groups, generative AI can translate contracts, agreements, policies, and other legal/ business documents ensuring accurate written communication. Automated documentation generation: Generating documentation is time consuming and tedious.

Cost-Benefit

Cost-Benefit Testing Machine Learning Optimization

The pandemic pivot: 5 key leadership lessons that will last

CIO Business Intelligence

APRIL 4, 2022

The early days of the pandemic taught organizations like Avery Dennison the power of agility and experimentation. The team was helped with live augmented reality annotations to document each step. “We Employee crowdsourcing can yield breakthrough ideas. We are now making the solution available to more factories,” he says.

Experimentation

Experimentation Interactive Business Objectives Manufacturing

How CIOs align with CFOs to build RevOps

CIO Business Intelligence

APRIL 24, 2024

We’ve been doing proof-of-value and different test cases on efficiency opportunities within our organization as it relates to AI,” he says. A third gen AI product, BenefitsGPT, isn’t yet commercially available, but is currently being tested by three other Blue Cross Blue Shield organizations.

Sales

Sales Finance Experimentation Cost-Benefit

What’s new with Amazon MWAA support for Apache Airflow version 2.4.3

AWS Big Data

MAY 2, 2023

Test the feature To test this feature, run the producer DAG. Removal of experimental Smart Sensors. For detailed release documentation with sample code, visit the Apache Airflow v2.4.0 How dynamic task mapping works Let’s see an example using the reference code available in the Airflow documentation. Airflow v2.4.0

Testing

Testing Experimentation Management Metadata

Comparing the Functionality of Open Source Natural Language Processing Libraries

Domino Data Lab

APRIL 7, 2019

Other use cases may involve returning the most appropriate answer to a question, finding the most relevant documents for a query or classifying the input document itself. A good NLP library will make it easy to both train your own NLP models and integrate with the downstream ML or DL pipeline.

Deep Learning

Deep Learning Machine Learning Experimentation Modeling

How technology is shaping education in Saudi Arabia

CIO Business Intelligence

APRIL 11, 2024

KAUST Smart partners with companies and organizations to develop, test, and pilot technologies and to take advantage of our unique city environment. KAUST has launched numerous initiatives over the last couple of years in recognition of the ever-changing nature of the digital landscape.

Technology

Technology Digital Transformation Experimentation Testing

AI agents will transform business processes — and magnify risks

CIO Business Intelligence

AUGUST 21, 2024

Then there’s the risk of malicious code injections, where the code is hidden inside documents read by an AI agent, and the AI then executes the code. Enterprises also need to think about how they’ll test these systems to ensure they’re performing as intended. That’s the most difficult thing,” he says.

Risk

Risk Insurance Cost-Benefit Software

Amazon OpenSearch Service search enhancements: 2023 roundup

AWS Big Data

JANUARY 9, 2024

Lexical search In lexical search, the search engine compares the words in the search query to the words in the documents, matching word for word. Semantic search doesn’t match individual query terms—it finds documents whose vector embedding is near the query’s embedding in the vector space and therefore semantically similar to the query.

Visualization

Visualization Cost-Benefit Modeling Machine Learning

H&R Block answers tax questions using gen AI

CIO Business Intelligence

APRIL 15, 2024

Given the speed required, Lowden established a specialized team for the project to encourage a culture of experimentation and “moving fast to learn fast.” “You One of the challenging things we found was in getting the content right, the source documents to feed the LLM,” Lowden says. The first was safety and data privacy testing.

Testing

Testing Machine Learning Data Quality Experimentation

Saudi Vision 2030: Why the Kingdom is becoming a hub in EdTech education

CIO Business Intelligence

JUNE 7, 2023

KAUST Smart partners with companies and organizations to develop, test, and pilot technologies and to take advantage of our unique city environment.

Digital Transformation

Digital Transformation Experimentation Technology Testing

Behind the scenes: The daily impact of genAI at Hamburg’s largest gaming company

CIO Business Intelligence

DECEMBER 10, 2024

For example, AI-supported chat tools help our game designers to: Brainstorm ideas Test complex game mechanics Generate dialogs They act as digital sparring partners that open up new perspectives and accelerate the creative process. QueryMind training is based on information about the table structure, sample queries and documentation.

Data-driven

Data-driven Metadata Interactive KPI

Introducing the Next Generation of Text AI for AI Cloud Platform

DataRobot

DECEMBER 16, 2021

Take advantage of DataRobot’s wide range of options for experimentation. DataRobot’s Text AI clears the way for you to test various text and NLP techniques (such as “bag-of-words” models, tf-idf, cosine similarity, FastText, TinyBert, NLTK, spaCy, stop word removal, stemming, lemmatization, and many more). It is part of our new 7.3

Unstructured Data

Unstructured Data Experimentation Deep Learning Modeling

CIOs press ahead for gen AI edge — despite misgivings

CIO Business Intelligence

OCTOBER 18, 2023

We’re seeing lots and lots of pilots,” says Gartner AI analyst Arun Chandrasekaran, who notes content creation, document summarization, sentiment analysis, and enterprise search chief among the initial use cases. A recent survey of nearly 1,000 IT decision-makers conducted by Foundry underscores this. “As

Risk

Risk Manufacturing Enterprise Technology

How generative AI impacts your digital transformation priorities

CIO Business Intelligence

AUGUST 1, 2023

Clean and prep your data for private LLMs Generative AI capabilities will increase the importance and value of an enterprise’s unstructured data, including documents, videos, and content stored in learning management systems. What stops employees from trying a tool and pasting proprietary or other confidential information into their prompts?

Digital Transformation

Digital Transformation Unstructured Data Strategy Data Science

How Italian CIOs produce value with gen AI

CIO Business Intelligence

MAY 30, 2024

Having overcome the initial perplexity about ChatGPT, Maffei tested gen AI in coding activity and found great benefits. After this project, we’ll constantly introduce AI on other sectors and services like control of travel documentation.” AI is the future for us,” says Maffei.

Cost-Benefit

Cost-Benefit Sales Modeling Experimentation

Escaping POC Purgatory: Evaluation-Driven Development for AI Systems

Business Strategies for Deploying Disruptive Tech: Generative AI and ChatGPT

Webinars

Trending Sources

From project to product: Architecting the future of enterprise technology

Webinars

The early returns on gen AI for software development

AI Product Management After Deployment

AI poised to replace entry-level positions at large financial institutions

Modernizing bp’s application landscape with AI

5 ways CIOs can help gen AI achieve its lightbulb moment

Expectations vs. reality: A real-world check on generative AI

Integrate sparse and dense vectors to enhance knowledge retrieval in RAG using Amazon OpenSearch Service

Disrupting the enterprise: How AI is redefining people, process, and productivity

What Are ChatGPT and Its Friends?

How Far We Can Go with GenAI as an Information Extraction Tool

Bayer Crop Science blends gen AI and data science for innovative edge

CBRE’s Sandeep Davé on accelerating your AI ambitions

How We Teach The Leaders of Tomorrow To Be Curious, Ask Questions and Not Be Afraid To Fail Fast To Learn Fast

AI incident reporting shortcomings leave regulatory safety hole

What high-performance IT teams look like today — and how to build one

Digital transformation’s fundamental change management mistake

DataRobot Notebooks: Enhanced Code-First Experience for Rapid AI Experimentation

The Lean Analytics Cycle: Metrics > Hypothesis > Experiment > Act

3 key digital transformation priorities for 2024

6 best practices to develop a corporate use policy for generative AI

Enterprise IT moves forward — cautiously — with generative AI

Demystifying Multimodal LLMs

Generative AI copilots: What’s hype and where to drive results

How the DataRobot AI Platform Is Delivering Value-Driven AI

5 methods to adopt responsible generative AI practice at work

Try semantic search with the Amazon OpenSearch Service vector engine

Accelerating Cost Reduction: AI Making an Impact on Financial Services

The pandemic pivot: 5 key leadership lessons that will last

How CIOs align with CFOs to build RevOps

What’s new with Amazon MWAA support for Apache Airflow version 2.4.3

Comparing the Functionality of Open Source Natural Language Processing Libraries

How technology is shaping education in Saudi Arabia

AI agents will transform business processes — and magnify risks

Amazon OpenSearch Service search enhancements: 2023 roundup

H&R Block answers tax questions using gen AI

Saudi Vision 2030: Why the Kingdom is becoming a hub in EdTech education

Behind the scenes: The daily impact of genAI at Hamburg’s largest gaming company

Introducing the Next Generation of Text AI for AI Cloud Platform

CIOs press ahead for gen AI edge — despite misgivings

How generative AI impacts your digital transformation priorities

How Italian CIOs produce value with gen AI

Stay Connected