Modeling, Risk and Testing - Data Leaders Brief

Beyond “Prompt and Pray”

O'Reilly on Data

JANUARY 21, 2025

The Evolution of Expectations. For years, the AI world was driven by scaling laws: the empirical observation that larger models and bigger datasets led to proportionally better performance. This fueled a belief that simply making models bigger would solve deeper issues like accuracy, understanding, and reasoning.

Cost-Benefit Testing Interactive Software

Risk Management for AI Chatbots

O'Reilly on Data

JUNE 27, 2023

Doing so means giving the general public a freeform text box for interacting with your AI model. Welcome to your company’s new AI risk management nightmare. With a chatbot, the web form passes an end-user’s freeform text input—a “prompt,” or a request to act—to a generative AI model.

Risk Management Risk Management Modeling

Why you should care about debugging machine learning models

O'Reilly on Data

DECEMBER 12, 2019

Not least is the broadening realization that ML models can fail. And that’s why model debugging, the art and science of understanding and fixing problems in ML models, is so critical to the future of ML. Because all ML models make mistakes, everyone who cares about ML should also care about model debugging. [1]

Machine Learning Modeling Testing Risk Management

What are model governance and model operations?

O'Reilly on Data

JUNE 19, 2019

A look at the landscape of tools for building and deploying robust, production-ready machine learning models. We are also beginning to see researchers share sample code written in popular open source libraries, and some even share pre-trained models. Model development. Model governance. Source: Ben Lorica.

Modeling Machine Learning Testing Metrics

Escaping POC Purgatory: Evaluation-Driven Development for AI Systems

O'Reilly on Data

MARCH 25, 2025

We’ve seen this across dozens of companies, and the teams that break out of this trap all adopt some version of Evaluation-Driven Development (EDD), where testing, monitoring, and evaluation drive every decision from the start. What breaks your app in production isn’t always what you tested for in dev! The way out?

Testing Data-driven Software Measurement

7 types of tech debt that could cripple your business

CIO Business Intelligence

MARCH 25, 2025

CIOs perennially deal with technical debt’s risks, costs, and complexities. While the impacts of legacy systems can be quantified, technical debt is also often embedded in subtler ways across the IT ecosystem, making it hard to account for the full list of issues and risks.

Risk Cost-Benefit Data-driven Digital Transformation

5 top business use cases for AI agents

CIO Business Intelligence

MARCH 19, 2025

There are risks around hallucinations and bias, says Arnab Chakraborty, chief responsible AI officer at Accenture. Meanwhile, in December, OpenAI’s new o3 model, an agentic model not yet available to the public, scored 72% on the same test. SS&C uses Meta’s Llama as well as other models, says Halpin.

Software Risk Enterprise Cost-Benefit

CIOs face mounting pressure as AI costs and complexities threaten enterprise value

CIO Business Intelligence

OCTOBER 23, 2024

Despite AI’s potential to transform businesses, many senior technology leaders find themselves wrestling with unpredictable expenses, uneven productivity gains, and growing risks as AI adoption scales, Gartner said. “CIOs should create proofs of concept that test how costs will scale, not just how the technology works.”

Enterprise Cost-Benefit Broadcasting Risk

Generative AI in the Enterprise

O'Reilly on Data

NOVEMBER 28, 2023

And everyone has opinions about how these language models and art generation programs are going to change the nature of work, usher in the singularity, or perhaps even doom the human race. 16% of respondents working with AI are using open source models. A few have even tried out Bard or Claude, or run LLaMA 1 on their laptop.

Enterprise Testing Modeling Reporting

Our Favorite Questions

O'Reilly on Data

OCTOBER 22, 2020

Taking the time to work this out is like building a mathematical model: if you understand what a company truly does, you don’t just get a better understanding of the present, but you can also predict the future. Since I work in the AI space, people sometimes have a preconceived notion that I’ll only talk about data and models.

Consulting Risk Cost-Benefit Modeling

Bringing an AI Product to Market

O'Reilly on Data

JULY 28, 2020

Product Managers are responsible for the successful development, testing, release, and adoption of a product, and for leading the team that implements those milestones. You must detect when the model has become stale, and retrain it as necessary. The Core Responsibilities of the AI Product Manager. The AI Product Development Process.

Marketing Experimentation Metrics Testing

Accelerating AI for financial services: Innovation at scale with NVIDIA and Microsoft

CIO Business Intelligence

DECEMBER 18, 2024

Financial institutions have an unprecedented opportunity to leverage AI/GenAI to expand services, drive massive productivity gains, mitigate risks, and reduce costs. GenAI is also helping to improve risk assessment via predictive analytics.

Forecasting Predictive Analytics Risk Finance

AI Governance: Act now, thrive later

CIO Business Intelligence

JANUARY 30, 2025

These changes can expose businesses to risks and vulnerabilities such as security breaches, data privacy issues and harm to the company’s reputation. It also includes managing the risks, quality and accountability of AI systems and their outcomes. It is easy to see how the detractions can get in the way. Start with: An AI culture.

Testing Metrics Cost-Benefit Modeling

The Role of Model Governance in Machine Learning and Artificial Intelligence

Domino Data Lab

AUGUST 6, 2021

All models require testing and auditing throughout their deployment and, because models are continually learning, there is always an element of risk that they will drift from their original standards. As such, model governance needs to be applied to each model for as long as it’s being used.

Machine Learning Modeling Testing Data Science

Reclaiming the stories that algorithms tell

O'Reilly on Data

MAY 27, 2020

Under school district policy, each of Audrey’s eleven- and twelve-year-old students is tested at least three times a year to determine his or her Lexile, a number between 200 and 1,700 that reflects how well the student can read. They test each student’s grasp of a particular sentence or paragraph—but not of a whole story.

Risk Testing Reporting Measurement

AI agents will transform business processes — and magnify risks

CIO Business Intelligence

AUGUST 21, 2024

According to Gartner, an agent doesn’t have to be an AI model. Starting in 2018, the agency used agents, in the form of Raspberry Pi computers running biologically-inspired neural networks and time series models, as the foundation of a cooperative network of sensors. Adding smarter AI also adds risk, of course.

Risk Insurance Cost-Benefit Software

Structural Evolutions in Data

O'Reilly on Data

SEPTEMBER 19, 2023

Stage 2: Machine learning models. Hadoop could kind of do ML, thanks to third-party tools. While data scientists were no longer handling Hadoop-sized workloads, they were trying to build predictive models on a different kind of “large” dataset: so-called “unstructured data.” And it was good.

Machine Learning Testing Modeling Cost-Benefit

7 ways gen AI can create more work than it saves

CIO Business Intelligence

NOVEMBER 13, 2024

One is going through the big areas where we have operational services and look at every process to be optimized using artificial intelligence and large language models. But a substantial 23% of respondents say the AI has underperformed expectations as models can prove to be unreliable and projects fail to scale.

IT Consulting ROI Cost-Benefit

What Are ChatGPT and Its Friends?

O'Reilly on Data

MARCH 23, 2023

What is it, how does it work, what can it do, and what are the risks of using it? It’s important to understand that ChatGPT is not actually a language model. It’s a convenient user interface built around one specific language model, GPT-3.5. The GPT-series LLMs are also called “foundation models.” GPT-2, 3, 3.5,

IT Modeling Testing Risk

AI Powered Misinformation and Manipulation at Scale #GPT-3

O'Reilly on Data

MAY 25, 2021

GPT-3 is essentially an auto-complete bot whose underlying Machine Learning (ML) model has been trained on vast quantities of text available on the Internet. I’d like to share my thoughts on GPT-3 in terms of risks and countermeasures, and discuss real examples of how I have interacted with the model to support my learning journey.

Modeling Marketing Testing Machine Learning

Business Strategies for Deploying Disruptive Tech: Generative AI and ChatGPT

Rocket-Powered Data Science

FEBRUARY 15, 2023

While generative AI has been around for several years, the arrival of ChatGPT (a conversational AI tool for all business occasions, built and trained from large language models) has been like a brilliant torch brought into a dark room, illuminating many previously unseen opportunities. So, if you have 1 trillion data points (e.g.,

Strategy Experimentation Uncertainty Machine Learning

Build a high-performance quant research platform with Apache Iceberg

AWS Big Data

JANUARY 9, 2025

This simplifies data modification processes, which is crucial for ingesting and updating large volumes of market and trade data, quickly iterating on backtesting and reprocessing workflows, and maintaining detailed audit trails for risk and compliance requirements. At petabyte scale, Iceberg’s advantages become clear.

Metadata Snapshot Cost-Benefit Optimization

Managing machine learning in the enterprise: Lessons from banking and health care

O'Reilly on Data

JULY 15, 2019

In recent posts, we described requisite foundational technologies needed to sustain machine learning practices within organizations, and specialized tools for model development, model governance, and model operations/testing/monitoring. (Note that the emphasis of SR 11-7 is on risk management.) Image by Ben Lorica.

Machine Learning Management Enterprise Risk Management

Deep tech disruption: How advanced technologies are transforming businesses

CIO Business Intelligence

MARCH 4, 2025

From AI models that boost sales to robots that slash production costs, advanced technologies are transforming both top-line growth and bottom-line efficiency. The takeaway is clear: embrace deep tech now, or risk being left behind by those who do. Today, that timeline is shrinking dramatically. That’s a remarkably short horizon for ROI.

Technology ROI Cost-Benefit Manufacturing

US Air Force seeks generative AI test pilots

CIO Business Intelligence

JUNE 13, 2024

Not instant perfection. The NIPRGPT experiment is an opportunity to conduct real-world testing, measuring generative AI’s computational efficiency, resource utilization, and security compliance to understand its practical applications. It is not training the model, nor are responses refined based on any user inputs.

Testing Experimentation Data Processing Modeling

6 keys to genAI success in 2025

CIO Business Intelligence

DECEMBER 17, 2024

What are the associated risks and costs, including operational, reputational, and competitive? For AI models to succeed, they must be fed high-quality data that’s accurate, up-to-date, secure, and complies with privacy regulations such as the Colorado Privacy Act, California Consumer Privacy Act, or General Data Protection Regulation (GDPR).

Experimentation ROI Risk Data Quality

Start DataOps Today with ‘Lean DataOps’

DataKitchen

SEPTEMBER 20, 2021

The best way to ensure error-free execution of data production is through automated testing and monitoring. The DataKitchen Platform enables data teams to integrate testing and observability into data pipeline orchestrations. Automated tests work 24×7 to ensure that the results of each processing stage are accurate and correct.

Testing Metrics Measurement Dashboards

Avoiding Toxicity in Generative AI

David Menninger's Analyst Perspectives

SEPTEMBER 24, 2024

erroneous results), and an equal amount (32%) mentioned legal risk. AI governance should address a number of issues, including data privacy, bias in data and models, drift in model accuracy, hallucinations and toxicity. Red-teaming is a term used to describe human testing of models for vulnerabilities.

Testing Modeling Enterprise Risk

What gives IT leaders pause as they look to integrate agentic AI with legacy infrastructure

CIO Business Intelligence

FEBRUARY 26, 2025

AI agents are powered by gen AI models but, unlike chatbots, they can handle more complex tasks, work autonomously, and be combined with other AI agents into agentic systems capable of tackling entire workflows, replacing employees or addressing high-level business goals. D&B is not alone in worrying about the risks of AI agents.

IT Enterprise Interactive Data Quality

What CIOs should learn now that DeepSeek is here

CIO Business Intelligence

JANUARY 30, 2025

Chinese AI startup DeepSeek made a big splash last week when it unveiled an open-source version of its reasoning model, DeepSeek-R1, claiming performance superior to OpenAI’s o1 generative pre-trained transformer (GPT). That echoes a statement issued by NVIDIA on Monday: DeepSeek is a perfect example of test time scaling.

Modeling Data-driven Technology Strategy

Delegation: The biggest test for transformational CIOs

CIO Business Intelligence

JANUARY 19, 2023

Giving up control: rewards outweigh the risks. The benefits of greater delegation include an increase in the velocity of decision making and an increased sense of ownership and accountability, says Cisco Sanchez, SVP and CIO at Qualcomm, which has been increasing the pace of delegation. They just need visibility.

Testing Digital Transformation Risk Management

ServiceOps: Unleashing a new AI agent to reduce change failures in complex systems

CIO Business Intelligence

JANUARY 21, 2025

DevOps teams follow their own practices of using continuous integration and continuous deployment (CI/CD) tools to automatically merge code changes and automate testing steps to deploy changes more frequently and reliably. With this information, teams can ask the AI agent additional questions such as “Should I approve the change?”

Risk Data-driven Interactive Management

3 musts when recruiting vendors for AI

CIO Business Intelligence

MARCH 5, 2025

If they decide a project could solve a big enough problem to merit certain risks, they then make sure they understand what type of data will be needed to address the solution. The next thing is to make sure they have an objective way of testing the outcome and measuring success. But we don’t ignore the smaller players.

Testing Measurement Technology Experimentation

Cost, security, and flexibility: the business case for open source gen AI

CIO Business Intelligence

DECEMBER 11, 2024

To solve the problem, the company turned to gen AI and decided to use both commercial and open source models. With security, many commercial providers use their customers’ data to train their models, says Ringdahl. That’s one of the catches of proprietary commercial models, he says. It’s possible to opt-out, but there are caveats.

Cost-Benefit Modeling Marketing Sales

Practical Skills for The AI Product Manager

O'Reilly on Data

MAY 14, 2020

Experimentation: It’s just not possible to create a product by building, evaluating, and deploying a single model. In reality, many candidate models (frequently hundreds or even thousands) are created during the development process. Modelling: The model is often misconstrued as the most important component of an AI product.

Management Experimentation B2B Machine Learning

Automating the Automators: Shift Change in the Robot Factory

O'Reilly on Data

JANUARY 17, 2023

Building Models. A common task for a data scientist is to build a predictive model. You’ll try this with a few other algorithms, and their respective tuning parameters–maybe even break out TensorFlow to build a custom neural net along the way–and the winning model will be the one that heads to production.

Machine Learning Predictive Modeling Software Modeling

ChatGPT, Author of The Quixote

O'Reilly on Data

MARCH 26, 2024

TL;DR LLMs and other GenAI models can reproduce significant chunks of training data. Researchers are finding more and more ways to extract training data from ChatGPT and other models. And the space is moving quickly: SORA, OpenAI’s text-to-video model, is yet to be released and has already taken the world by storm.

Modeling Machine Learning Risk Advertising

Is your disaster recovery a house of cards? Why BIA, BCP, and DRP are your foundation

CIO Business Intelligence

MARCH 4, 2025

In fact, successful recovery from cyberattacks and other disasters hinges on an approach that integrates business impact assessments (BIA), business continuity planning (BCP), and disaster recovery planning (DRP), including rigorous testing. (See also: How resilient CIOs future-proof to mitigate risks.)

Testing Risk Consulting Risk Management

AI brings complexity to cybersecurity and fraud

CIO Business Intelligence

MARCH 31, 2025

The 2024 Security Priorities study shows that for 72% of IT and security decision makers, their roles have expanded to accommodate new challenges, with risk management, securing AI-enabled technology, and emerging technologies being added to their plate. Regular engagement with the board and business leaders ensures risk visibility.

Data-driven Risk Risk Management Metrics

Closer to AGI?

O'Reilly on Data

JUNE 7, 2022

DeepMind’s new model, Gato, has sparked a debate on whether artificial general intelligence (AGI) is nearer–almost at hand–just a matter of scale. Gato is a model that can solve multiple unrelated problems: it can play a large number of different games, label images, chat, operate a robot, and more. If we had AGI, how would we know it?

Modeling Interactive Optimization Deep Learning

AI will evolve the role of the CIO

CIO Business Intelligence

NOVEMBER 4, 2024

Mark Read, CEO of global advertising giant WPP, recently told shareholders: “AI will also offer the ability to develop new business and financial models.” Lead the conversation with the board on risks, pros and cons, and talk like a businessperson. “Do not dismiss yourself from being the driver, and reinvent yourself,” Langer advises.

Business Driver Advertising Data-driven Modeling

How to Set AI Goals

O'Reilly on Data

SEPTEMBER 15, 2020

In my book, I introduce the Technical Maturity Model: I define technical maturity as a combination of three factors at a given point of time. Technical competence results in reduced risk and uncertainty. AI initiatives may also require significant considerations for governance, compliance, ethics, cost, and risk.

Advertising Cost-Benefit ROI Machine Learning

How To Succeed As a DataOps Engineer

DataKitchen

NOVEMBER 20, 2021

A DataOps Engineer can make test data available on demand. We have automated testing and a system for exception reporting, where tests identify issues that need to be addressed. It then autogenerates QC tests based on those rules. Let’s say a data scientist has developed a model that works perfectly with training data.

Testing Machine Learning Data Warehouse Analytics

Accomplish Agile Business Intelligence & Analytics For Your Business

datapine

APRIL 15, 2020

Your Chance: Want to test an agile business intelligence solution? Business intelligence is moving away from the traditional engineering model: analysis, design, construction, testing, and implementation. In the traditional model, communication between developers and business users is not a priority. Finalize testing.

Business Intelligence Analytics Testing Dashboards
