Remove Document Remove Risk Remove Testing
article thumbnail

Beyond “Prompt and Pray”

O'Reilly on Data

Your companys AI assistant confidently tells a customer its processed their urgent withdrawal requestexcept it hasnt, because it misinterpreted the API documentation. These are systems that engage in conversations and integrate with APIs but dont create stand-alone content like emails, presentations, or documents.

article thumbnail

Risk Management for AI Chatbots

O'Reilly on Data

Welcome to your company’s new AI risk management nightmare. Before you give up on your dreams of releasing an AI chatbot, remember: no risk, no reward. The core idea of risk management is that you don’t win by saying “no” to everything. Why not take the extra time to test for problems?

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

The Race For Data Quality in a Medallion Architecture

DataKitchen

Finally, the challenge we are addressing in this document – is how to prove the data is correct at each layer.? Get Off The Blocks Fast: Data Quality In The Bronze Layer Effective Production QA techniques begin with rigorous automated testing at the Bronze layer , where raw data enters the lakehouse environment.

article thumbnail

Escaping POC Purgatory: Evaluation-Driven Development for AI Systems

O'Reilly on Data

Weve seen this across dozens of companies, and the teams that break out of this trap all adopt some version of Evaluation-Driven Development (EDD), where testing, monitoring, and evaluation drive every decision from the start. What breaks your app in production isnt always what you tested for in dev! The way out?

Testing 177
article thumbnail

5 top business use cases for AI agents

CIO Business Intelligence

There are risks around hallucinations and bias, says Arnab Chakraborty, chief responsible AI officer at Accenture. Meanwhile, in December, OpenAIs new O3 model, an agentic model not yet available to the public, scored 72% on the same test. That adds up to millions of documents a month that need to be processed.

Software 143
article thumbnail

Preparing for Q-Day: Safeguarding Enterprises Against Quantum Threats

David Menninger's Analyst Perspectives

This impending shift not only poses significant risks for individuals but also presents a high-stakes event that every enterprise must anticipate and prepare for; inadequate preparation could lead to substantial data breaches, compromised systems and irrevocable damage to customer trust and organizational reputation.

article thumbnail

7 types of tech debt that could cripple your business

CIO Business Intelligence

CIOs perennially deal with technical debts risks, costs, and complexities. While the impacts of legacy systems can be quantified, technical debt is also often embedded in subtler ways across the IT ecosystem, making it hard to account for the full list of issues and risks.

Risk 140