Remove Document Remove Risk Remove Testing
article thumbnail

Beyond “Prompt and Pray”

O'Reilly on Data

Your companys AI assistant confidently tells a customer its processed their urgent withdrawal requestexcept it hasnt, because it misinterpreted the API documentation. These are systems that engage in conversations and integrate with APIs but dont create stand-alone content like emails, presentations, or documents.

article thumbnail

Risk Management for AI Chatbots

O'Reilly on Data

Welcome to your company’s new AI risk management nightmare. Before you give up on your dreams of releasing an AI chatbot, remember: no risk, no reward. The core idea of risk management is that you don’t win by saying “no” to everything. Why not take the extra time to test for problems?

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

5 top business use cases for AI agents

CIO Business Intelligence

There are risks around hallucinations and bias, says Arnab Chakraborty, chief responsible AI officer at Accenture. Meanwhile, in December, OpenAIs new O3 model, an agentic model not yet available to the public, scored 72% on the same test. That adds up to millions of documents a month that need to be processed.

Software 143
article thumbnail

The Race For Data Quality in a Medallion Architecture

DataKitchen

Finally, the challenge we are addressing in this document – is how to prove the data is correct at each layer.? Get Off The Blocks Fast: Data Quality In The Bronze Layer Effective Production QA techniques begin with rigorous automated testing at the Bronze layer , where raw data enters the lakehouse environment.

article thumbnail

Escaping POC Purgatory: Evaluation-Driven Development for AI Systems

O'Reilly on Data

Weve seen this across dozens of companies, and the teams that break out of this trap all adopt some version of Evaluation-Driven Development (EDD), where testing, monitoring, and evaluation drive every decision from the start. What breaks your app in production isnt always what you tested for in dev! The way out?

Testing 168
article thumbnail

7 types of tech debt that could cripple your business

CIO Business Intelligence

CIOs perennially deal with technical debts risks, costs, and complexities. While the impacts of legacy systems can be quantified, technical debt is also often embedded in subtler ways across the IT ecosystem, making it hard to account for the full list of issues and risks.

Risk 140
article thumbnail

Close Brothers unlocks RPA with Document Understanding

CIO Business Intelligence

But Stephen Durnin, the company’s head of operational excellence and automation, says the 2020 Covid-19 pandemic thrust automation around unstructured input, like email and documents, into the spotlight. “We This was exacerbated by errors or missing information in documents provided by customers, leading to additional work downstream. “We

Finance 118