Preliminary Thoughts on the White House Executive Order on AI

O'Reilly on Data

adversarial testing to determine a model’s flaws and weak points), and not a wider range of information that would help to address many of the other concerns outlined in the EO. Methods by which the AI provider manages and mitigates risks identified via Red Teaming, including their effectiveness.

Reclaiming the stories that algorithms tell

O'Reilly on Data

Using the new scores, Apgar and her colleagues proved that many infants who initially seemed lifeless could be revived, with success or failure in each case measured by the difference between an Apgar score at one minute after birth, and a second score taken at five minutes. Books, in turn, get matching scores to reflect their difficulty.

Bringing an AI Product to Market

O'Reilly on Data

Product Managers are responsible for the successful development, testing, release, and adoption of a product, and for leading the team that implements those milestones. When a measure becomes a target, it ceases to be a good measure (Goodhart’s Law). The Core Responsibilities of the AI Product Manager.

CIOs must reassess cloud concentration risk post-CrowdStrike

CIO Business Intelligence

It also highlights the downsides of concentration risk. What is concentration risk? Looking to the future, IT leaders must bring stronger focus on “concentration risk” and how these supply chain risks can be better managed. Unfortunately, the complexity of multiple vendors can lead to incidents and new risks.

Report: AI giants grow impatient with UK safety tests

CIO Business Intelligence

Key AI companies have told the UK government to speed up its safety testing for their systems, raising questions about future government initiatives that too may hinge on technology providers opening up generative AI models to tests before new releases hit the public.

You Can’t Regulate What You Don’t Understand

O'Reilly on Data

“Should we risk loss of control of our civilization?” If we want prosocial outcomes, we need to design and report on the metrics that explicitly aim for those outcomes and measure the extent to which they have been achieved. And they are stress testing and “red teaming” them to uncover vulnerabilities.

Need a security road map? Ditch the ad hoc measurement

CIO Business Intelligence

CISOs can only know the performance and maturity of their security program by actively measuring it themselves; after all, to measure is to know. However, CISOs aren’t typically measuring their security program proactively or methodically to understand their current security program.