Remove what-is-machine-learning-model-training
article thumbnail

Machine Learning and the Production Gap

O'Reilly on Data

The biggest problem facing machine learning today isn’t the need for better algorithms; it isn’t the need for more computing power to train models; it isn’t even the need for more skilled practitioners. It’s getting machine learning from the researcher’s laptop to production.

article thumbnail

The unreasonable importance of data preparation

O'Reilly on Data

In a world focused on buzzword-driven models and algorithms, you’d be forgiven for forgetting about the unreasonable importance of data preparation and quality: your models are only as good as the data you feed them. On the machine learning side, we are entering what Andrei Karpathy, director of AI at Tesla, dubs the Software 2.0

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Proposals for model vulnerability and security

O'Reilly on Data

Apply fair and private models, white-hat and forensic model debugging, and common sense to protect machine learning models from malicious actors. Like many others, I’ve known for some time that machine learning models themselves could pose security risks. Data poisoning attacks.

Modeling 278
article thumbnail

2021 Data/AI Salary Survey

O'Reilly on Data

The results gave us insight into what our subscribers are paid, where they’re located, what industries they work for, what their concerns are, and what sorts of career development opportunities they’re pursuing. The results then provide a place to start thinking about what effect the pandemic had on employment.

article thumbnail

ChatGPT, Author of The Quixote

O'Reilly on Data

TL;DR LLMs and other GenAI models can reproduce significant chunks of training data. Specific prompts seem to “unlock” training data. Generative AI Has a Plagiarism Problem ChatGPT, for example, doesn’t memorize its training data, per se. This is the basis of The New York Times lawsuit against OpenAI.

Modeling 317
article thumbnail

Practical Skills for The AI Product Manager

O'Reilly on Data

In our previous article, What You Need to Know About Product Management for AI , we discussed the need for an AI Product Manager. This role includes everything a traditional PM does, but also requires an operational understanding of machine learning software development, along with a realistic view of its capabilities and limitations.

article thumbnail

What Are ChatGPT and Its Friends?

O'Reilly on Data

What is it, how does it work, what can it do, and what are the risks of using it? What Software Are We Talking About? It’s important to understand that ChatGPT is not actually a language model. It’s a convenient user interface built around one specific language model, GPT-3.5, with specialized training.

IT 346