Blog - Data Leaders Brief

Machine Learning and the Production Gap

O'Reilly on Data

JUNE 9, 2020

The biggest problem facing machine learning today isn’t the need for better algorithms; it isn’t the need for more computing power to train models; it isn’t even the need for more skilled practitioners. It’s getting machine learning from the researcher’s laptop to production.

Machine Learning

Machine Learning Metrics Modeling IT

The unreasonable importance of data preparation

O'Reilly on Data

MARCH 24, 2020

In a world focused on buzzword-driven models and algorithms, you’d be forgiven for forgetting about the unreasonable importance of data preparation and quality: your models are only as good as the data you feed them. On the machine learning side, we are entering what Andrei Karpathy, director of AI at Tesla, dubs the Software 2.0

Machine Learning

Machine Learning Statistics Data Quality Data Collection

ChatGPT, Author of The Quixote

O'Reilly on Data

MARCH 26, 2024

TL;DR LLMs and other GenAI models can reproduce significant chunks of training data. Specific prompts seem to “unlock” training data. Generative AI Has a Plagiarism Problem ChatGPT, for example, doesn’t memorize its training data, per se. This is the basis of The New York Times lawsuit against OpenAI.

Modeling

Modeling Machine Learning Risk Advertising

Webinars

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Proposals for model vulnerability and security

O'Reilly on Data

MARCH 20, 2019

Apply fair and private models, white-hat and forensic model debugging, and common sense to protect machine learning models from malicious actors. Like many others, I’ve known for some time that machine learning models themselves could pose security risks. Data poisoning attacks.

Modeling

Modeling Machine Learning Predictive Modeling Consulting

2021 Data/AI Salary Survey

O'Reilly on Data

SEPTEMBER 15, 2021

The results gave us insight into what our subscribers are paid, where they’re located, what industries they work for, what their concerns are, and what sorts of career development opportunities they’re pursuing. The results then provide a place to start thinking about what effect the pandemic had on employment.

Machine Learning

Machine Learning Statistics Reporting Consulting

Practical Skills for The AI Product Manager

O'Reilly on Data

MAY 14, 2020

In our previous article, What You Need to Know About Product Management for AI , we discussed the need for an AI Product Manager. This role includes everything a traditional PM does, but also requires an operational understanding of machine learning software development, along with a realistic view of its capabilities and limitations.

Management

Management Experimentation B2B Machine Learning

What Are ChatGPT and Its Friends?

O'Reilly on Data

MARCH 23, 2023

What is it, how does it work, what can it do, and what are the risks of using it? What Software Are We Talking About? It’s important to understand that ChatGPT is not actually a language model. It’s a convenient user interface built around one specific language model, GPT-3.5, with specialized training.

IT

IT Modeling Testing Risk

Business Strategies for Deploying Disruptive Tech: Generative AI and ChatGPT

Rocket-Powered Data Science

FEBRUARY 15, 2023

While generative AI has been around for several years , the arrival of ChatGPT (a conversational AI tool for all business occasions, built and trained from large language models) has been like a brilliant torch brought into a dark room, illuminating many previously unseen opportunities.

Strategy

Strategy Experimentation Uncertainty Machine Learning

Generative AI in the Enterprise

O'Reilly on Data

NOVEMBER 28, 2023

And everyone has opinions about how these language models and art generation programs are going to change the nature of work, usher in the singularity, or perhaps even doom the human race. What’s the reality? We wanted to find out what people are actually doing, so in September we surveyed O’Reilly’s users.

Enterprise

Enterprise Testing Modeling Reporting

AI Product Management After Deployment

O'Reilly on Data

OCTOBER 13, 2020

Similarly, in “ Building Machine Learning Powered Applications: Going from Idea to Product ,” Emmanuel Ameisen states: “Indeed, exposing a model to users in production comes with a set of challenges that mirrors the ones that come with debugging a model.”. Proper AI product monitoring is essential to this outcome.

Management

Management Machine Learning Metrics Modeling

An AI Chat Bot Wrote This Blog Post …

DataKitchen

DECEMBER 9, 2022

ChatGPT> DataOps is a term that refers to the set of practices and tools that organizations use to improve the quality and speed of data analytics and machine learning. This can help organizations to build trust in their data-related workflows, and to drive better outcomes from their data analytics and machine learning initiatives.

Machine Learning

Machine Learning Data-driven Optimization Data Analytics

The DataOps Vendor Landscape, 2021

DataKitchen

APRIL 13, 2021

Read the complete blog below for a more detailed description of the vendors and their capabilities. We have also included vendors for the specific use cases of ModelOps, MLOps, DataGovOps and DataSecOps which apply DataOps principles to machine learning, AI, data governance, and data security operations. . Meta-Orchestration .

Testing

Testing Machine Learning Consulting Data Science

Top 10 Analytics And Business Intelligence Trends For 2020

datapine

NOVEMBER 27, 2019

Businesses of all sizes are no longer asking if they need increased access to business intelligence analytics but what is the best BI solution for their specific business. Companies are no longer wondering if data visualizations improve analyses but what is the best way to tell each data-story.

Business Intelligence

Business Intelligence Analytics Prescriptive Analytics Data Quality

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

AWS Big Data

DECEMBER 4, 2024

They’re taking data they’ve historically used for analytics or business reporting and putting it to work in machine learning (ML) models and AI-powered applications. For example, when a retail data analyst creates customer segmentation reports, those same datasets are now being used by AI teams to train recommendation engines.

Data Analytics

Data Analytics Analytics Data Lake Data Quality

Cloudera AI Inference Service Enables Easy Integration and Deployment of GenAI Into Your Production Environments

Cloudera

DECEMBER 4, 2024

Today, Artificial Intelligence (AI) and Machine Learning (ML) are more crucial than ever for organizations to turn data into a competitive advantage. To unlock the full potential of AI, however, businesses need to deploy models and AI applications at scale, in real-time, and with low latency and high throughput.

Metrics

Metrics Data Processing Machine Learning Deep Learning

The Journey to DataOps Success: Key Takeaways from Transformation Trailblazers

DataKitchen

APRIL 26, 2021

In early April 2021, DataKItchen sat down with Jonathan Hodges, VP Data Management & Analytics, at Workiva ; Chuck Smith, VP of R&D Data Strategy at GlaxoSmithKline (GSK) ; and Chris Bergh, CEO and Head Chef at DataKitchen, to find out about their enterprise DataOps transformation journey, including key successes and lessons learned.

Measurement

Measurement Metrics Data-driven Dashboards

From Machine Learning to AI: Simplifying the Path to Enterprise Intelligence

Cloudera

JANUARY 9, 2025

Thats why were moving from Cloudera Machine Learning to Cloudera AI. Why AI Matters More Than ML Machine learning (ML) is a crucial piece of the puzzle, but its just one piece. It means combining data engineering, model ops, governance, and collaboration in a single, streamlined environment.

Machine Learning

Machine Learning Enterprise Data-driven Modeling

How Can Machine Learning Change Customer Reviews?

Smart Data Collective

SEPTEMBER 24, 2021

Machine Learning is a branch of Artificial Intelligence that works by giving computers the ability to learn without being explicitly programmed. As technology advances, machine learning will have more opportunities to help businesses engage with their customers and improve the overall customer experience.

Machine Learning

Machine Learning Marketing Modeling Sales

Moving from Red AI to Green AI, Part 1: How to Save the Environment and Reduce Your Hardware Costs

DataRobot Blog

APRIL 21, 2022

Machine learning, and especially deep learning, has become increasingly more accurate in the past few years. This increase in accuracy is important to make AI applications good enough for production , but there has been an explosion in the size of these models. Why should you care?

Deep Learning

Deep Learning Machine Learning Metrics Measurement

Generative AI – Chapter 1, Page 1

Rocket-Powered Data Science

JULY 6, 2023

These AI applications are essentially deep machine learning models that are trained on hundreds of gigabytes of text and that can provide detailed, grammatically correct, and “mostly accurate” text responses to user inputs (questions, requests, or queries, which are called prompts). Guess what?

Statistics

Statistics Deep Learning Machine Learning Enterprise

NVIDIA RAPIDS in Cloudera Machine Learning

Cloudera

MAY 19, 2021

In the previous blog post in this series, we walked through the steps for leveraging Deep Learning in your Cloudera Machine Learning (CML) projects. What is RAPIDS. RAPIDS brings the power of GPU compute to standard Data Science operations, be it exploratory data analysis, feature engineering or model building.

Machine Learning

Machine Learning Data Science Data Lake Modeling

Three Emerging Analytics Products Derived from Value-driven Data Innovation and Insights Discovery in the Enterprise

Rocket-Powered Data Science

JULY 19, 2023

The ease with which such structured data can be stored, understood, indexed, searched, accessed, and incorporated into business models could explain this high percentage. What could be faster and easier than on-prem enterprise data sources? A similarly high percentage of tabular data usage among data scientists was mentioned here.

Data-driven

Data-driven Enterprise Analytics Machine Learning

10 most in-demand generative AI skills

CIO Business Intelligence

SEPTEMBER 29, 2023

If any technology has captured the collective imagination in 2023, it’s generative AI — and businesses are beginning to ramp up hiring for what in some cases are very nascent gen AI skills, turning at times to contract workers to fill gaps, pursue pilots, and round out in-house AI project teams.

Deep Learning

Deep Learning Machine Learning Consulting Modeling

7 famous analytics and AI disasters

CIO Business Intelligence

APRIL 15, 2022

And 20% of IT leaders say machine learning/artificial intelligence will drive the most IT investment. Insights gained from analytics and actions driven by machine learning algorithms can give organizations a competitive advantage, but mistakes can be costly in terms of reputation, revenue, or even lives.

Analytics

Analytics Machine Learning Insurance Testing

What Is The Difference Between Business Intelligence And Analytics?

datapine

MARCH 25, 2022

1) What Is Business Intelligence And Analytics? If someone puts you on the spot, could you tell him/her what the difference between business intelligence and analytics is? But let’s see in more detail what experts say and how can we connect and differentiate the both. What Do The Experts Say? Table of Contents.

Business Intelligence

Business Intelligence Analytics Statistics Dashboards

DP 100 Updated – Microsoft Data Science Certification

Data Science 101

JANUARY 22, 2020

I have covered my experience and what topics are on the exam. Those blog posts were for the old exam which focused on the legacy Azure Machine Learning Studio interface and general data science knowledge. Microsoft has posted a new skills document, and they are planning to add new training for DP 100.

Data Science

Data Science Machine Learning Optimization Modeling

Mastering Multi-Cloud with Cloudera: Strategic Data & AI Deployments Across Clouds

Cloudera

JANUARY 7, 2025

As enterprises navigate complex data-driven transformations, hybrid and multi-cloud models offer unmatched flexibility and resilience. Adopting hybrid and multi-cloud models provides enterprises with flexibility, cost optimization, and a way to avoid vendor lock-in. The terms hybrid and multi-cloud are often used interchangeably.

Cost-Benefit

Cost-Benefit Optimization Data-driven Strategy

Artificial Intelligence: Implications On Marketing, Analytics, And You

Occam's Razor

MARCH 30, 2017

What the heck is Artificial Intelligence? Machine Learning | Marketing. Machine Learning | Analytics. It is actually smarter than what you see above. I mean, just imagine how hard it is to do what you see above, and everything I do is actually so much easier! AI | Now | Global Maxima.

Marketing Analytics

Marketing Analytics Marketing Analytics Deep Learning

How To Succeed As a DataOps Engineer

DataKitchen

NOVEMBER 20, 2021

What makes an effective DataOps Engineer? You might ask what that means. A DataOps Engineer shepherds process flows across complex corporate structures. Organizations have changed significantly over the last number of years and even more dramatically over the previous 12 months, with the sharp increase in remote work.

Testing

Testing Machine Learning Data Warehouse Analytics

Getting Started with Machine Learning

Cloudera

FEBRUARY 11, 2022

Advances in the development and application of Machine Learning (ML) and Deep Learning (DL) algorithms, require greater care to ensure that the ethics embedded in previous rule-based systems are not lost. This blog post hopes to provide this foundational understanding. What is Machine Learning.

Machine Learning

Machine Learning Sales Data Science Deep Learning

Cloud Data Science News – Beta #4

Data Science 101

NOVEMBER 29, 2019

Luckily, Amazon has come through with a flurry of machine learning announcements. Amazon Athena and Aurora add support for ML in SQL Queries You can now invoke Machine Learning models right from your SQL Queries. Now the AutoML will provide details on all model run iterations. We will have to wait and see.

Data Science

Data Science Machine Learning Deep Learning Modeling

Evaluating Ray: Distributed Python for Massive Scalability

Domino Data Lab

FEBRUARY 12, 2020

Dean Wampler provides a distilled overview of Ray, an open source system for scaling Python systems from single machines to large clusters. and you’re wondering what it is, this post is for you. this post on the Ray project blog ?. for reinforcement learning (RL), ? Introduction. Ray: Scaling Python Applications.

Experimentation

Experimentation Modeling Data Science Machine Learning

6 Case Studies on The Benefits of Business Intelligence And Analytics

datapine

JANUARY 31, 2022

Business intelligence can also be referred to as “descriptive analytics”, as it only shows past and current state: it doesn’t say what to do, but what is or was. What Are The Benefits of Business Intelligence? In order to do this, they first defined what data was the most relevant for the company. The power of knowledge.

Business Intelligence

Business Intelligence Analytics Cost-Benefit ROI

New Applied ML Prototypes Now Available in Cloudera Machine Learning

Cloudera

NOVEMBER 17, 2021

Every AMP includes all the dependencies, industry best practices, prebuilt models, and a business-ready AI application — All deployable with a couple clicks, allowing Data Science teams to start a new project with a working example that they can then customize to their own needs in a fraction of the time.

Machine Learning

Machine Learning Visualization Data Science Dashboards

Gen AI can be the answer to your data problems — but not all of them

CIO Business Intelligence

JUNE 12, 2024

As part of this work, the foundation’s volunteers learned about the necessity of collecting reliable data to provide efficient healthcare activity. Some of the models are traditional machine learning (ML), and some, LaRovere says, are gen AI, including the new multi-modal advances. It’s not aggregated,” she says.

Modeling

Modeling Testing Cost-Benefit Metadata

What Is ‘Equity As Code,’ And How Can It Eliminate AI Bias?

DataKitchen

OCTOBER 28, 2021

What Is AI Bias? Machine learning (ML) models are computer programs that draw inferences from data — usually lots of data. One way to think of ML models is that they instantiate an algorithm (a decision-making procedure often involving math) in software and then, at relatively low cost, deploy it on a large scale.

Testing

Testing IT Manufacturing Machine Learning

8 Modeling Tools to Build Complex Algorithms

Domino Data Lab

AUGUST 9, 2021

For a model-driven enterprise, having access to the appropriate tools can mean the difference between operating at a loss with a string of late projects lingering ahead of you or exceeding productivity and profitability forecasts. What Are Modeling Tools? Importance of Modeling Tools. Types of Modeling Tools.

Modeling

Modeling Deep Learning Machine Learning Statistics

Building a Machine Learning Application With Cloudera Data Science Workbench And Operational Database, Part 3: Productionization of ML models

Cloudera

JANUARY 20, 2021

to make a classification model based off of training data stored in both Cloudera’s Operational Database (powered by Apache HBase) and Apache HDFS. Afterwards, this model is then scored and served through a simple Web Application. Machine learning is now being used to solve many real-time problems.

Machine Learning

Machine Learning Data Science Modeling Big Data

Why data observability is essential to AI governance

erwin

DECEMBER 9, 2024

When it comes to using AI and machine learning across your organization, there are many good reasons to provide your data and analytics community with an intelligent data foundation. For instance, Large Language Models (LLMs) are known to ultimately perform better when data is structured. Lets give a for instance.

Metadata

Metadata Data Quality Sales Modeling

Thinking of building your own AI agents? Don’t do it, advisors say

CIO Business Intelligence

SEPTEMBER 19, 2024

Goldcast, a software developer focused on video marketing, has experimented with a dozen open-source AI models to assist with various tasks, says Lauren Creedon, head of product at the company. The goal at Goldcast is to link all these AI models and turn them into agents that do their assigned tasks without human prompts, she says.

IT

IT Cost-Benefit Consulting Modeling

Machine Learning Product Management: Lessons Learned

Domino Data Lab

MAY 15, 2019

It focuses on his ML product management insights and lessons learned. Machine Learning Projects are Hard: Shifting from a Deterministic Process to a Probabilistic One. Over the years, I have listened to data scientists and machine learning (ML) researchers relay various pain points and challenges that impede their work.

Machine Learning

Machine Learning Management Experimentation Data Science

Adding Common Sense to Machine Learning with TensorFlow Lattice

The Unofficial Google Data Science Blog

NOVEMBER 17, 2020

by TAMAN NARAYAN & SEN ZHAO A data scientist is often in possession of domain knowledge which she cannot easily apply to the structure of the model. On the one hand, basic statistical models (e.g. On the other hand, sophisticated machine learning models are flexible in their form but not easy to control.

Machine Learning

Machine Learning Deep Learning Modeling Interactive

Moving from Red AI to Green AI, Part 2: A Practitioner’s Guide to Efficient Machine Learning

DataRobot Blog

APRIL 22, 2022

In our previous post , we talked about how red AI means adding computational power to “buy” more accurate models in machine learning , and especially in deep learning. We covered different ways of measuring model efficiency and showed ways to visualize this and select models based on it.

Machine Learning

Machine Learning Measurement Deep Learning Manufacturing

Demystifying Multimodal LLMs

Dataiku

MARCH 25, 2024

But instead, a machine seamlessly identifies the scene and its location, provides a detailed description, and even suggests nearby attractions. This scenario is not science fiction but a glimpse into the capabilities of Multimodal Large Language Models (M-LLMs), where the convergence of various modalities extends the landscape of AI.

Visualization

Visualization Modeling Experimentation Testing

Machine Learning and the Production Gap

The unreasonable importance of data preparation

Webinars

Trending Sources

ChatGPT, Author of The Quixote

Webinars

Proposals for model vulnerability and security

2021 Data/AI Salary Survey

Practical Skills for The AI Product Manager

What Are ChatGPT and Its Friends?

Business Strategies for Deploying Disruptive Tech: Generative AI and ChatGPT

Generative AI in the Enterprise

AI Product Management After Deployment

An AI Chat Bot Wrote This Blog Post …

The DataOps Vendor Landscape, 2021

Top 10 Analytics And Business Intelligence Trends For 2020

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

Cloudera AI Inference Service Enables Easy Integration and Deployment of GenAI Into Your Production Environments

The Journey to DataOps Success: Key Takeaways from Transformation Trailblazers

From Machine Learning to AI: Simplifying the Path to Enterprise Intelligence

How Can Machine Learning Change Customer Reviews?

Moving from Red AI to Green AI, Part 1: How to Save the Environment and Reduce Your Hardware Costs

Generative AI – Chapter 1, Page 1

NVIDIA RAPIDS in Cloudera Machine Learning

Three Emerging Analytics Products Derived from Value-driven Data Innovation and Insights Discovery in the Enterprise

10 most in-demand generative AI skills

7 famous analytics and AI disasters

What Is The Difference Between Business Intelligence And Analytics?

DP 100 Updated – Microsoft Data Science Certification

Mastering Multi-Cloud with Cloudera: Strategic Data & AI Deployments Across Clouds

Artificial Intelligence: Implications On Marketing, Analytics, And You

How To Succeed As a DataOps Engineer

Getting Started with Machine Learning

Cloud Data Science News – Beta #4

Evaluating Ray: Distributed Python for Massive Scalability

6 Case Studies on The Benefits of Business Intelligence And Analytics

New Applied ML Prototypes Now Available in Cloudera Machine Learning

Gen AI can be the answer to your data problems — but not all of them

What Is ‘Equity As Code,’ And How Can It Eliminate AI Bias?

8 Modeling Tools to Build Complex Algorithms

Building a Machine Learning Application With Cloudera Data Science Workbench And Operational Database, Part 3: Productionization of ML models

Why data observability is essential to AI governance

Thinking of building your own AI agents? Don’t do it, advisors say

Machine Learning Product Management: Lessons Learned

Adding Common Sense to Machine Learning with TensorFlow Lattice

Moving from Red AI to Green AI, Part 2: A Practitioner’s Guide to Efficient Machine Learning

Demystifying Multimodal LLMs

Stay Connected