Article, Predictive Modeling and Statistics

The Science of T20 Cricket: Decoding Player Performance with Predictive Modeling

Analytics Vidhya

JUNE 1, 2023

With franchise leagues like IPL and BBL, teams rely on statistical models and tools for competitive edge. This article explores how data analytics optimizes strategies by leveraging player performances and opposition weaknesses. Python programming predicts player performances, aiding team selections and game tactics.

Predictive Modeling

Predictive Modeling Modeling Statistics Optimization

Machine Learning Paradigms with Example

Analytics Vidhya

JULY 25, 2022

This article was published as a part of the Data Science Blogathon. Machine Learning is the method of teaching computer programs to do a specific task accurately (essentially a prediction) by training a predictive model using various statistical algorithms leveraging data. Source: [link] For […].

Machine Learning

Machine Learning Predictive Modeling Statistics Data Science

Bivariate Feature Analysis in Python

Analytics Vidhya

MARCH 22, 2022

This article was published as a part of the Data Science Blogathon. Introduction Feature analysis is an important step in building any predictive model. In this article, we will look into a very simple feature analysis technique that can be used in cases such as […].

Predictive Modeling

Predictive Modeling Data Science Publishing Modeling

Webinars

How to Achieve High-Accuracy Results When Using LLMs

MORE WEBINARS

Quick Guide to Evaluation Metrics for Supervised and Unsupervised Machine Learning

Analytics Vidhya

OCTOBER 12, 2020

This article was published as a part of the Data Science Blogathon. Introduction Machine learning is about building a predictive model using historical data. The post Quick Guide to Evaluation Metrics for Supervised and Unsupervised Machine Learning appeared first on Analytics Vidhya.

Machine Learning

Machine Learning Metrics Predictive Modeling Data Science

STANDARDIZED VS UNSTANDARDIZED REGRESSION COEFFICIENT

Analytics Vidhya

MARCH 20, 2021

ArticleVideo Book This article was published as a part of the Data Science Blogathon. Introduction Some time back, I was making the predictive model. The post STANDARDIZED VS UNSTANDARDIZED REGRESSION COEFFICIENT appeared first on Analytics Vidhya.

Predictive Modeling

Predictive Modeling Data Science Publishing Modeling

Beyond the hype: Do you really need an LLM for your data?

CIO Business Intelligence

FEBRUARY 6, 2025

This article reflects some of what Ive learned. The hype around large language models (LLMs) is undeniable. Even basic predictive modeling can be done with lightweight machine learning in Python or R. In life sciences, simple statistical software can analyze patient data. You get the picture.

Unstructured Data

Unstructured Data Manufacturing Data Governance Sales

The quest for high-quality data

O'Reilly on Data

JUNE 18, 2019

Moreover, the domain knowledge, which often is not encoded in the data (nor fully documented), is an integral part of this data (see this article from Forbes). In this post, we shed some light on various efforts toward generating data for machine learning (ML) models. See this article on data integration status for details.

Machine Learning

Machine Learning Data Quality Statistics Modeling

Top 5 Statistical Techniques in Python

Sisense

SEPTEMBER 25, 2020

A data scientist must be skilled in many arts: math and statistics, computer science, and domain knowledge. Statistics and programming go hand in hand. Mastering statistical techniques and knowing how to implement them via a programming language are essential building blocks for advanced analytics. Linear regression.

Statistics

Statistics Predictive Modeling Modeling Machine Learning

What to Do When AI Fails

O'Reilly on Data

MAY 18, 2020

This article answers these questions, based on our combined experience as both a lawyer and a data scientist responding to cybersecurity incidents, crafting legal frameworks to manage the risks of AI, and building sophisticated interpretable models to mitigate risk. Because statistics: Last is the inherently probabilistic nature of ML.

Risk

Risk Modeling Data Processing Reporting

Proposals for model vulnerability and security

O'Reilly on Data

MARCH 20, 2019

The objective here is to brainstorm on potential security vulnerabilities and defenses in the context of popular, traditional predictive modeling systems, such as linear and tree-based models trained on static data sets. If an attacker can receive many predictions from your model API or other endpoint (website, app, etc.),

Modeling

Modeling Machine Learning Predictive Modeling Consulting

Data Insights Assure Quality Data and Confident Decisions!

Smarten

NOVEMBER 26, 2024

The business can harness the power of statistics and machine learning to uncover those crucial nuggets of information that drive effective decision, and to improve the overall quality of data. This helps you select the predictors that have the greatest impact, making it easier to create an effective predictive model.

Machine Learning

Machine Learning Data Quality Predictive Modeling Metadata

What Is The Difference Between Business Intelligence And Analytics?

datapine

MARCH 25, 2022

While some experts try to underline that BA focuses, also, on predictive modeling and advanced statistics to evaluate what will happen in the future, BI is more focused on the present moment of data, making the decision based on current insights. Your Chance: Want to extract the maximum potential out of your data?

Business Intelligence

Business Intelligence Analytics Statistics Dashboards

Why you should care about debugging machine learning models

O'Reilly on Data

DECEMBER 12, 2019

Because all ML models make mistakes, everyone who cares about ML should also care about model debugging. [1] This article is meant to be a short, relatively technical primer on what model debugging is, what you should know about it, and the basics of how to debug models in practice.

Machine Learning

Machine Learning Modeling Testing Risk Management

3 Key Components of the Interdisciplinary Field of Data Science

Domino Data Lab

JULY 28, 2021

Through a marriage of traditional statistics with fast-paced, code-first computer science doctrine and business acumen, data science teams can solve problems with more accuracy and precision than ever before, especially when combined with soft skills in creativity and communication. Math and Statistics Expertise.

Data Science

Data Science Statistics Predictive Analytics Recreation/Entertainment

Defining data science in 2018

Data Science and Beyond

JULY 22, 2018

Two years later, I published a post on my then-favourite definition of data science , as the intersection between software engineering and statistics. This article is a short summary of my understanding of the definition of data science in 2018. Numerous articles have been published on the meaning of data science in the past six years.

Data Science

Data Science Machine Learning Statistics Predictive Modeling

3 Things Citizen Data Scientists Need in Predictive Analytics!

Smarten

MAY 31, 2022

The technology research firm, Gartner has predicted that, ‘predictive and prescriptive analytics will attract 40% of net new enterprise investment in the overall business intelligence and analytics market.’ It is meant to identify crucial relationships and opportunities and risks and help the organization to accurately predict: Growth.

Predictive Analytics

Predictive Analytics Analytics Predictive Modeling Prescriptive Analytics

InfoTribes, Reality Brokers

O'Reilly on Data

MARCH 23, 2021

On top of this, pre-existing societal biases are being reinforced and promulgated at previously unheard of scales as we increasingly integrate machine learning models into our daily lives. Put simply, we are reduced to the inputs of an algorithm.

Internet Publishing and Broadcasting

Internet Publishing and Broadcasting Broadcasting Data-driven Publishing

AI In Analytics: Today and Tomorrow!

Smarten

APRIL 19, 2024

In this article, we will discuss the current state of AI in analytics, as well as the future of this burgeoning industry and how it can be applied to analytics to simplify and clarify results and to make analytics easier for businesses and business users to leverage.

Analytics

Analytics Predictive Modeling KPI Machine Learning

R vs Python: What’s the Best Language for Natural Language Processing?

Sisense

APRIL 10, 2020

R is a tool built by statisticians mainly for mathematics, statistics, research, and data analysis. We’ll actually do this later in this article. These support a wide array of uses, such as data analysis, manipulation, visualizations, and machine learning (ML) modeling. y_pred=predict(xb, y_val) val-auc=auc(y_pred,y_val).

Deep Learning

Deep Learning Data Science Machine Learning Visualization

Best Practice of Using Data Science Competitions Skills to Improve Business Value

DataRobot Blog

JULY 28, 2022

This article presents a case study of how DataRobot was able to achieve high accuracy and low cost by actually using techniques learned through Data Science Competitions in the process of solving a DataRobot customer’s problem. I thought of the solutions of the top team in a Data Science Competition for LANL Earthquake Prediction.

Data Science

Data Science Machine Learning Statistics Modeling

What is the Independent Samples T Test Method of Analysis and How Can it Benefit an Organization?

Smarten

JUNE 29, 2018

This article focuses on the Independent Samples T Test technique of Hypothesis testing. The independent sample t-test is a statistical method of hypothesis testing that determines whether there is a statistically significant difference between the means of two independent samples. About Smarten.

Testing

Testing Statistics IT Business Intelligence

Data science vs. machine learning: What’s the difference?

IBM Big Data Hub

JULY 6, 2023

Areas making up the data science field include mining, statistics, data analytics, data modeling, machine learning modeling and programming. Ultimately, data science is used in defining new business problems that machine learning techniques and statistical analysis can then help solve.

Machine Learning

Machine Learning Data Science Statistics Deep Learning

What Are Data Trends and Patterns, and How Do They Impact Business Decisions?

Smarten

JUNE 20, 2018

In this article, we will focus on the identification and exploration of data patterns and the trends that data reveals. In prediction, the objective is to “model” all the components to some trend patterns to the point that the only component that remains unexplained is the random component. Stationary/Stationarity.

Forecasting

Forecasting Statistics Business Intelligence Predictive Modeling

What is Karl Pearson Correlation Analysis and How Can it be Used for Enterprise Analysis Needs?

Smarten

JUNE 29, 2018

This article explains the Karl Pearson Correlation method of analysis, and how it can be applied in business. Correlation is a statistical measure that indicates the extent to which two variables fluctuate together. What is the Karl Pearson Correlation Analytical Technique?

Enterprise

Enterprise Statistics IT Business Intelligence

What is Simple Linear Regression and How Can an Enterprise Use this Technique to Analyze Data?

Smarten

JUNE 29, 2018

This article describes the Simple Linear Regression method of analysis. Simple Linear Regression is a statistical technique that attempts to explore the relationship between one independent variable (X) and one dependent variable (Y). What is Simple Linear Regression?

Enterprise

Enterprise Statistics Sales Business Intelligence

Credit Card Fraud Detection using XGBoost, SMOTE, and threshold moving

Domino Data Lab

APRIL 21, 2021

In this article, we’ll discuss the challenge organizations face around fraud detection, how machine learning can be used to identify and spot anomalies that the human eye might not catch. In contrast, the decision tree classifies observations based on attribute splits learned from the statistical properties of the training data.

Statistics

Statistics Machine Learning Modeling Metrics

What is the Chi Square Test of Association and How Can it be Used for Analysis?

Smarten

JUNE 26, 2018

This article describes chi square test of association and hypothesis testing. It is used to determine whether there is a statistically significant association between the two categorical variables. What is the Chi Square Test of Association Method of Hypothesis Testing?

Testing

Testing Statistics IT Business Intelligence

What is the Paired Sample T Test and How is it Beneficial to Business Analysis?

Smarten

JUNE 29, 2018

This article discusses the Paired Sample T Test method of hypothesis testing and analysis. At 95% confidence level (5% chance of error): As p-value = 0.041 which is less than 0.05, there is a statistically significant difference between means of pre and post sample values. What is the Paired Sample T Test?

Business Analysis

Business Analysis Testing Statistics Advertising

What is Multiple Linear Regression and How Can it be Helpful for Business Analysis?

Smarten

JUNE 29, 2018

This article describes the analytical technique of multiple linear regression. Multiple Linear Regression is a statistical technique that is designed to explore the relationship between two or more variables (X, and Y). What is Multiple Linear Regression Analysis?

Business Analysis

Business Analysis IT Sales Statistics

What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?

Smarten

JUNE 29, 2018

This article looks at the ARIMAX Forecasting method of analysis and how it can be used for business analysis. An Autoregressive Integrated Moving Average with Explanatory Variable (ARIMAX) model can be viewed as a multiple regression model with one or more autoregressive (AR) terms and/or one or more moving average (MA) terms.

Forecasting

Forecasting Enterprise IT Business Intelligence

What is Spearman’s Rank Correlation and How is it Useful for Business Analysis?

Smarten

JUNE 29, 2018

This article describes the Spearman’s Rank Correlation and how it is used for enterprise analysis. Correlation is a statistical measure that indicates the extent to which two variables fluctuate together A positive correlation indicates the extent to which those variables increase or decrease in parallel.

Business Analysis

Business Analysis IT Measurement Statistics

What is ARIMA Forecasting and How Can it Be Used for Enterprise Analysis?

Smarten

JUNE 26, 2018

This article provides a brief explanation of the ARIMA method of analytical forecasting. Autoregressive Integrated Moving Average (ARIMA) predicts future values of a time series using a linear combination of its past values and a series of errors. ’ The ARIMA model is suggested for short term forecasting.

Forecasting

Forecasting Enterprise IT Sales

What Are Simple Random Sampling and Stratified Random Sampling Analytical Techniques?

Smarten

JUNE 29, 2018

This article discusses the analytical technique known as Sampling and provides a brief explanation of two types of sampling analysis, and how each of these methods is applied. What is Sampling Analysis? A random sample from each of these subgroups is taken in proportion to the subgroup size relative to the population size.

Analytics

Analytics Statistics Business Intelligence Predictive Modeling

Data Science at The New York Times

Domino Data Lab

JULY 9, 2019

Diving into examples of building and deploying ML models at The New York Times including the descriptive topic modeling-oriented Readerscope (audience insights engine), a prediction model regarding who was likely to subscribe/cancel their subscription, as well as prescriptive example via recommendations of highly curated editorial content.

Data Science

Data Science Machine Learning Advertising Modeling

Augmented Analytics Algorithms and Techniques: Learning for Citizen Data Scientists

Smarten

JULY 10, 2018

This article summarizes our recent article series on the definition, meaning and use of the various algorithms and analytical methods and techniques used in predictive analytics for business users, and in augmented data preparation and augmented data discovery tools. Use Case(s): Weather Forecasting, Fraud Analysis and more.

Analytics

Analytics Forecasting Statistics Sales

What is Outlier Analysis and How Can It Improve Analysis?

Smarten

JUNE 27, 2018

This article presents a brief explanation of Outliers, and how this type of analysis is used. All of these tools are designed for business users with average skills and require no special skills or knowledge of statistical analysis or support from IT or data scientists. What is Outlier Analysis?

IT

IT Business Intelligence Predictive Modeling Statistics

What is Hierarchical Clustering and How Can an Organization Use it to Analyze Data?

Smarten

JUNE 28, 2018

This article discusses the analytical method of Hierarchical Clustering and how it can be used within an organization for analytical purposes. All of these tools are designed for business users with average skills and require no special skills or knowledge of statistical analysis or support from IT or data scientists.

IT

IT Business Intelligence Predictive Modeling Statistics

What is the KMeans Clustering Algorithm and How Does an Enterprise Use it to Analyze Data?

Smarten

JUNE 25, 2018

This article provides a brief explanation of the KMeans Clustering algorithm. Smarten Augmented Analytics tools include plug n’ play predictive analytics , assisted predictive modeling , smart data visualization , self-serve data preparation and clickless analytics for search analytics with natural language processing (NLP).

Enterprise

Enterprise IT Business Intelligence Predictive Modeling

What is the Multinomial-Logistic Regression Classification Algorithm and How Does One Use it for Analysis?

Smarten

JUNE 26, 2018

This article provides a brief definition of the multinomial-logistic regression classification algorithm and its uses and benefits. All of these tools are designed for business users with average skills and require no special skills or knowledge of statistical analysis or support from IT or data scientists.

IT

IT Business Intelligence Predictive Modeling Statistics

What is FP Growth Analysis and How Can a Business Use Frequent Pattern Mining to Analyze Data?

Smarten

JUNE 26, 2018

This article provides a brief explanation of the FP Growth technique of Frequent Pattern Mining. All of these tools are designed for business users with average skills and require no special skills or knowledge of statistical analysis or support from IT or data scientists. What is the FP Growth Algorithm?

Business Intelligence

Business Intelligence Sales Predictive Modeling Statistics

What is Binary Logistic Regression Classification and How is it Used in Analysis?

Smarten

JUNE 29, 2018

In this article, we will discuss the Binary Logistic Regression Classification method of analysis, and how it can be used in business. All of these tools are designed for business users with average skills and require no special skills or knowledge of statistical analysis or support from IT or data scientists.

IT

IT Business Intelligence Predictive Modeling Statistics

What is SVM Classification Analysis and How Can It Benefit Business Analytics?

Smarten

JUNE 27, 2018

This article provides a brief explanation of the SVM Classification method of analytics. All of these tools are designed for business users with average skills and require no special skills or knowledge of statistical analysis or support from IT or data scientists. What is SVM Classification Analysis?

Business Analytics

Business Analytics Analytics IT Business Intelligence

Techniques for Collecting, Prepping, and Plotting Data: Predicting Social Media-Influence in the NBA

Domino Data Lab

OCTOBER 23, 2019

This article provides insight on the mindset, approach, and tools to consider when solving a real-world ML problem. As a result, there has been a recent explosion in individual statistics that try to measure a player’s impact. 05) in predicting changes in attendance. The lower the RMSE, the better the prediction.

Statistics

Statistics Machine Learning Testing Modeling

What is KNN Classification and How Can This Analysis Help an Enterprise?

Smarten

JUNE 29, 2018

In this article, we will discuss the KNN Classification method of analysis. Smarten Augmented Analytics tools include plug n’ play predictive analytics , assisted predictive modeling , smart data visualization , self-serve data preparation and clickless analytics for search analytics with natural language processing (NLP).

Enterprise

Enterprise Business Intelligence Predictive Modeling Statistics

The Science of T20 Cricket: Decoding Player Performance with Predictive Modeling

Machine Learning Paradigms with Example

Webinars

Trending Sources

Bivariate Feature Analysis in Python

Webinars

Quick Guide to Evaluation Metrics for Supervised and Unsupervised Machine Learning

STANDARDIZED VS UNSTANDARDIZED REGRESSION COEFFICIENT

Beyond the hype: Do you really need an LLM for your data?

The quest for high-quality data

Top 5 Statistical Techniques in Python

What to Do When AI Fails

Proposals for model vulnerability and security

Data Insights Assure Quality Data and Confident Decisions!

What Is The Difference Between Business Intelligence And Analytics?

Why you should care about debugging machine learning models

3 Key Components of the Interdisciplinary Field of Data Science

Defining data science in 2018

3 Things Citizen Data Scientists Need in Predictive Analytics!

InfoTribes, Reality Brokers

AI In Analytics: Today and Tomorrow!

R vs Python: What’s the Best Language for Natural Language Processing?

Best Practice of Using Data Science Competitions Skills to Improve Business Value

What is the Independent Samples T Test Method of Analysis and How Can it Benefit an Organization?

Data science vs. machine learning: What’s the difference?

What Are Data Trends and Patterns, and How Do They Impact Business Decisions?

What is Karl Pearson Correlation Analysis and How Can it be Used for Enterprise Analysis Needs?

What is Simple Linear Regression and How Can an Enterprise Use this Technique to Analyze Data?

Credit Card Fraud Detection using XGBoost, SMOTE, and threshold moving

What is the Chi Square Test of Association and How Can it be Used for Analysis?

What is the Paired Sample T Test and How is it Beneficial to Business Analysis?

What is Multiple Linear Regression and How Can it be Helpful for Business Analysis?

What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?

What is Spearman’s Rank Correlation and How is it Useful for Business Analysis?

What is ARIMA Forecasting and How Can it Be Used for Enterprise Analysis?

What Are Simple Random Sampling and Stratified Random Sampling Analytical Techniques?

Data Science at The New York Times

Augmented Analytics Algorithms and Techniques: Learning for Citizen Data Scientists

What is Outlier Analysis and How Can It Improve Analysis?

What is Hierarchical Clustering and How Can an Organization Use it to Analyze Data?

What is the KMeans Clustering Algorithm and How Does an Enterprise Use it to Analyze Data?

What is the Multinomial-Logistic Regression Classification Algorithm and How Does One Use it for Analysis?

What is FP Growth Analysis and How Can a Business Use Frequent Pattern Mining to Analyze Data?

What is Binary Logistic Regression Classification and How is it Used in Analysis?

What is SVM Classification Analysis and How Can It Benefit Business Analytics?

Techniques for Collecting, Prepping, and Plotting Data: Predicting Social Media-Influence in the NBA

What is KNN Classification and How Can This Analysis Help an Enterprise?

Stay Connected