Caret R Package for Applied Predictive Modeling

Posted by 阿新 • Published: 2019-01-12

The R platform for statistical computing is perhaps the most popular and powerful platform for applied machine learning.

The caret package in R has been called “R’s competitive advantage”. It makes the process of training, tuning and evaluating machine learning models in R consistent, easy and even fun.

In this post you will discover the caret package in R, its key features and where to go to learn more about it.


What is the Caret R Package

Caret was built on a key philosophy in machine learning, that of the no free lunch theorem. The theorem states that, given no prior knowledge of a prediction problem, no single method can be said to be better than any other.

In the face of this theorem, the caret package takes an opinionated stance on how applied machine learning should be conducted. You cannot know in advance which algorithm or which algorithm parameters will be optimal for a given problem; this can only be discovered by empirical experimentation. This is the process that the caret package was designed to facilitate.

It does this in a few key ways:

Streamlined Model Creation: It provides a consistent interface to train a large number of the most popular third-party algorithms in R.
Evaluate the Effect of Parameters on Performance: It provides tools to grid search combinations of algorithm parameters against an objective measure to understand the effect of parameters on the model for a given problem.
Choose an Optimal Model: It provides tools to evaluate and compare models on a given problem to locate the most suitable using objective criteria.
Estimate Model Performance: It provides tools to estimate the accuracy of models on unseen data for a given problem.
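
The four points above can be sketched in a few lines of R. This is a minimal illustration rather than a recommended workflow: it assumes the caret package is installed, and the choice of the built-in iris dataset, the two algorithms (k-nearest neighbors and a decision tree), the grid values and the 10-fold cross-validation are all assumptions made for the example.

```r
# Minimal sketch: assumes caret is installed (rpart ships with R).
library(caret)

data(iris)

# Estimate performance on unseen data with 10-fold cross-validation.
control <- trainControl(method = "cv", number = 10)

# Grid search over k for k-nearest neighbors (grid values are arbitrary).
knn_grid <- expand.grid(k = c(3, 5, 7, 9))
set.seed(7)  # same seed before each train() call so the folds match
fit_knn <- train(Species ~ ., data = iris, method = "knn",
                 trControl = control, tuneGrid = knn_grid)

# The same train() interface, now with a decision tree and its cp parameter.
cart_grid <- expand.grid(cp = c(0.001, 0.01, 0.1))
set.seed(7)
fit_cart <- train(Species ~ ., data = iris, method = "rpart",
                  trControl = control, tuneGrid = cart_grid)

# Effect of the parameters on accuracy, and the best value found.
print(fit_knn$results)
print(fit_knn$bestTune)

# Compare the two models on the same resamples.
results <- resamples(list(KNN = fit_knn, CART = fit_cart))
print(summary(results))
```

Only the method name and the tuning grid change between the two calls; the resampling scheme, the fitted object and the comparison step stay the same.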


Caret Features

The caret package has many features built around the core philosophy. Some examples include:

Data Splitting: Split data into training and test datasets.
Data Pre-processing: Prepare data for modeling with transforms such as normalization and standardization.
Feature Selection: Methods to select only those attributes required to make effective predictions.
Feature Importance: Evaluate the relevance of each attribute in the dataset to the predicted attribute.
Model Tuning: Evaluate the effect of algorithm parameters on performance and locate an optimal configuration.
Parallel Processing: Tune and estimate model performance using parallel computing, such as multiple cores on a workstation, to reduce run time.
Visualization: Better understand training data, model comparison and the effect of parameters on the model with tailored visualizations.
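
A few of these features can be sketched together as follows. This is a hedged example under stated assumptions: the iris dataset, the 80/20 split, the center-and-scale transform and the decision tree model are choices made for illustration, not something prescribed by the package.

```r
# Minimal sketch: assumes caret is installed (rpart ships with R).
library(caret)

data(iris)
set.seed(7)

# Data splitting: stratified 80/20 split on the class label.
in_train <- createDataPartition(iris$Species, p = 0.8, list = FALSE)
training <- iris[in_train, ]
testing  <- iris[-in_train, ]

# Data pre-processing: estimate centering and scaling on the training
# data only, then apply the same transform to both sets.
pp      <- preProcess(training[, 1:4], method = c("center", "scale"))
train_x <- predict(pp, training[, 1:4])
test_x  <- predict(pp, testing[, 1:4])

# Fit a decision tree with 5-fold cross-validation.
fit <- train(x = train_x, y = training$Species, method = "rpart",
             trControl = trainControl(method = "cv", number = 5))

# Feature importance: relevance of each attribute to the prediction.
print(varImp(fit))

# Estimate accuracy on the held-out test set.
pred <- predict(fit, newdata = test_x)
print(confusionMatrix(pred, testing$Species))
```

Estimating the pre-processing parameters on the training rows alone keeps information from the test set out of the model, so the hold-out accuracy stays an honest estimate.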

Where Did Caret Come From

Caret is a package in R created and maintained by Max Kuhn from Pfizer. Development started in 2005, and the package was later made open source and uploaded to CRAN.

Caret is actually an acronym which stands for Classification And REgression Training (CARET).

It was initially developed out of the need to run multiple different algorithms for a given problem. R packages are created by third parties and can vary in terms of their parameters and syntax when training and generating predictions. The first versions of the caret package were designed to unify model training and prediction.

It later expanded to further standardize related common tasks such as parameter tuning and determining variable importance.

Interview with Max Kuhn

Max Kuhn is interviewed by DataScience.LA at the useR conference. In the interview, Max talks about the development of caret and his use of R. He talks about the importance of testing multiple models on a given problem and the pain of working with multiple different packages at the same time, which was the impetus for creating the package.

Demonstration of Caret by Max Kuhn

Max Kuhn demonstrates caret and talks about its development and features in this presentation. He touches again on the no free lunch theorem and the need to test multiple models. The heart of the presentation is an example of a model on some churn data. He touches on estimating model performance, algorithm tuning and much more.

Caret Resources

If you are interested in more information on the caret package, check out some of the links below.



