1. 程式人生 > >Penalized Regression in R

Penalized Regression in R

In this post you will discover 3 recipes for penalized regression for the R platform.

You can copy and paste the recipes in this post to make a jump-start on your own problem or to learn and practice with linear regression in R.

Penalized Regression

Penalized Regression
Photo by Bay Area Bias, some rights reserved

Each example in this post uses the longley dataset provided in the datasets package that comes with R. The longley dataset describes 7 economic variables observed from 1947 to 1962 used to predict the number of people employed yearly.

Ridge Regression

Ridge Regression creates a linear regression model that is penalized with the L2-norm which is the sum of the squared coefficients. This has the effect of shrinking the coefficient values (and the complexity of the model) allowing some coefficients with minor contribution to the response to get close to zero.

Ridge Regression in R R
123456789101112131415 # load the packagelibrary(glmnet)# load datadata(longley)x<-as.matrix(longley[,1:6])y<-as.matrix(longley[,7])# fit modelfit<-glmnet(x,y,family="gaussian",alpha=0,lambda=0.001)# summarize the fitsummary(fit)# make predictionspredictions<-predict(fit,x,type="link")# summarize accuracymse<-mean((y-predictions)^2)print(mse)

Learn about the glmnet function in the glmnet package.

Need more Help with R for Machine Learning?

Take my free 14-day email course and discover how to use R on your project (with sample code).

Click to sign-up and also get a free PDF Ebook version of the course.

Least Absolute Shrinkage and Selection Operator

Least Absolute Shrinkage and Selection Operator (LASSO) creates a regression model that is penalized with the L1-norm which is the sum of the absolute coefficients. This has the effect of shrinking coefficient values (and the complexity of the model), allowing some with a minor effect to the response to become zero.

Least Absolute Shrinkage and Selection Operator (LASSO) in R R
1234567891011121314151617 # load the packagelibrary(lars)# load datadata(longley)x<-as.matrix(longley[,1:6])y<-as.matrix(longley[,7])# fit modelfit<-lars(x,y,type="lasso")# summarize the fitsummary(fit)# select a step with a minimum errorbest_step<-fit$df[which.min(fit$RSS)]# make predictionspredictions<-predict(fit,x,s=best_step,type="fit")$fit# summarize accuracymse<-mean((y-predictions)^2)print(mse)

Learn about the lars function in the lars package.

Elastic Net

Elastic Net creates a regression model that is penalized with both the L1-norm and L2-norm. This has the effect of effectively shrinking coefficients (as in ridge regression) and setting some coefficients to zero (as in LASSO).

Elastic Net in R R
123456789101112131415 # load the packagelibrary(glmnet)# load datadata(longley)x<-as.matrix(longley[,1:6])y<-as.matrix(longley[,7])# fit modelfit<-glmnet(x,y,family="gaussian",alpha=0.5,lambda=0.001)# summarize the fitsummary(fit)# make predictionspredictions<-predict(fit,x,type="link")# summarize accuracymse<-mean((y-predictions)^2)print(mse)

Learn about the glmnet function in the glmnet package.

Summary

In this post you discovered 3 recipes for penalized regression in R.

Penalization is a powerful method for attribute selection and improving the accuracy of predictive models. For more information see Chapter 6 of Applied Predictive Modeling by Kuhn and Johnson that provides an excellent introduction to linear regression with R for beginners.


Frustrated With Your Progress In R Machine Learning?

Master Machine Learning With R

Develop Your Own Models in Minutes

…with just a few lines of R code

Covers self-study tutorials and end-to-end projects like:
Loading data, visualization, build models, tuning, and much more…

Finally Bring Machine Learning To
Your Own Projects

Skip the Academics. Just Results.


相關推薦

Penalized Regression in R

Tweet Share Share Google Plus In this post you will discover 3 recipes for penalized regression

Logistic regression in R

Datacamp Learning Logistic regression in R Building simple logistic regression models The donors dataset contains 93,462 examples of peop

Linear Regression in R with Decision Trees

Tweet Share Share Google Plus In this post, you will discover 8 recipes for non-linear regressio

Linear Regression in R

Tweet Share Share Google Plus In this post you will discover 4 recipes for non-linear regression

OLS and Logistic Regression Models in R

OLS and Logistic Regression Models in RWe use linear models primarily to analyse cross-sectional data; i.e. data collected at one specific point in time ac

intersect for multiple vectors in R

con span osi library tar other and pos intersect Say you have a <- c(1,3,5,7,9) b <- c(3,6,8,9,10) c <- c(2,3,4,5,7,9) A stra

print,cat打印格式及字符串引號格式,去掉字符串空格 in R

with letters out logs true right int 函數 cnblogs print 函數的打印格式: ##no quote print out > x <- letters[1:5] > print(x,quote=F,);pri

Match function in R

nom -i strong UNC https tps true mat amp Examples: print(match(5, c(1,2,9,5,3,6,7,4,5)))[1] 4 5 %in% c(1,2,9,5,3,6,7,4,5)[1] T

Beautiful and Powerful Correlation Tables in R

select auto edit for open sat local plot fig Another correlation function?! Yes, the correlation function from the psycho package. devt

Customer Lifetime Value in R筆記

原文:http://www.keetan.me/clv CLVcustomer lifetime value是顧客對公司的價值估計,客戶在與公司整個接觸時期的未來貢獻現金流淨值,利用到的模型為Pareto/NBD Buy Til'You Die;重點是預測顧客的未來價值 Four T

Linear and Logistic Regression in TensorFlow

Linear and Logistic Regression in TensorFlow   Graphs and sessions TF Ops: constants, variables, functions TensorBoard Lazy loading Linear Re

How to write tidy SQL queries in R

How to write tidy SQL queries in RMost of us have to interact with databases nowadays, and SQL is by far the most common language used. However, working wi

Linear Regression in the Wild

In one of my job interviews for a data scientist position, I was given a home assignment I'd like to share with you. The interviewer sent me a CSV file con

Cross-Correlation of Currency Pairs In R (ccf)

Cross-Correlation of Currency Pairs In R (ccf)When working with a time series, one important thing we wish to determine is whether one series “causes” chan

Building a Logistic Regression in Python

Building a Logistic Regression in PythonSuppose you are given the scores of two exams for various applicants and the objective is to classify the applicant

Simple and Multiple Linear Regression in Python

Linear Regression in PythonThere are two main ways to perform linear regression in Python — with Statsmodels and scikit-learn. It is also possible to use t

Linear Regression in 6 lines of Python

Today to perform Linear Regression quickly, we will be using the library scikit-learn. If you don’t have it already you can install it using pip:pip instal

style app in R using Shiny+Flexdashboard

Managing Human Intelligence TasksThis workflow has served us well, but we quickly developed a backlog of messages that did not fall into one of our existin

Simple Linear Regression in Python

Simple Linear Regression in Python“If you can’t explain it simply, you don’t understand it well enough.”Simple linear regression is a statistical method th

Statistic summary in R

summary the statistics of data visualize the statistics (boxplot and histogram) view the data library("AzureML") ws <- workspace(