38Regression Shrinkage and Selection via the Lasso

整理文档很辛苦,赏杯茶钱您下走!

免费阅读已结束,点击下载阅读编辑剩下 ...

阅读已结束,您可以下载文档离线阅读编辑

资源描述

RegressionShrinkageandSelectionviatheLassoAuthor(s):RobertTibshiraniSource:JournaloftheRoyalStatisticalSociety.SeriesB(Methodological),Vol.58,No.1(1996),pp.267-288Publishedby:BlackwellPublishingfortheRoyalStatisticalSocietyStableURL::05/01/201102:51YouruseoftheJSTORarchiveindicatesyouracceptanceofJSTOR'sTermsandConditionsofUse,availableat.://=black..EachcopyofanypartofaJSTORtransmissionmustcontainthesamecopyrightnoticethatappearsonthescreenorprintedpageofsuchtransmission.JSTORisanot-for-profitservicethathelpsscholars,researchers,andstudentsdiscover,use,andbuilduponawiderangeofcontentinatrusteddigitalarchive.Weuseinformationtechnologyandtoolstoincreaseproductivityandfacilitatenewformsofscholarship.FormoreinformationaboutJSTOR,pleasecontactsupport@jstor.org.BlackwellPublishingandRoyalStatisticalSocietyarecollaboratingwithJSTORtodigitize,preserveandextendaccesstoJournaloftheRoyalStatisticalSociety.SeriesB(Methodological).(1996)58,No.1,pp.267-288RegressionShrinkageandSelectionviatheLassoByROBERTTIBSHIRANItUniversityofToronto,Canada[ReceivedJanuary1994.RevisedJanuary1995]SUMMARYWeproposeanewmethodforestimationinlinearmodels.The'lasso'minimizestheresidualsumofsquaressubjecttothesumoftheabsolutevalueofthecoefficientsbeinglessthanaconstant.Becauseofthenatureofthisconstraintittendstoproducesomecoefficientsthatareexactly0andhencegivesinterpretablemodels.Oursimulationstudiessuggestthatthelassoenjoyssomeofthefavourablepropertiesofbothsubsetselectionandridgeregression.Itproducesinterpretablemodelslikesubsetselectionandexhibitsthestabilityofridgeregression.ThereisalsoaninterestingrelationshipwithrecentworkinadaptivefunctionestimationbyDonohoandJohnstone.Thelassoideaisquitegeneralandcanbeappliedinavarietyofstatisticalmodels:extensionstogeneralizedregressionmodelsandtree-basedmodelsarebrieflydescribed.Keywords:QUADRATICPROGRAMMING;REGRESSION;SHRINKAGE;SUBSETSELECTION1.INTRODUCTIONConsidertheusualregressionsituation:wehavedata(xi,yi),i=1,2,...,N,wherex=(x,...,xP)Tandyiaretheregressorsandresponsefortheithobservation.Theordinaryleastsquares(OLS)estimatesareobtainedbyminimizingtheresidualsquarederror.TherearetworeasonswhythedataanalystisoftennotsatisfiedwiththeOLSestimates.Thefirstispredictionaccuracy:theOLSestimatesoftenhavelowbiasbutlargevariance;predictionaccuracycansometimesbeimprovedbyshrinkingorsettingto0somecoefficients.Bydoingsowesacrificealittlebiastoreducethevarianceofthepredictedvaluesandhencemayimprovetheoverallpredictionaccuracy.Thesecondreasonisinterpretation.Withalargenumberofpredictors,weoftenwouldliketodetermineasmallersubsetthatexhibitsthestrongesteffects.ThetwostandardtechniquesforimprovingtheOLSestimates,subsetselectionandridgeregression,bothhavedrawbacks.Subsetselectionprovidesinterpretablemodelsbutcanbeextremelyvariablebecauseitisadiscreteprocess-regressorsareeitherretainedordroppedfromthemodel.Smallchangesinthedatacanresultinverydifferentmodelsbeingselectedandthiscanreduceitspredictionaccuracy.Ridgeregressionisacontinuousprocessthatshrinkscoefficientsandhenceismorestable:however,itdoesnotsetanycoefficientsto0andhencedoesnotgiveaneasilyinterpretablemodel.Weproposeanewtechnique,calledthelasso,for'leastabsoluteshrinkageandselectionoperator'.Itshrinkssomecoefficientsandsetsothersto0,andhencetriestoretainthegoodfeaturesofbothsubsetselectionandridgeregression.tAddressforcorrespondence:DepartmentofPreventiveMedicineandBiostatistics,andDepartmentofStatistics,UniversityofToronto,12Queen'sParkCrescentWest,Toronto,Ontario,M5S1A8,Canada.E-mail:tibs@utstat.toronto.edu?1996RoyalStatisticalSociety0035-9246/96/58267268TIBSHIRANI[No.1,InSection2wedefinethelassoandlookatsomespecialcases.ArealdataexampleisgiveninSection3,whileinSection4wediscussmethodsforestimationofpredictionerrorandthelassoshrinkageparameter.ABayesmodelforthelassoisbrieflymentionedinSection5.WedescribethelassoalgorithminSection6.SimulationstudiesaredescribedinSection7.Sections8and9discussextensionstogeneralizedregressionmodelsandotherproblems.SomeresultsonsoftthresholdingandtheirrelationshiptothelassoarediscussedinSection10,whileSection11containsasummaryandsomediscussion.2.THELASSO2.1.DefinitionSupposethatwehavedata(xi,yi),i=1,2,...,N,wherexi=(xi,...X,)Tarethepredictorvariablesandyiaretheresponses.Asintheusualregressionset-up,weassumeeitherthattheobservationsareindependentorthattheyisareconditionallyindependentgiventhexys.Weassumethatthexyarestandardizedsothat2ixyl/N?,Eix2/N=1.Letting,3=(PI,...,pp)T,thelassoestimate(&,/3)isdefinedby(&3)=argminf(Y

1 / 23
下载文档,编辑使用

©2015-2020 m.777doc.com 三七文档.

备案号:鲁ICP备2024069028号-1 客服联系 QQ:2149211541

×
保存成功