An Introduction to Bootstrap Methods using Arc

整理文档很辛苦,赏杯茶钱您下走!

免费阅读已结束,点击下载阅读编辑剩下 ...

阅读已结束,您可以下载文档离线阅读编辑

资源描述

AnIntroductiontoBootstrapMethodsusingArcIainPardoeDepartmentofAppliedStatisticsSchoolofStatisticsUniversityofMinnesota,St.Paul,MN55108TechnicalReportNumber631WorkSupportedbytheNationalScienceFoundation,GrantDUE96-52887February24,2000AbstractThisreportpresents(1)thebasicideasofbootstrappingwhenappliedtomul-tiplelinearregression,asdescribedin[2,3],and(2)howtoimplementtheseideasusingArc,thecomputerpackagethataccompanies[1].1IntroductionThisreportprovides(1)anintroductiontobootstrapmethodsinlinearregressionanal-ysesasdiscussedin[2,3],and(2)computercodeforusewiththeprogramArc,de-scribedin[1],thatimplementstheseanalyses.Theremainderofthissectionoutlinesthegeneralideasbehindlinearregression.InSection2,Isummarizethebasicboot-strapapproachtostatisticalinference,andpresenttwowaysofapplyingittolinearregression.Next,IdescribesomeoftheissuesinvolvedwithestimatingandtestingmeanfunctioncoefficientsinSections3and4respectively.Finally,Ipresentsomeex-amplesinSection5,andoutlinesomeoftheotherareasinregressionwherebootstrapmethodscouldbeusedtogoodeffectinSection6.ThemostrecentversionofArccanbeobtainedontheInternetfromthelinkfileficproblemsdiscussed.Asthesemayhaveinterestinvarioussituations,manyreaderswillfindtheadditionsuseful.Theadditionscanalsoprovideastarting1pointforimplementationofthebootstrapinothersituations,butforthisthereadermustbeabletoreadandwritecomputerprogramsinthelanguagelisp.Thereareseveralwaysofgettingstarted.Tierneyin[4]providesaveryreadableintroduction.Severalon-linereferencescanbeobtainedfrom://“SimulationsusingArc”maybehelpful,andisavailablefrom[1],regressionconcernsaresponseandpredictors,.Thegeneralgoalinregressionistostudyhowtheconditionaldistributionofchangesasthevalueofchanges,oftenconcentratingonthemeanfunction,.Inmanyregressionproblems,theresponseiswritten!#whereiscalledthestatisticalerrorandtheweights#%$’&areknown,positivenum-bers.Anotherfeatureoftheconditionaldistributionofthatisoftenstudiedinregressionisthevariancefunction(*)+,*’(*)+-./0#.Let1bea24365vectoroftermsderivedfrom.Typically,1willconsistofaconstant1foranintercept,and72985,additionalfunctionsof,likepolynomialsorothertransformations.Thelinearregressionmodelhasmeanfunction:*;1*=?@=ACBDEEE=GF,HIBF0HJ’K1(1)whereKLMNO=I=F0Hisa2*3P5vectorofmeanfunctioncoefficients,andvariancefunction(*)+QRS.0#(2)TheseassumedformsofthemeanandvariancefunctionsimplythatO0TU&and(*)+,O.VWRS.Thisreflectsanalternativewayofspecifyingthegeneralformofthelinearregressionmodel—thelinearmeanfunction(1)togetherwiththeassumptionthatthedistributionoftheerrorsisindependentof.Forafullparametricanalysis,thedistributionof,oralternativelyof,mustbespecified.Fornormallydistributederrors,theleastsquarestheoryofregressionestimationandinferenceprovidesstraightforward,exactmethodsforanalysis.Butfornon-normalerrors,thesemethodshavethepotentialtobeinaccurateormisleading.Resamplingmethodssuchasthebootstrapprovideanalternativemethodology,withthepotentialtobothXreinforceconclusionsarrivedatusingnormaltheory,and2Ytoprovideestimationandinferencetechniquesinsituationswherenormaltheorydoesnotseemtobejustified.Theexamplesinthisreportfocusmainlyonthefirstofthesegoals,althoughSec-tion6mentionssomeareasthatcouldinvolvemoreinthewayofthesecondofthesegoals.2Twoalternativeparadigmsforusingbootstrapmeth-odsinlinearregressionThebootstrapisadata-basedsimulationmethodforstatisticalinference.Thebasicideaisasfollows.Iwishtomakeaninferenceabouta(population)quantity,sayZ,forwhichIhaveadata-basedestimate,[Z.Ithenwanttogetsomeideaofthedistributionofmyestimate,withouthavingtomakeassumptionsaboutmydata(forexample,thatitcomesfromamultivariatenormaldistribution).Onewaytodothisistoresamplewithreplacementfrommydatatogetabootstrapsample(ofthesamesizeasmyoriginalsample,andmadeupofcasesfrommyoriginalsample,someappearingonce,sometwice,andsoon,andsomenotappearingatall).Ithencreatealargenumber,\,ofsuchbootstrapsamples,andcalculate[Zforeachsample.(Fornotation,Idenotebootstrapestimateswithastar,andhence[Zforabootstrapsampleisdenoted[ZG].)These\[ZG]’scontaininformationthatcanbeusedtomakeinferencesfromthedata;essentially,[ZG]isto[Zas[ZistoZ.Someofthetypesofinferencepos

1 / 19
下载文档,编辑使用

©2015-2020 m.777doc.com 三七文档.

备案号:鲁ICP备2024069028号-1 客服联系 QQ:2149211541

×
保存成功