CS 189 Summer 2019
Introduction to Machine Learning
Midterm

- Please do not open the exam before you are instructed to do so.
- The exam is closed book, closed notes except your two-page cheat sheet.
- Electronic devices are forbidden on your person, including cell phones, iPods, headphones, and laptops. Turn your cell phone off and leave all electronics at the front of the room, or risk getting a zero on the exam.
- You have 3 hours.
- Please write your initials at the top right of each page after this one (e.g., write "MK" if you are Marc Khoury). Finish this by the end of your 3 hours.
- Mark your answers on the exam itself in the space provided. Do not attach any extra sheets.
- The total number of points is 150. There are 26 multiple choice questions worth 3 points each, and 5 written questions worth a total of 72 points.
- For multiple answer questions, fill in the bubbles for ALL correct choices: there may be more than one correct choice, but there is always at least one correct choice. NO partial credit on multiple answer questions: the set of all correct answers must be checked.

First name: ____________
Last name: ____________
SID: ____________
First and last name of student to your left: ____________
First and last name of student to your right: ____________

Q1. [60 pts] Multiple Answer

Fill in the bubbles for ALL correct choices: there may be more than one correct choice, but there is always at least one correct choice. NO partial credit: the set of all correct answers must be checked.

(a) [3 pts] Let $X \sim \mathrm{Bernoulli}\!\left(\frac{1}{1+\exp(-\theta)}\right)$ for some $\theta \in \mathbb{R}$. What is the MLE estimator of $\theta$?

○ $X$
○ $1$
○ $0$
○ Does not exist.

Solution: $\ell(\theta; 1) = \frac{1}{1+\exp(-\theta)}$, which has no maximizer in $\mathbb{R}$. $\ell(\theta; 0) = 1 - \frac{1}{1+\exp(-\theta)}$, which has no maximizer in $\mathbb{R}$.

(b) [3 pts] Let $Y \sim N(X\theta, I_n)$ for some unknown $\theta \in \mathbb{R}^d$ and some known $X \in \mathbb{R}^{n \times d}$ that has full column rank and $d \le n$. What is the MLE estimator of $\theta$?

○ $(X^\top X)^{-1} X^\top Y$
○ $X (X^\top X)^{-1} Y$
○ $Y + Z \;\; \forall\, Z \in \mathrm{Null}(X)$
○ Does not exist.

Solution: Maximizing the likelihood function is equivalent to minimizing $\|Y - X\theta\|_2^2$, which we can do with linear least squares. Interestingly, this means that the MLE estimator of $\theta$ when $y = X\theta + \varepsilon$, where $\varepsilon \sim N(0, I_n)$, is the standard least squares solution.

(c) [3 pts] Let $f(x) = -\sum_{i=1}^n x_i \log x_i$. For some $x$ such that $\sum_{i=1}^n x_i = 1$ and $x_i > 0$, the Hessian of $f$ is:

○ positive definite
○ negative definite
○ positive semidefinite
○ negative semidefinite
○ indefinite (neither positive semidefinite nor negative semidefinite)
○ invertible
○ nonexistent
○ None of the above.

Solution: $\nabla_x f(x) = -\vec{1} - \log(x)$ and $\nabla_x^2 f(x) = \mathrm{diag}\!\left(-\frac{1}{x_1}, \ldots, -\frac{1}{x_n}\right)$.
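A quick way to confirm which of the definiteness options this diagonal Hessian satisfies (a short added verification, using only the formula above): for any nonzero $z \in \mathbb{R}^n$,

$$z^\top \nabla_x^2 f(x)\, z \;=\; -\sum_{i=1}^n \frac{z_i^2}{x_i} \;<\; 0 \qquad \text{since every } x_i > 0,$$

so the Hessian is negative definite, hence also negative semidefinite, and as a diagonal matrix with all nonzero diagonal entries it is invertible as well.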
(d) [3 pts] Which of the following statements about optimization algorithms are correct?

○ Newton's method always requires fewer iterations than gradient descent.
○ Stochastic gradient descent always requires fewer iterations than gradient descent.
○ Stochastic gradient descent, even with a small step size, sometimes increases the loss in some iteration for convex problems.
○ Gradient descent, regardless of the step size, decreases the loss in every iteration for convex problems.

Solution: Arguments that one reasonable optimization algorithm dominates another for all problems are in general wrong. Gradient descent can work for some losses on which Newton's method does not even converge. SGD, due to its stochasticity, does not necessarily decrease the loss in each iteration: one can construct a case where a single data point contradicts all the others, so optimizing for that particular data point may increase the overall loss. Gradient descent with an extremely large step size may never reach a small neighborhood of the minimum; it may just jump around outside that neighborhood.

(e) [3 pts] Assume we run the hard-margin SVM algorithm on 100 $d$-dimensional points from 2 different classes. The algorithm outputs a solution. After which transformation to the training data would the algorithm still output a solution?

○ Centering the data points
○ Transforming each data point from $x$ to $Ax$ for some matrix $A \in \mathbb{R}^{d \times d}$
○ Dividing all entries of each data point by some negative constant $c$
○ Adding an additional feature

Solution: If all entries of $A$ are 0, the data points all land on the origin. If a set of data points is linearly separable in $d$ dimensions, it is linearly separable in $d+1$ dimensions regardless of what the $(d+1)$-th feature is.

(f) [3 pts] Which of the following holds true when running an SVM algorithm?

○ Increasing or decreasing the value of $\alpha$ only allows the decision boundary to translate.
○ Given $n$-dimensional points, the SVM algorithm finds a hyperplane passing through the origin in the $(n+1)$-dimensional space that separates the points by their class.
○ The decision boundary rotates if we change the constraint to $w^\top x + \alpha \ge 3$.
○ The set of weights that fulfill the constraints of the SVM algorithm is convex.

(g) [3 pts] Consider the set $\{x \in \mathbb{R}^d : (x-\mu)^\top \Sigma (x-\mu) = 1\}$ given some vector $\mu \in \mathbb{R}^d$ and matrix $\Sigma \in \mathbb{R}^{d \times d}$. Which of the following are true?

○ If $\Sigma$ is the identity matrix scaled by some constant $c$, then the set is isotropic.
○ Increasing the eigenvalues of $\Sigma$ increases the radii of the ellipsoid.
○ Increasing the eigenvalues of $\Sigma$ decreases the radii of the ellipsoid.
○ A singular $\Sigma$ produces an ellipsoid with an infinite radius.

(h) [3 pts] Consider the linear regression problem with a full rank design matrix. Which of the following regularizations in general encourage more sparsity than the non-regularized objective?

○ $L_0$ regularization (the number of non-zero coordinates)
○ $L_1$ regularization
○ $L_2$ (Tikhonov) regularization
○ $L_3$ regularization
○ $L_4$ regularization
○ $L_\infty$ regularization (the maximum absolute value across all coordinates)

Solution: Think about the corresponding equivalent constrained optimization problem.

(i) [3 pts] Which of the following statements are correct?

○ In ridge regression, the regularization parameter is usually set as 0.1.
○ SVM in general does not enforce sparsity over the parameters $w$ and $\alpha$.
○ In binary linear classification, the support vectors of SVM might contain samples from only one class even if the training data has both classes.
○ In binary linear classification, suppose $1\{w^\top x + \alpha \ge 0\}$ is one maximum margin linear classifier; then the margin only depends on $w$ but not $\alpha$.

Solution: Basic definitions.

(j) [3