A spectral algorithm for learning mixtures of dist

整理文档很辛苦,赏杯茶钱您下走!

免费阅读已结束,点击下载阅读编辑剩下 ...

阅读已结束,您可以下载文档离线阅读编辑

资源描述

ASpetralAlgorithmforLearningMixturesofDistributionsSantoshVempalaGrantWangyAbstratWeshowthatasimplespetralalgorithmforlearningamixtureofkspherialGaussiansinRnworksremarkablywell|itsueedsinidentifyingtheGaussiansassumingessentiallytheminimumpossibleseparationbetweentheirentersthatkeepsthemunique(solvinganopenproblemof[1℄).Thesampleomplexityandrunningtimearepolynomialinbothnandk.Thealgorithmalsoworksforthemoregeneralproblemoflearningamixtureof\weaklyisotropidistributions(e.g.amixtureofuniformdistributionsonubes).Thealgorithmisrobustinthatitantoleratesmallamountsofnoiseandthusanalsobeusedforthemoregeneralproblemofndingthebestmixturemodelthattsagivendataset,providedthereisagoodt.1IntrodutionLearningamixtureofdistributionsisalassialprobleminstatistisandlearningtheory(see[10,14℄);morereently,ithasalsobeenproposedasamodelforlustering.Inthebasiversionoftheproblemwearegivenrandomsamplesfromamixtureofkdistributions,F1;:::;Fk.Eahsampleisdrawnindependentlywithprobabilitywifromthei’thdistribution.Thenumbersw1;:::;wkarealledthemixingweights.Theproblemistolassifytherandomsamplesaordingtowhihdistributiontheyomefrom(andtherebyinferthemixingweights,meansandotherpropertiesoftheunderlyingdistributions).AnimportantaseofthisproblemiswheneahunderlyingdistributionisaGaussian.Inthisase,thegoalistondthemeanandovarianesofeahGaussian(alongwiththemixingweights).Thisproblemseemstobeofgreatpratialinterestandmanyheuristishavebeenusedtosolveit.ThemostfamousamongthemistheEMalgorithm[5℄.UnfortunatelyEMisaloalsearhheuristithatanfail.AspeialaseoftheproblemiswhentheGaussiansareassumedtobespherial,i.e.thevarianeisthesameinanydiretion.Inreentyears,therehasbeensubstantialprogressindevelopingpolynomial-timealgorithmsforthisspeialase,bymakingassumptionsontheseparationbetweenthemeansoftheGaussians.Thisseparationonditionisruial,soweproeedtomakeitexpliit.LetF1;:::;FkbespherialGaussiansinRnwithmeanvetors1;:::;kandvarianes21;:::;2k.WewillrefertoipnastheradiusofFiandjjijjjastheseparationbetweenFiandFj.Ifthepairwiseseparationislargerthantheradii,thenpointsfromdierentGaussiansareisolatedinspaeandeasytolassify.Ontheotherhand,iftheseparationisverysmall,thenthelassiationproblemneednothaveauniquesolution.1.1PreviousworkDasgupta[3℄usedrandomprojetiontolearnamixtureofspherialGaussiansprovidedtheyareessentiallynon-overlapping,i.e.theoverlapinprobabilitymassisexponentiallysmallinn.DepartmentofMathematisandLaboratoryforComputerSiene,MIT,vempalamath.mit.eduyLaboratoryforComputerSiene,MIT,gjwtheory.ls.mit.edu.BothauthorsaresupportedinpartbyNSFCareerawardCCR-987024.1Hisalgorithmispolynomial-timeprovidedthesmallestmixingweightwminis(1=k)andtheseparationisjjijjjCmaxfi;jgn12foraonstantC.Inotherwords,theseparationisproportionaltothelargerradius(thealgorithmalsorequiredallthevarianestobewithinaboundedrange).Shortlythereafter,itwasshownbyDasguptaandShulman[6℄thatavariantofEMworkswithasmallerseparation(alongwithsometehnialonditionsonthevarianes):jjijjjCmaxfi;jgn14log14(n=wmin)(1)ThisseparationistheminimumatwhihrandompointsfromthesameGaussiananbedis-tinguishedfromrandompointsfromtwodierentGaussiansbasedonpairwisedistanes.SopointsfromtheGaussianwith(approximately)thesmallestvarianehavethesmallestpairwisedistanes.Theyanbeidentiedandremovedandthisanberepeatedontheremainingpoints.AroraandKannan[1℄generalizedthistonon-spherialGaussians.Theyusedisoperimetrithe-oremstoobtaindistaneonentrationresultsforthenon-spherialase.Atthisseparation,theiralgorithmsimplyidentiesallpointsatroughlytheminimumdistanefromeahotherasomingfromasingleGaussian,removesthemandrepeatsontheremainingdata.Theyalsogiveaversionthatusesrandomprojetion.LearningamixtureofspherialGaussiansatasmallerseparation(whendistaneonentrationresultsarenolongervalid)hasbeenanopenproblem.1.2OurresultsInorderforthesolutiontothelassiationproblemtobewell-dened(i.e.uniquewithreasonableprobability)weneedaseparationofatleastqjjijjjCmaxfi;jg:Atthisseparationtheoverlapintheprobabilitymassisaonstantfration.Soinpartiular,distaneonentrationresultsarenolongerappliable.Inthispaper,weshowthatasimplespetralalgorithmanlearnamixtureofkGaussiansatthisminimumseparationintimepolynomialinkO(k)poly(n).OurmainresultisthatwithaslightlylargerseparationofjjijjjCmaxfi;jgk14log14(n=wmin)(2)thealgorithmispolynomialinbothkandn.Notethatthisonditionisalmostindependentofnandismuhweakerthan(1)asthedimension(n)getslargerthanthenumberofGaussians(k).Moregenerally,thealgorithmworksforamixtureofweaklyisotropidistributions,i.e.dis-tributionswiththepropertythatthevarianeisthesamealonganydiretion.Examplesofthisinludeuniformdistributionoverubes,balls,andgenerallyweaklyisotropionvexsets(denedinsetion4),adistributionproportionaltoejxjforaonstant,et.Themainstepofthealgorithmistoprojettothesubspaespannedbythetopkrightsingularvetorsofthesamplematrix(i.e.itskprinipalomponents).Thisistherankksubspaewhihmaximizesthesquaredprojetionsofthesamples.Thekeyobservationisthatwithhighprobabilitythissubspaeliesverylosetothespanofthemeanvetors

1 / 18
下载文档,编辑使用

©2015-2020 m.777doc.com 三七文档.

备案号:鲁ICP备2024069028号-1 客服联系 QQ:2149211541

×
保存成功