On the Bayes Risk in Information-Hiding Protocols

song625613
0 ℃
2020-06-26

整理文档很辛苦，赏杯茶钱您下走！

还剩 ... 页未读，继续阅读 >>

免费阅读已结束，点击下载阅读编辑剩下 ... 页

阅读已结束，您可以下载文档离线阅读编辑

资源描述

OntheBayesRiskinInformation-HidingProtocols∗KonstantinosChatzikokolakisCatusciaPalamidessiINRIAandLIX,´EcolePolytechniquePalaiseau,France{kostas,catuscia}@lix.polytechnique.frPrakashPanangadenMcGillUniversityMontreal,Quebec,Canadaprakash@cs.mcgill.caAbstractRandomizedprotocolsforhidingprivateinformationcanberegardedasnoisychannelsintheinformation-theoreticsense,andtheinferenceoftheconcealedinformationcanberegardedasahypothesis-testingprob-lem.WeconsidertheBayesianapproachtotheproblem,andinvestigatetheprobabilityoferrorassociatedtotheMAP(MaximumAposterioriProbability)inferencerule.Ourmainresultisaconstructivecharacter-izationofaconvexbaseoftheprobabilityoferror,whichallowsustocomputeitsmaximumvalue(overallpossibleinputdistributions),andtoidentifyupperboundsforitintermsofsimplefunctions.Asasideresult,weareabletoimprovetheHellman-RavivandtheSanthi-Vardyboundsexpressedintermsofconditionalentropy.WethendiscussanapplicationofourmethodologytotheCrowdsprotocol,andinparticularweshowhowtocomputetheboundsontheprobabilitythatanadversarybreakanonymity.1IntroductionInformation-hidingprotocolstrytohidetherelationbetweencertainfacts,thatwewishtomaintainhidden,andtheobservableconsequencesofthesefacts.ExampleofsuchprotocolsareanonymityprotocolslikeCrowds[23],OnionRouting[29],andFreenet[8].Oftentheseprotocolsuserandomizationtoob-fuscatethelinkbetweentheinformationthatwewishtokeephiddenandthe∗ThisworkhasbeenpartiallysupportedbytheINRIADREI´EquipeAssoci´eePRINT-EMPS.TheworkofKonstantinosChatzikokolakisandCatusciaPalamidessihasbeenalsosupportedbytheINRIAARCprojectProNoBiS.observedevents.Crowds,forinstance,triestoconcealtheidentityoftheorigi-natorofamessagebyforwardingthemessagerandomlyuntilitsdestination,sothatifanattackerinterceptsthemessage,itcannotbesurewhetherthesenderistheoriginatororjustaforwarder.Inmostcases,protocolsliketheonesabovecanberegardedasinformation-theoreticchannels,wheretheinputsarethefactstokeephidden,theoutputsaretheobservables,andthematrixrepresentsthecorrelationbetweenthefactsandtheobservedevents,intermsofconditionalprobabilities.AnadversarycantrytoinferthefactsfromtheobservedeventsusingtheBayesianmethod,whichisbasedontheprincipleofassuminganaprioriprobabilitydistributiononthehiddenfacts(hypotheses),andderivingfromthat(andfromthematrix)theaposterioridistributionafteracertaineventhasbeenobserved.ItiswellknownthatthebeststrategyfortheadversaryistoapplytheMAP(MaximumAposterioriProbability)criterion,which,asthenamesays,dictatesthatoneshouldchoosethehypothesiswiththemaximumaposterioriprobabilitygiventheobservation.“Best”meansthatthisstrategyinducesthesmallestprobabil-ityofguessingthewronghypothesis.Theprobabilityoferror,inthiscase,isalsocalledBayesrisk.Intuitively,theBayesriskismaximumwhentherowsofthechannel’smatrixareallthesame;thiscasecorrespondsindeedtocapacity0,whichmeansthattheinputandtheoutputareindependent,i.e.wedonotlearnanythingabouttheinputsbyobservingtheoutputs.Thisistheidealsituation,fromthepointofviewofinformation-hidingprotocols.Inpractice,however,itisdiﬃculttoachievesuchdegreeofprivacy.WearetheninterestedinmaximizingtheBayesrisk,sotocharacterizequantitativelytheprotectionoﬀeredbytheprotocol.ThemainpurposeofthispaperistoinvestigatetheBayesrisk,inrelationtothechannel’smatrix,andtoproducetightboundsonit.Theinterestinﬁndinggoodboundsfortheprobabilityoferrorismotivatedalsobythefactthatinsomecasethedecisionregioncanhaveacomplicatedgeometry,orthedecisionfunctioncanbeverysensitivetosmallvariationsintheinputdistribution,thusmakingitdiﬃculttocomputetheprobabilityoferror.Someexamplesofsuchsituationsareillustratedin[26].Goodboundsbasedon“easy”functions(i.e.functionseasytocompute,andnottoosensitivetocomputationalerrors)arethereforeveryusefulinsuchsituationsastheycanbeusedasanapproximationoftheprobabilityoferror.Itisparticularlynicetohaveconvexboundssincetheyboundanyestimatebasedonlinearinterpolation.Sinceourboundisbasedontheconvexhullitisthebestconvexboundthatmatchesthecornerpoints.TherearemanyboundsknowninliteraturefortheBayesrisk.Oneoftheseistheequivocationbound,duetoR´enyi[24],whichstatesthattheprobabilityoferrorisboundedbytheconditionalentropyofthechannel’sinputgiventheoutput.Later,HellmanandRavivimprovedthisboundbyhalf[15].Recently,SanthiandVardyhaveproposedanewbound,thatdependsexponentiallyonthe(oppositeofthe)conditionalentropy,andwhichconsiderablyimprovestheHellman-Ravivboundinthecaseofmulti-hypothesistesting[26].Thelatterisbetter,however,inthecaseofbinaryhypothesistesting.2TheBayesapproachtohypothesistestingisoftencriticizedbecauseitas-sumestheknowledgeoftheaprioridistribution,oratleastofagoodapprox-imationofit,whichisoftenanunjustiﬁedassumption.However,eveniftheadversarydoesnotknowtheaprioridistribution,themethodisstillvalidasymptotically,undertheconditionthatthematrix’srowsareallpairwisedis-tinguished.Undersuchconditionindeed,asshownin[3],byrepeatingtheexperimentthecontributionoftheaprioriprobabilitybecomeslessandlessrelevantforthecomputationoftheBayesianrisk,andit“washesout”inthelimit.Furthermore,t