Approachability in infinite dimensional spaces

uyuyan1234
1 ℃
2020-01-29

整理文档很辛苦，赏杯茶钱您下走！

还剩 ... 页未读，继续阅读 >>

免费阅读已结束，点击下载阅读编辑剩下 ... 页

阅读已结束，您可以下载文档离线阅读编辑

资源描述

ApproachabilityinInﬁniteDimensionalSpacesbyEhudLehrer1SchoolofMathematicalSciencesSacklerFacultyofExactSciencesTelAvivUniversity,RamatAvivTelAviv69978,Israele-mail:lehrer@math.tau.ac.ilFirstversion:April1997February27,2002Abstract.TheapproachabilitytheoremofBlackwell(1956b)isextendedtoinﬁnitedimensionalspaces.Twoplayersplayasequentialgamewhosepayoﬀsarerandomvariables.AsetCofrandomvariablesissaidtobeapproachablebyplayer1ifhehasastrategythatensuresthatthediﬀer-encebetweentheaveragepayoﬀanditsclosestpointinC,almostsurelyconvergestozero.Necessaryconditionsforasettobeapproachablearepresented.1IacknowledgeEilonSolanforhishelpfulcomments.1IntroductionBlackwell(1956b)consideredatwo-playersequentialgamewherethepayoﬀsateachroundarevectorsinaﬁnitedimensionalspace,ratherthannumbers.Apre-speciﬁedsetinthe(vector)payoﬀspaceCissaidtobeapproachablebyplayer1ifhehasastrategythatensuresthatthediﬀerencebetweenthepartialaveragepayoﬀanditsclosestpointinC,almostsurelyconvergeswithtimetozero.Incontrastwiththeﬁnitedimensionalpayoﬀspace,asinBlackwell(1956b),thepayoﬀspaceconsideredhereisinﬁnitedimensional.Wepresentsuﬃcientconditionsthatensurealmostsurelyapproachability.Onemayconsideravector-payoﬀgameasﬁnitelymanygamesplayedsimultaneously.Ineachroundaplayertakesanactionwhichappliestoallgamesplayed.Iftherearenotransferablepayoﬀsfromonegametoanother,thepayoﬀsofthegamesareconsideredasonevector.Theobjectiveofplayer1thenistobringtheaveragevectorpayoﬀintoasetC.Blackwelltreatedthecasewhereineachroundallgamesareplayed.Herewealsoexaminethecasewherenotallgamesareplayedallthetime.Ineachround,dependingonthehistory,adiﬀerentsetofgamesisplayed.Intermsofvector-payoﬀs,somecoordinatesmaybeinactiveinsomerounds.Therelevantaverageisthereforethesumofpastpayoﬀsdividedbythenumberoftimesacoordinatewaspreviouslyactive.Thus,thesumofpayoﬀsisdividedbyanumberthatmayvarywiththecoordinate.Thisfactimposesadiﬃcultyinthatitdoesnotallowuseofthemulti-linearityoftheinnerproduct.Theapproachabilitytheoremhasbeenappliedextensivelysinceitsin-ception.Blackwell(1956a)himselfnotedthatHannan’s(1957)no-regrettheoremcanbeprovenbyusingtheapproachabilitytheorem.AumannandMaschler(1995)usedittoshowthattheuninformedplayerinare-1peatedgamewithincompleteinformationcanguaranteeatleastwhatisthenproventobethevalue.RecentlytheapproachabilitytheorygainedarevivalduetotheinﬂuentialworkofFosterandVohra(1997and1999)oncalibrationanditsrelationtocorrelatedequilibrium.HartandMas-Colell(2000)demonstratedaninteractivelearningprocessthatconvergestocorrelatedequilibrium.InHartandMas-Colell(2001)theyusedtheideabehindthegeometricprincipleofapproachabilitytointroducealargefam-ilyofadaptivelearningprocedures.Rustichini(1999)provedano-regrettheoremforacaseofimperfectmonitoring.Spinat(2000)showedthatanyminimalapproachablesetisaB-set,thatis,asetwhichsatisﬁestheconditionofBlackwell’stheorem.InhisoriginalpaperBlackwell(1956b)alsointroducedthenotionofweakapproachability.Vieille(1992)useddiﬀerentialgameswithaﬁxeddurationtostudyweakapproachabilityinﬁnitedimensionalspaces.Asforapproachabilityinlargespaces,Lehrer(2001a)usedittoshowthatthereexistsapredictionschemethatpassesalargesetofcheckingrulesalaDavid(1982).Sandroniatal.(2000)extendedthisresulttothecasewherethecheckingrulesareprediction-based,thatis,whenaninspectorcanuserulesthatarebasedoncurrentforecastingratherthanonhistoricalonesonly.Lehrer(2001b)showedtheexistenceofaregret-freestrategyagainstinﬁnitelymanyperformancecriteria.Lehrer(2001c)introducedaninﬁnitegamewhereineachroundplayer1choosesadigitandplayer2adistributionoverdigits.Player1winsifthesequenceofdigitshechoseduringthegameisnormalwithrespecttothemeasureinducedbythesequenceofdistributionsplayer2chosethroughthegame.Lehrer(2001c)provedbytheapproachabilitytheoreminlargespacesthatplayer1hasapurewinningstrategyinthisgame.Thisstrategyisinparticularaprocedurebywhichonecanconstructanextendednormalnumberwithrespecttoanydistribution.Inthispaperweseparatethegeometricaspectsofapproachabilityfrom2thestrategicaspects.Thegeometricprinciplesbehindapproachabilityareintroducedﬁrst(Section3)andthenappliedtorandom-variable-payoﬀgames(inSection5).Section6isdevotedtodemonstratingtherelationbetweenthelawoflargenumbersandtheideaofapproachability.2ApproachabilityinanInﬁniteDimensionalSpaceConsiderasequentialgamewhereateachstageplayerichoosesanac-tionfromameasurablesetSi,i=1;2:Let(sn1;sn2)2S1£S2denotethepairofactionstakenattimen.Ahistoryoflengthnisasequence(s11;s12;s21;s22;:::;sn1;sn2).Historiesoflengthnwilllaterbedenotedashn.ThesetofallﬁnitehistoriesisH=[n(S1£S2)n.Foranyhs;hn2Hwesaythathshnifhsisapreﬁxofhn.DenoteH=(S1£S2)1.Foragivenh12Hwedenotebyhnitsnthpreﬁx.Let(Ω;¹;F)beaprobabilityspaceandÂbeafunctionfromHtothesetofrandomvariablesdeﬁnedon(Ω;¹;F)thattakesonlyvaluesinf0;1g.Thus,foreveryh2H,Â(h)isarandomvariabledeﬁnedover(Ω;¹;F)thatattainsonlytwovalues,0or1.WhenÂ(h)(!)=1,wesaythatafterthehistoryh,!isactiveandotherwise,that!isinactive.ThefunctionÂiscalledtheindicator.Thepayoﬀattimenafterthehistoryhn¡12Hoflengthn¡1,isdeterminedbyth