Learning with Bayesian Networks
David Heckerman
Presented by Colin Rickert

Bayesian networks represent an advanced form of general Bayesian probability.
A Bayesian network is a graphical model that encodes probabilistic relationships among variables of interest [1].
The model has several advantages for data analysis over rule-based decision trees [1].

Outline
1. Bayesian vs. classical probability methods
2. Advantages of Bayesian techniques
3. The coin toss prediction model from a Bayesian perspective
4. Constructing a Bayesian network with prior knowledge
5. Optimizing a Bayesian network with observed knowledge (data)
6. Exam questions

Bayesian vs. the Classical Approach
The Bayesian probability of an event x represents a person's degree of belief or confidence in that event's occurrence, based on prior and observed facts.
Classical probability refers to the true or actual probability of the event and is not concerned with observed behavior.
The Bayesian approach restricts its prediction to the next (N+1) occurrence of an event, given the previous N observed events.
The classical approach is to predict the likelihood of any given event regardless of the number of occurrences.

Example
Imagine a coin with irregular surfaces such that the probability of landing heads or tails is not equal.
The classical approach would be to analyze the surfaces to create a physical model of how the coin is likely to land on any given throw.
The Bayesian approach simply restricts attention to predicting the next toss based on previous tosses.

Advantages of Bayesian Techniques
How do Bayesian techniques compare to other learning models?
1. Bayesian networks can readily handle incomplete data sets.
2. Bayesian networks allow one to learn about causal relationships.
3. Bayesian networks readily facilitate use of prior knowledge.
4. Bayesian methods provide an efficient method for preventing the overfitting of data (there is no need for pre-processing).

Handling of Incomplete Data
Imagine a data sample where two attribute values are strongly anti-correlated.
With decision trees, both values must be present to avoid confusing the learning model.
Bayesian networks need only one of the values to be present and can infer the absence of the other:
Imagine two variables, one for gun-owner and the other for peace activist.
Data should indicate that you do not need to check both values.

Learning about Causal Relationships
We can use observed knowledge to determine the validity of the acyclic graph that represents the Bayesian network.
For instance, is running a cause of knee damage?
Prior knowledge may indicate that this is the case.
Observed knowledge may strengthen or weaken this argument.

Use of Prior Knowledge and Observed Behavior
Constructing prior knowledge is relatively straightforward: draw "causal" edges between any two factors that are believed to be correlated.
Causal networks represent prior knowledge, whereas the weights of the directed edges can be updated in a posterior manner based on new data.

Avoidance of Overfitting Data
Contradictions do not need to be removed from the data.
Data can be "smoothed" such that all available data can be used.

The "Irregular" Coin Toss from a Bayesian Perspective
Start with the set of probabilities θ = {θ1, …, θn} for our hypothesis.
For the coin toss we have only one θ, representing our belief that we will toss a "heads", and 1 − θ for tails.
Predict the outcome of the next (N+1) flip based on the previous N flips, for 1, …, N:
D = {X1 = x1, …, XN = xN}
We want to know the probability that XN+1 = xN+1 = heads.
ξ represents information we have observed thus far (i.e. ξ = {D}).

Bayesian Probabilities
Posterior Probability, p(θ|D,ξ): Probability of a particular value of θ given that D has been observed (our final value of θ). In this case ξ = {D}.
Prior Probability, p(θ|ξ): Prior probability of a particular value of θ given no observed data (our previous "belief").
Observed Probability or "Likelihood", p(D|θ,ξ): Likelihood of the sequence of coin tosses D being observed given that θ is a particular value. In this case ξ = {}.
p(D|ξ): Raw probability of D.

Bayesian Formulas for Weighted Coin Toss (Irregular Coin)
[equation]
where
[equation]
*Only need to calculate p(D|θ,ξ) and p(θ|ξ); the rest can be derived.

Integration
To find the probability that XN+1 = heads, we must integrate over all possible values of θ to find the average value of θ, which yields:
[equation]

Expansion of Terms
1. Expand observed probability p(D|θ,ξ):
[equation]
2. Expand prior probability p(θ|ξ):
[equation]
*The "Beta" function yields a bell curve upon integration, which is a typical probability distribution. It can be viewed as our expectation of the shape of the curve.

Beta Function and Integration
Integrating gives the desired result:
[equation]
Combine the product of both functions to yield:
[equation]

Key Points
Multiply the results of the beta function (prior probability) with the results of the coin toss function for θ (observed probability). The result is our confidence for this value of θ.
Integrating the product of the two with respect to θ, over all values 0 < θ < 1, is necessary to yield the average value of θ that best fits the observed facts + prior knowledge.

Bayesian Networks
1. Construct prior knowledge from a graph of causal relationships among variables.
2. Update the weights of the edges to reflect confidence in each causal link based on observed data (i.e. posterior knowledge).

Example Network
Consider a credit fraud network designed to determine the probability of credit fraud based on certain events.
Variables include:
Fraud (f): whether fraud occurred or not
Gas (g): whether gas was purchased within the last 24 hours
Jewelry (j): whether jewelry was purchased within the last 24 hours
Age (a): age of the card holder
Sex (s): sex of the card holder
The task of determining which variables to include is not trivial; it involves decision analysis.

Construct Graph Base
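Returning to the irregular coin toss: the Key Points slide describes multiplying a Beta prior by the coin-toss likelihood and integrating over θ to obtain the average θ. The sketch below is one way to make that concrete, not the presenter's own code. The function names, the "H"/"T" string encoding of tosses, and the hyperparameters `alpha_heads` and `alpha_tails` are assumptions introduced for illustration. It computes the prediction twice: once with the standard closed form for a Beta prior, and once by integrating numerically over θ as the slide describes.

```python
def count_tosses(tosses):
    """Count heads and tails in a string such as "HHTH" (assumed encoding)."""
    heads = tosses.count("H")
    tails = tosses.count("T")
    if heads + tails != len(tosses):
        raise ValueError("tosses may contain only 'H' and 'T'")
    return heads, tails


def predictive_heads_closed_form(tosses, alpha_heads=1.0, alpha_tails=1.0):
    """P(next toss = heads | D) under a Beta(alpha_heads, alpha_tails) prior.

    Uses the standard Beta-Bernoulli result: the posterior mean of theta is
    (alpha_heads + heads) / (alpha_heads + alpha_tails + N).
    """
    heads, tails = count_tosses(tosses)
    return (alpha_heads + heads) / (alpha_heads + alpha_tails + heads + tails)


def predictive_heads_numeric(tosses, alpha_heads=1.0, alpha_tails=1.0, steps=10_000):
    """Same quantity, obtained by midpoint-rule integration over 0 < theta < 1."""
    heads, tails = count_tosses(tosses)
    weighted_sum = 0.0  # approximates the integral of theta * prior * likelihood
    normaliser = 0.0    # approximates the integral of prior * likelihood
    for i in range(steps):
        theta = (i + 0.5) / steps
        # Unnormalised posterior: Beta prior kernel times likelihood.
        # Constant factors cancel in the ratio below, so they are omitted.
        weight = theta ** (alpha_heads + heads - 1) * (1 - theta) ** (alpha_tails + tails - 1)
        weighted_sum += theta * weight
        normaliser += weight
    return weighted_sum / normaliser


if __name__ == "__main__":
    observed = "HHTHHHTH"  # hypothetical data: 6 heads, 2 tails
    print(f"{predictive_heads_closed_form(observed, 2.0, 2.0):.4f}")  # 0.6667
    print(f"{predictive_heads_numeric(observed, 2.0, 2.0):.4f}")
```

With no data the prediction is just the prior mean, and each observed toss shifts it. This is also the sense in which the prior "smooths" the data: six heads out of eight gives 0.6667 here rather than the raw frequency 0.75.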

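The formulas on the coin-toss slides did not survive in this copy. As a hedged reconstruction, the standard Beta-Bernoulli derivation that matches the slide text (posterior, prior, likelihood, integration over θ) can be written as follows. The symbols h and t (numbers of heads and tails in D, with h + t = N) and the Beta hyperparameters α_h, α_t (with α = α_h + α_t) are assumed notation and do not appear in the surviving text.

```latex
\begin{align*}
% Bayes' rule for the coin parameter, with the raw probability of D as normaliser
p(\theta \mid D, \xi) &= \frac{p(\theta \mid \xi)\, p(D \mid \theta, \xi)}{p(D \mid \xi)},
\qquad
p(D \mid \xi) = \int_0^1 p(D \mid \theta, \xi)\, p(\theta \mid \xi)\, d\theta \\
% Prediction for the next toss: the average (posterior mean) of theta
p(X_{N+1} = \text{heads} \mid D, \xi) &= \int_0^1 \theta\, p(\theta \mid D, \xi)\, d\theta \\
% Observed probability (likelihood) for h heads and t tails
p(D \mid \theta, \xi) &= \theta^{h} (1 - \theta)^{t} \\
% Assumed Beta prior
p(\theta \mid \xi) &= \frac{\Gamma(\alpha)}{\Gamma(\alpha_h)\, \Gamma(\alpha_t)}\,
  \theta^{\alpha_h - 1} (1 - \theta)^{\alpha_t - 1} \\
% The product of prior and likelihood is again a Beta density
p(\theta \mid D, \xi) &= \frac{\Gamma(\alpha + N)}{\Gamma(\alpha_h + h)\, \Gamma(\alpha_t + t)}\,
  \theta^{\alpha_h + h - 1} (1 - \theta)^{\alpha_t + t - 1} \\
% Integrating gives the closed-form prediction
p(X_{N+1} = \text{heads} \mid D, \xi) &= \frac{\alpha_h + h}{\alpha + N}
\end{align*}
```

Only the likelihood and the prior have to be specified; the normaliser p(D|ξ) and the posterior follow from them.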