PROBABILISTIC CONSTRAINT LOGIC PROGRAMMING

整理文档很辛苦,赏杯茶钱您下走!

免费阅读已结束,点击下载阅读编辑剩下 ...

阅读已结束,您可以下载文档离线阅读编辑

资源描述

BerichtNr.117,November1997ISSN0947-6954/97ArbeitspapieredesSonderforschungsbereichs340SprachtheoretischeGrundlagenfurdieComputerlinguistikStefanRiezler:ProbabilisticConstraintLogicProgrammingUniversitatStuttgartUniversitatTubingenDeutschlandInformationssystemeGmbHArbeitspapieredesSFB340BerichtNr.117,Oktober1997PROBABILISTICCONSTRAINTLOGICPROGRAMMINGSTEFANRIEZLERAbstract.Thispaperaddressestwocentralproblemsforprobabilisticprocess-ingmodels:parameterestimationfromincompletedataandecientretrievalofmostprobableanalyses.Thesequestionshavebeenansweredsatisfactorilyonlyforprobabilisticregularandcontext-freemodels.Weaddresstheseproblemsforamoreexpressiveprobabilisticconstraintlogicprogrammingmodel.Wepresentalog-linearprobabilitymodelforprobabilisticconstraintlogicprogramming.Ontopofthismodelwedeneanalgorithmtoestimatethepa-rametersandtoselectthepropertiesoflog-linearmodelsfromincompletedata.ThisalgorithmisanextensionoftheimprovediterativescalingalgorithmofDellaPietra,DellaPietra,andLaerty(1995).Ouralgorithmappliestolog-linearmodelsingeneralandisaccompaniedwithsuitableapproximationmeth-odswhenappliedtolargedataspaces.Furthermore,wepresentanapproachforsearchingformostprobableanalysesoftheprobabilisticconstraintlogicprogrammingmodel.Thismethodcanbeappliedtotheambiguityresolutionprobleminnaturallanguageprocessingapplications.1.IntroductionRabiner(1989)identiedthreebasicproblemsofinterestthatmustbesolvedforaHiddenMarkovModeltobeusefulinreal-worldspeechrecognitionapplications:theparameterestimationproblem,theoptimalstatesequenceproblemandtheobservationsequenceprobabilityproblem.Theseproblemsgeneralizetoarbitraryprobabilisticsymbolprocessingmodelsinvariousreal-worldapplicationsinanobviousmanner.Thersttwoproblemscanbestatedinamoregeneralwayasfollows.ThisworkwassupportedbytheGraduiertenkollegIntegriertesLinguistikstudiumattheUniversityofTubingenandbytheTeilprojektB7oftheSonderforschungsbereich340oftheDeutscheForschungsgemeinschaft.IamgreatlyindebtedtoStevenAbneyforcoachingthisworkinmanyhelpfuldiscussions.Furthermore,IwouldliketothankGrahamKatzandDetlefPrescherfortheirvaluablecomments.ISSN0947-6954/97c1997DerAutor12STEFANRIEZLER1.LetanunanalyzedobservationsequenceO=O1;:::Onandaprob-abilisticprocessingmodelwithparametersetbegiven,andsup-posethatthevalueofisunknownandOformsarandomsamplefromthedistributioninvolving,howcanthemodelparametersbeestimated?2.GivenOiand,howcanthemostprobableanalysisoftheinputOibefoundeciently?Recentinterestinprobabilisticmodelsofnaturallanguageprocessingcanbeattributedtothefactthatsolutionstotheabove-mentionedgeneralproblemscanleadquitedirectlytoeective,butconceptuallysimpleandmathematicallyclearsolutionstovariousproblemsinnaturallanguageprocessing.Thisconnectioncanbeillustratedwiththeproblemofambiguityreso-lution(ordisambiguationorparseranking)asfollows:Grammarsdescrib-inganontrivialfragmentofnaturallanguagemayattachalargenumberofdierentanalysestosentencesofreasonablelength.Sincenotalloftheseanalysesareinaccordwithhumanperceptions,thereisclearlyaneedtodistinguishmoreplausibleanalysesofaninputfromlessplau-sibleortotallyspuriousones.Thesimplebuteectiveideaadoptedinprobabilisticgrammarsistoconnecttheplausibilityofananalysiswithitsprobability.Inthisveinthecorrect,i.e.,mostplausibleanalysisofastringisassumedtobethemostprobableanalysisofthestring.Asolutiontoproblem1willadaptthemodelparameterstotheinputcorpusOandthusjustifytheassumptionthatthecorrectparseofastringOiisthemostprobableparseofOiasproducedbythegrammarparametrizedby.Asolutiontoproblem2willyieldanalgorithmtosearchforthemostprobableparseofagiveninputstringOiasproducedbyaprobabilisticgrammarwithparameterset.Mostpopularapproachestosolvingtheseproblemsintheareaofnat-urallanguageprocessingarebasedonBaum’smaximizationtechnique,whichisknownasthe\Baum-Welchalgorithm(BaumandEagon1967;Baum,Petrie,Soules,andWeiss1970;Baum1972).Thisalgorithmesti-matestheparametersofaHiddenMarkovModel,i.e.,astochasticregulargrammar,inaframeworkofmaximumlikelihoodestimationfromincom-pletedata.Thismeans,theparametersareiterativelyreestimateduntilconvergencetoasetofvalueswhichlocallymaximizethelikelihoodfunc-tion,i.e.,theprobabilitythatthemodelassignstothegivenunanalyzedobservationsequence.Inthissensethemodelparametersareadjustedtobestdescribeagivenobservationsequence.Theestimationalgorithmcanbedenedinductivelyforwardsandbackwards,yieldingtheecientPROBABILISTICCONSTRAINTLOGICPROGRAMMING3\forward-backwardalgorithm1.Baker(1979)generalizedthisalgorithmtotheso-called\inside-outsidealgorithm2,whichecientlyestimatestheparametersofastochasticcontext-freegrammars.BothalgorithmsarespecialinstancesoftheEM-algorithmformaximum-likelihoodes-timationfromincompletedata(Dempster,Laird,andRubin1977).Adynamic-programmingapproachsimilartotheoneusedintheecientversionsoftheparameterestimationalgorithmscanbeusedtondthemostprobableanalysisofstochasticcontext-freeandstochasticregulargrammarsandisknownasthe\Viterbi-algorithm(Viterbi1967).TheclassofalgorithmsbaseduponBaum’smaxi

1 / 36
下载文档,编辑使用

©2015-2020 m.777doc.com 三七文档.

备案号:鲁ICP备2024069028号-1 客服联系 QQ:2149211541

×
保存成功