

Persona: a contextualized and personalized web search

Francisco Tanudjaja | fstanud@mit.edu
Lik Mui | lmui@mit.edu
Laboratory of Computer Science at MIT, Cambridge, MA 02139
June 1, 2001

Abstract

Recent advances in graph-based search techniques derived from Kleinberg's work [1] have been impressive. This paper further improves the graph-based search algorithm in two dimensions. Firstly, variants of Kleinberg's techniques do not take into account the semantics of the query string nor of the nodes being searched. As a result, polysemy of query words cannot be resolved. This paper presents an interactive query scheme utilizing the simple web ontology provided by the Open Directory Project to resolve meanings of a user query. Secondly, we extend a recently proposed personalized version of the Kleinberg algorithm [3]. Simulation results are presented to illustrate the sensitivity of our technique. We outline the implementation of our algorithm in the Persona personalized web search system.

1 Overview

Search engines index large numbers of documents and let users query desired documents. However, most search engines are not tailored to meet individual user preferences. [6] noted that almost half of the documents returned by search engines are deemed irrelevant by their users. There are several aspects to the problem. First is the problem of synonyms and homonyms. Synonyms are two words that are spelt differently but have the same meaning. Homonyms are words that are spelt the same but have different meanings. Without prior knowledge, there is no way for the search engine to predict user interest from simple text-based queries. Secondly, search engines should be deterministic in that they should return the same set of documents to all users with the same query at a certain time. Therefore it is inherent that search engines are not designed to adapt to personal preferences.

Current information retrieval and data mining research tries to enhance the user's web experience from several directions. One direction is to create a better structural model of the web, such that it can interface more efficiently with search engines. Another approach is to model user behavior so as to predict users' interests better.

Along the lines of the former are efforts at better defining the meaning of queries themselves. The Wordnet project at Princeton University is an online lexical reference system that organizes English words into synonym sets [7]. A similar approach is to build a taxonomy of words. A taxonomy consists of a tree structure in which a word belongs to a certain node, each with parents and children. A node's parent serves as a general category that encompasses all of its children. A node may have children that are sub-categories of itself. Examples of such word taxonomies are the Open Directory Project [] and the Magellan hierarchy [].

Yet another approach is to create a semantic structure in machine-readable format. As opposed to classifying content from a person's point of view, this method embeds metadata for classification, allowing document content to be machine readable. There are currently efforts at standardizing these classifications, for example OIL (Ontology Interchange Language) and DAML (DARPA Agent Markup Language). Haystack [4] is an ongoing project in semantic metadata indexing.

Along the lines of the latter approach, various research efforts in data mining and knowledge representation have built models to record user interest and predict user behavior. Ultimately, these user models interface with a system so as to give it a priori knowledge regarding user preferences.

Clearly, work in user profiling is closely related to building better personalized systems. Different methods of gathering user data are often coupled with various personalization systems. We found that the combinations that are available in the context of personalized search are unsatisfactory. We propose a novel approach to building a better system, with the following contributions. First, we extend existing theory with regard to personalized search. Second, we propose to model users' interests using an interactive query scheme utilizing the web ontology provided by the Open Directory Project.

To support our argument, we have built an implementation of a personalized search engine. The system wraps a personalization module onto an existing search engine, and refines search results using the proposed extension of the graph-based algorithm. At its core, the proposed system utilizes a taxonomy of user interest and disinterest. We use a tree coloring method to represent user profiles. Visited nodes are 'colored' by the number of times they are visited, whether the user rates them as positive or negative, and the URLs associated with them.

In addition, we run sets of controlled experiments to analyze the performance of each of the existing variants. The experimental results verify our predictions and confirm that the proposed extension performs better.

We offer a roadmap of this document. Section 2 outlines related work in personalized web browsing and reviews existing methods using graph-based search algorithms. Section 3 describes our extension to existing theory. Section 4 describes the user modeling technique. Section 5 outlines the implementation of Persona. Section 6 describes the simulation results. We conclude in section 7 with some directions for future work.

2 Related Works

2.1 Examples of personalization applications

Personalization applications cover a broad spectrum. At one end of the spectrum, we have filtering systems, which filter input from an information resource. Information of possible interest is marked. An example of such a filtering system, SmartPush [8], combines several novel ideas. The system finds information by means of semantic metadata to filter news articles. In addition, it builds the user profile using a simple hierarchical concept model. For example, under the category news, there are the categories sports, literature, economics, etc. The model records user preference by giving weightings to the
