Abstract Features for Image Retrieval An Experimen

jiangquan999
1 ℃
2020-04-19

整理文档很辛苦，赏杯茶钱您下走！

还剩 ... 页未读，继续阅读 >>

免费阅读已结束，点击下载阅读编辑剩下 ... 页

阅读已结束，您可以下载文档离线阅读编辑

资源描述

FeaturesforImageRetrieval:AnExperimentalComparisonThomasDeselaers1,DanielKeysers2,andHermannNey11HumanLanguageTechnologyandPatternRecognition,ComputerScienceDepartment,RWTHAachenUniversity,Germany{deselaers,ney}@cs.rwth-aachen.de2ImageUnderstandingandPatternRecognition,GermanResearchCenterforArtiﬁcialIntelligence(DFKI),Kaiserslautern,Germanydaniel.keysers@dfki.deNovember29,2007AbstractAnexperimentalcomparisonofalargenumberofdiﬀerentimagedescriptorsforcontent-basedimageretrievalispre-sented.Manyofthepapersdescribingnewtechniquesanddescriptorsforcontent-basedimageretrievaldescribetheirnewlyproposedmethodsasmostappropriatewithoutgivinganin-depthcomparisonwithallmethodsthatwereproposedearlier.Inthispaper,weﬁrstgiveanoverviewofalargevari-etyoffeaturesforcontent-basedimageretrievalandcomparethemquantitativelyonfourdiﬀerenttasks:stockphotore-trieval,personalphotocollectionretrieval,buildingretrieval,andmedicalimageretrieval.Fortheexperiments,ﬁvedif-ferent,publiclyavailableimagedatabasesareusedandtheretrievalperformanceofthefeaturesisanalysedindetail.Thisallowsforadirectcomparisonofallfeaturesconsid-eredinthisworkandfurthermorewillallowacomparisonofnewlyproposedfeaturestotheseinthefuture.Additionally,thecorrelationofthefeaturesisanalysed,whichopensthewayforasimpleandintuitivemethodtoﬁndaninitialsetofsuitablefeaturesforanewtask.Thearticleconcludeswithrecommendationswhichfeaturesperformwellforwhattypeofdata.Interestingly,theoftenused,butverysimple,colourhistogramperformswellinthecomparisonandthuscanberecommendedasasimplebaselineformanyapplications.1IntroductionImageretrievalingeneralandcontent-basedimageretrieval(CBIR)inparticulararewell-knownﬁeldsofresearchinin-formationmanagementinwhichalargenumberofmethodshavebeenproposedandinvestigatedbutinwhichstillnosatisfyinggeneralsolutionsexist.Theneedforadequateso-lutionsisgrowingduetotheincreasingamountofdigitallyproducedimagesinareaslikejournalism,medicine,andpri-vatelife,requiringnewwaysofaccessingimages.Forexam-ple,medicaldoctorshavetoaccesslargeamountsofimagesdaily[1],home-usersoftenhaveimagedatabasesofthousandsofimages[2],andjournalistsalsoneedtosearchforimagesbyvariouscriteria[3,4].Inthepast,severalCBIRsystemshavebeenproposedandallthesesystemshaveonethingincommon:imagesarerepresentedbynumericvalues,calledfeaturesordescriptors,thataremeanttorepresenttheprop-ertiesoftheimagestoallowmeaningfulretrievalfortheuser.OnlyrecentlyhavesomestandardbenchmarkdatabasesandevaluationcampaignsbeencreatedwhichallowforaquantitativecomparisonofCBIRsystems.Thesebench-marksallowforthecomparisonofimageretrievalsystemsunderdiﬀerentaspects:usabilityanduserinterfaces,combi-nationwithtextretrieval,oroverallperformanceofasystem.However,toourknowledge,noquantitativecomparisonofthebuildingblocksofthesystems,thefeaturesthatareusedtocompareimages,hasbeenpresentedsofar.In[5]amethodforcomparingimageretrievalsystemswasproposedrelyingontheCoreldatabase,whichhasrestrictedcopyrights,isnolongercommerciallyavailabletoday,andcanthereforenotbeusedforexperimentsthataremeanttobeabasisforothercomparisons.AnotheraspectofevaluatingCBIRsystemsarethere-quirementsoftheusers.In[3]and[4]studiesofuserneedsinsearchingimagearchivesarepresentedandtheoutcomeinbothstudiesisthatCBIRaloneisveryunlikelytoful-ﬁlltheneedsbutthatsemanticinformationobtainedfrommetadataandtextualinformationisanimportantadditionalknowledgesource.Althoughtodaythesemanticanalysisandunderstandingofimagesismuchfurtherdevelopedduetotherecentachievementsinobjectdetectionandrecognition,stillmostoftherequirementsspeciﬁedarenotsatisﬁablefullyautomatically.Therefore,inthispaperwecomparetheper-formanceofalargevarietyofvisualdescriptors.Thesecanthenlaterbecombinedwiththeoutcomeoftextualinforma-tionretrievalasdescribede.g.in[6].1Themainquestionweaddressinthispaperis:Whichfea-turesaresuitableforwhichtaskinimageretrieval?Thisquestionisthoroughlyinvestigatedbyexaminingtheperfor-manceofawidevarietyofdiﬀerentvisualdescriptorsforfourdiﬀerenttypesofCBIRtasks.Thequestionofwhichfeaturesperformhowwelliscloselyrelatedtothequestionwhichfeaturescanbecombinedtoobtaingoodresultsinaparticulartask.Althoughwedonotdirectlyaddressthisquestionhere,theresultsfromthispaperleadtoanewandintuitivemethodtochooseanap-propriatecombinationoffeaturesbasedonthecorrelationoftheindividualfeatures.Fortheevaluationofthefeaturesweuseﬁvediﬀerentpub-liclyavailabledatabaseswhichareagoodstartingpointtoevaluatetheperformanceofnewimagedescriptors.AlthoughtodayvariousinitiativesforevaluationofCBIRsystemshaveevolved,onlyfewofthemresultedinevaluationcampaignswithparticipantsandresults:Benchathlon1wasstartedin2001andlocatedattheSPIEElectronicImagingconferencebuthasbecomesmallerovertime.TRECVID2isaninitiativebytheTREC(TextRetrievalConference)onvideoretrievalinwhichvideoretrievalsystemsarecom-pared.ImageCLEF3ispartoftheCross-LanguageEvalua-tionFramework(CLEF)andstartedin2003withonlyonetaskaimingatacombinationofmulti-lingualinformationre-trievalwithCBIR.In2004,itcomprisedthreetasks,oneofthemfocusedonvisualqueri