Computers Seeing People

Irfan A. Essa
College of Computing, GVU Center
Georgia Institute of Technology
Atlanta, GA 30332-0280
irfan@cc.gatech.edu

1 Introduction

One of the most exciting and challenging research quests of the last thirty years has been the building of machines that can see. Much effort has been expended on “automatic deduction of structure of a possibly dynamic three-dimensional world from two-dimensional images.” [65]. There has been considerable progress in the areas of object recognition, image understanding, and scene reconstruction from single and multiple images. This progress, coupled with improvements in computational power, has prompted a new research focus: making machines that can see people, recognize them, and interpret their gestures, expressions, and actions. In this paper, we present methods that give machines the ability to see people, interpret their actions, and interact with them. We present the motivating factors behind this work, examples of how such computational methods are developed, and their applications.

The basic reason for providing machines the ability to see people really depends on the task we are associating with a machine. An industrial vision system aimed at extracting defects on an assembly line need not know anything about people. Similarly, a computer used for email and text writing need not see and perceive the user's gestures and expressions. However, if our interest is to build intelligent machines that can work with us, support our needs, and be our helpers, then it may be required for these machines to know more about whom they are supporting and helping. If our computers are to do more than support our text-based needs like writing papers, spreadsheets, and communicating via email, and perhaps take on the role of a personal assistant, then the ability to see a person is essential. Such an ability to perceive people is something that we take for granted in our everyday interactions with each other.

At present our model of a machine, or more specifically of a computer, is something that is placed in the corner of the room. It is deaf, dumb, and blind, having no sense of the environment that it is in or of the person that is near it. We communicate with this computer using a coded sequence of tappings on a keyboard. Imagine a computer that knows that you are near it, that you are looking at it, knows who you are and what you are trying to do. Imagine a machine that can interpret a video signal based on who is in the scene and what they are doing. Such abilities in a computer are hard to imagine unless it has an ability to perceive people.

Speech perception has made much progress in recent years, and some amazing results in word spotting have recently been presented [28,26,75]. The computer vision field has also taken to this problem and has made some amazing progress in recent years. The important problems that computer vision aims to address in order to make machines that see people are the following. A machine first needs to determine whether someone is near it (where) and how many people are in its field of view.1 The next step is to identify who each person is. After it has identified the people, it can interpret facial expressions, hand gestures, and body language to determine what the people want or are doing in the scene, and why. In the upcoming sections, we will detail the specifics of the approaches to determine where, how many, who, what, and why with reference to people in a scene. Needless to say, none of these issues can be determined entirely on its own; each one depends on the others as dictated by context. Before we get into details, it is important to talk briefly about the applications of such a technology.

Applications

Applications of computer vision methods aimed specifically at seeing people are many and encompass several different areas.

Effective human-computer interaction: Imagine computers that can interact with you as we can interact with each other, using speech and gestures. Such computers will be aware of when you are looking at them, will be able to detect where you are pointing, and will interpret your gestures. These types of gestural interfaces are an integral part of a growing trend towards more human-centered interfaces in HCI research. Specific applications for this technology arise in areas where traditional interfaces such as the keyboard and mouse are not effective. Such techniques will allow us to move towards more non-invasive and unencumbered interfaces than are at present prevalent in user interfaces, allowing for interactive visualization of multi-dimensional scientific data and user-centered direct interaction with virtual environments.

Smart and Interactive Environments: Machines that can see will aid us in developing smart rooms: rooms that know who is where and what they are doing. Such rooms can be used to monitor children, senior citizens, or physically challenged individuals and provide assistance and care as needed [68].

Surveillance and Security: A more traditional application of this work is surveillance and security. Face recognition has become quite a useful technology in the security industry, where access is allowed based on facial identity. Similarly, a lot of work is being done to aid in the search for criminals via automated searches of mug-shot databases. Recently, there has been work aimed at recognition of people's actions for active surveillance applications.

Entertainment, Education, and Training: Two areas of most rapid growth these days are education and entertainment. Computer vision methods for noninvasive tracking and interpretation of human activities will help revolutionize various aspects of these areas too. An intelligent tutor that can see will be far more responsive to the needs of its pupil if the tutor can judge by the actions and moods of the student whether she/he is confused, frustrated, o
