Streamline A Scheduling Heuristic for Streaming Ap

整理文档很辛苦,赏杯茶钱您下走!

免费阅读已结束,点击下载阅读编辑剩下 ...

阅读已结束,您可以下载文档离线阅读编辑

资源描述

Streamline:ASchedulingHeuristicforStreamingApplicationsontheGridBikashAgarwalla,NovaAhmed,DavidHilley,UmakishoreRamachandranCollegeofComputing,GeorgiaInstituteofTechnology801AtlanticDriveNW,Atlanta,GA30332-0280,USAEmail:fbikash,nova,davidhi,ramag@cc.gatech.eduAbstractStreamingapplicationssuchasvideo-basedsurveil-lance,habitatmonitoring,andemergencyresponsearegoodcandidatesforexecutingonhigh-performancecom-puting(HPC)resources,duetotheirhighcomputationandcommunicationneeds.Suchanapplicationcanberepre-sentedasacoarse-graindataflowgraph,eachnodecor-respondingtoastageofthepipelineoftransformationsthatmaybeappliedtothedataasitcontinuouslystreamsthrough.MappingsuchapplicationstoHPCresourceshastobesensitivetothecomputationandcommunicationneedsofeachstageofthepipelinetoensureQoScriteriasuchaslatencyandthroughput.Duetothedynamicna-tureofsuchapplications,theyareidealcandidatesforus-ingambientHPCresourcesmadeavailableviathegrid.Sincegridhasevolvedoutoftraditionalhigh-performancecomputing,thetoolsavailable,especiallyforscheduling,tendtobemoreappropriateforbatch-orientedapplica-tions.Wehavedevelopedascheduler,calledStreamline,thattakesintoaccountdynamicnatureofthegridandrunsperiodicallytoadaptschedulingdecisionsusingapplica-tionrequirements(per-stagecomputationandcommunica-tionneeds),applicationconstraints(suchasco-locationofstagesonthesamenode),andcurrentresourceavailability.Theschedulerisdesignedtobeintegratedwiththeexist-inggridframeworkusingGlobusToolkit.TheperformanceofStreamlineiscomparedwithanOptimalplacementforsmallnumberofresourcesandapproximationalgorithmsusingSimulatedAnnealingforlargeresourcesanddataflowgraphs.WehavealsocomparedStreamlinewithabaselinegridscheduler,E-Condor,builtontopofCondorforstream-ingapplications.Forkernelsofsuchstreamingapplica-tions,weshowthatourheuristicperformsclosetoOptimal,andcanbenearlyanorderofmagnitudebetterthantheE-Condorundernon-uniformloadconditions.WeconsideredtwoSimulatedAnnealingalgorithmswithdifferentexecu-tiontimes,andshowthatneighbor-selectionandannealingschedulehavearelativelylargerimpactontheperformanceofSimulatedAnnealingforcommunication-intensiveker-nelsthanforcomputationintensivekernels.WehavealsoconductedscalabilitystudiesandshowthatourschedulerismoreeffectivethanE-Condor,andperformsclosetoSim-ulatedAnnealingalgorithms,withsmallerschedulingtime,inallocatingresourcesforalargestreamingapplication.1IntroductionAdvancesinsensingcapabilities,andcomputingandcommunicationinfrastructuresarepavingthewayfornewanddemandingapplications.Video-basedsurveillance,emergencyresponse,disasterrecovery,habitatmonitor-ing,andtelepresenceareallexamplesofsuchapplications.Theseapplicationsintheirfullformarecapableofstress-ingtheavailablecomputingandcommunicationinfrastruc-turestotheirlimits.Streamingapplications,aswerefertosuchapplicationsinthispaper,havethefollowingcharac-teristics:(1)theyarecontinuousinnature,(2)theyrequireefcienttransportofdatafrom/todistributedsources/sinks,and(3)theyrequiretheefcientuseofhigh-performancecomputingresourcestocarryoutcompute-intensivetasksinatimelymanner.Thefocusofthisworkisinaddressingthethirdcom-ponentoftheabovecharacteristics,namely,theuseofhigh-performancecomputing(HPC)resourcestocarryoutcompute-intensivetasks.Considerforexample,avideo-basedsurveillanceapplication.Thecomputeintensivepartofsuchanapplicationmayconsistofanalyzingmultiplecamerafeedsfromaregiontoextracthigherlevelinfor-mationsuchas“motion”,“presenceorabsenceofahu-manface”,or“presenceorabsenceofanykindofsuspi-ciousactivity”.Suchanapplicationcanberepresentedasacoarse-graindataowgraph,whereinthenodesrepresentincreasingsophisticationofcomputationsthatmayneedtobeperformedonthedatastreamtofacilitatetheextractionofhigh-levelinformation.Atsomelevelsuchcoarse-graindataowgraphsresem-bletask-graphsthathavebeenthefocusofmultiprocessorschedulingworkfromthe70's[2,11,23,17,24,16,20,26].However,thereareseveralcrucialdifferences.Inmultipro-cessorscheduling,atask-graph(adirectedacyclicgraph)isusedasanorderingmechanismtoshowthedependen-ciesamongtheindividualtasksofaparallelcomputation.Thesedependenciesarerespectedandexploitedinarriv-ingatamappingheuristic(sincetheschedulingproblemisknowntobeNP-Complete)thatmaximizestheutiliza-tionofthecomputationalresourcesandreducesthecom-pletiontimeoftheapplication.Thecoarse-graindataowgraphofastreamingapplication,ontheotherhand,isarepresentationoftheprocessingthatiscarriedoutonthedataduringitspassagethroughthepipelineofstages.Inthesteadystate,allthestagesofthepipelineareworkingondifferentsnapshotsofthestreamdata.Forexampleinavideo-basedsurveillanceapplication,whenthen-thstageisworkingonthe(hithertotransformedresultsofthe)ithframeofvideo,therststageofthepipelineisworkingonthe(n+i)thframe.Sotheschedulingofsuchstreamingapplicationsisnotanorderingissue;rather,itisamatterofmappingthedifferentstagesofthepipelinetotheavailableresourcesrespectingthecomputationandcommunicationrequirementsofeachstagewithaviewtooptimizingthelatencyandthroughputmetricsoftheentirepipeline.Aninter

1 / 15
下载文档,编辑使用

©2015-2020 m.777doc.com 三七文档.

备案号:鲁ICP备2024069028号-1 客服联系 QQ:2149211541

×
保存成功