Performance Evaluation of a Multilevel Load Balanc

整理文档很辛苦,赏杯茶钱您下走!

免费阅读已结束,点击下载阅读编辑剩下 ...

阅读已结束,您可以下载文档离线阅读编辑

资源描述

FACULDADEDEINFORM´ATICAPUCRS-Brazilˆea,R.Chanin,A.Sales,R.Scheer,A.ZorzoTECHNICALREPORTSERIES—————————————————————————————————Number048July,2005Contact:mcorrea@inf.pucrs.brchanin@inf.pucrs.brasales@inf.pucrs.brroque.scheer@hp.comzorzo@inf.pucrs.brCopyrightcFaculdadedeInform´atica-PUCRSPublishedbyPPGCC-FACIN-PUCRSAv.Ipiranga,668190619-900PortoAlgre-RS-Brasil1IntroductionThedemandforcomputationalpowerhasbeenincreasingthroughoutthepastyears.Severalsolutionsarebeingusedinordertosupplysuchdemand,e.g.,clustersofworkstations[6]andsharedmemorymultiprocessors.Althoughclustershavelowercost,theiruseimpliesinagreatspecializedprogrammingeffort.ItisnecessarytobuildnewapplicationsorporttheexistingonestoexecuteintheseenvironmentsthroughtheuseofspecificAPI’s,suchasMPI(MessagePassingInterface)[23].Ontheotherhand,sharedmemorymultiprocessorcomputersaremoreexpensive,butsimplertouse,sinceallresourcesaremanagedbyasingleoperatingsystem.SharedmemorymultiprocessorcomputerscanbeclassifiedasUMA(UniformMemoryAccess)orNUMA(Non-UniformMemoryAccess)computers[16].InUMAcomputerseachprocessorcanaccessanymemoryareawiththesameaveragecost.Thispropertysimplifiestheloadbalancing:aprocesscanbemovedtoanyprocessorwithoutanyimpacttoitsmemoryaccesstime,iftheprocessisnotcache-hot.ThemajordrawbackofUMAarchitecturesisthatthenumberofprocessorsislimitedbythecontentiononaccesstomemory,whichbecamesabottleneck,sincetheyallsharethesamememorybus.NUMAarchitecturesallowagreaternumberofprocessorsbecauseprocessorsandmemoryaredistributedinnodes.Memoryaccesstimesdependontheprocessorthataprocessisexecutingandontheaccessedmemoryarea.Thus,theloadbalancingonthesemachinesismorecomplex,sincemovingaprocesstoanodethatisdistantfromitsmemoryareacanincreaseprocessexecutiontime.Loadbalancingforparallelenvironmentsisaproblemthathasbeendeeplystud-iedforalongtime.However,mostofthesestudiesarefocusedonuser-levelloadbalancing,whereusersofthesystemmustknowtheirapplicationsbehaviorandprovideinformationtotheloadbalancingalgorithm.Inthissense,therearemanyproposalsfordifferentplatforms,forexampleclusters[2,5,26]andcomputationalgrids[13,24].SomeauthorshavealsopresentedsolutionsorstudiesfortheloadbalancingproblemonNUMAcomputers.Zhu[25],forinstance,proposesaclusterqueuestructureforprocessesbasedonahierarchicalstructureinordertosolvethelimitationofsinglequeuesystemsandtheloadimbalanceproblemthatresultsofdistributedqueues.Focht[12],ontheotherhand,describesanalgorithmbasedontheLinuxloadbalancingalgorithmthattriestoattractprocessesbacktotheirorig-inalnodeswhentheyaremigrated.TherehasbeenalsosomestudiespresentinganalysisofloadbalancingalgorithmsonNUMAmachines[7].In[10]weproposedanalgorithmthatallowsLinuxtoperformmultilevelloadbalancinginNUMAcomputers.ThecurrentLinuxloadbalancingalgorithmusesastructurecalledscheddomaintobuildahierarchythatrepresentsthemachine’s3topology.Basedonthishierarchy,Linuxtriestokeepprocessesclosertotheirmemoryareas,movingthemamongprocessorsinthesamenodebeforeperform-inginter-nodemigration[4].ThealgorithmthatisresponsibleforconstructingthishierarchyassumesthatallNUMAmachineshaveonlytwomemoryaccesslevels.However,thereareNUMAcomputerswithmorethantwomemoryaccesslevels.Thehierarchybuiltforthesemachines,therefore,doesnotrepresenttheirtopologycorrectly,causingunappropriateloadbalancing.Tocopewiththisproblem,wepro-posedagenericalgorithmtobuildamultilevelscheddomainhierarchy,accordingtothenumberofmemoryaccesslevelsthattheNUMAcomputercontains.InthispaperweevaluatetheperformanceoftheLinuxloadbalancingalgorithmindifferentNUMAarchitectures,usingthe2-levelscheddomainhierarchybuiltbythecurrentLinuxversionandusingann-levelhierarchybuiltbyourproposedalgo-rithm.Toperformthisevaluationweusetwodifferentapproaches:simulationandanalyticalmodels.ThesimulationmodelisdevelopedusingtheJavaSimsimula-tiontool[17],andtheanalyticalmodelisdescribedusingtheStochasticAutomataNetworks(SAN)formalism[22].Thispaperisorganizedasfollows.Section2describesthecurrentLinuxloadbalancingalgorithmandourproposaltoallowLinuxtoperformmultilevelloadbalancing.Section3describestheimplementationofourproposal.Sections4and5presentthesimulationandanalyticalmodelsrespectively,whichwereusedtocompareLinuxloadbalancingperformanceusingthecurrentalgorithmandthepro-posedsolution.Section6showstheresultsofbothevaluationmodelsanddemon-stratethatmultilevelloadbalancingcanpresentabetterperformanceintermsofaverageprocessesexecutiontimethanthecurrentLinuxloadbalancingalgorithm.Finally,Section7assessesfutureworkandemphasizesthemaincontributionsofthispaper.2LoadBalancinginNUMAComputersInaNUMAcomputer,processorsandmainmemoryaredistributedinnodes.Eachprocessorcanaccesstheentirememoryaddressspace,butwithdifferentlatencytimes[16].Ingeneral,ifthesystemhasasmallnumberofprocessors,themachinehasonlytwomemoryaccesslevels.Forexample,Figure1showsthearchitectureofaHPIntegritySuperdomeserver[14]with4nodesand16processors.thismachinehastwodifferentmemorylatencies:whenaproc

1 / 20
下载文档,编辑使用

©2015-2020 m.777doc.com 三七文档.

备案号:鲁ICP备2024069028号-1 客服联系 QQ:2149211541

×
保存成功