High Performance Storage System Scalability: Architecture, Implementation and Experience

Richard W. Watson
Lawrence Livermore National Laboratory
dwatson@llnl.gov

Proceedings of the 22nd IEEE / 13th NASA Goddard Conference on Mass Storage Systems and Technologies (MSST 2005), 0-7695-2318-8/05 $20.00 © 2005 IEEE

Abstract

The High Performance Storage System (HPSS) provides scalable hierarchical storage management (HSM), archive, and file system services. Its design, implementation and current dominant use are focused on HSM and archive services. It is also a general-purpose, global, shared, parallel file system, potentially useful in other application domains. When HPSS design and implementation began over a decade ago, scientific computing power and storage capabilities at a site, such as a DOE national laboratory, were measured in a few 10s of gigaops, data archived in HSMs in a few 10s of terabytes at most, data throughput rates to an HSM in a few megabytes/s, and daily throughput with the HSM in a few gigabytes/day. At that time, the DOE national laboratories and IBM HPSS design team recognized that we were headed for a data storage explosion driven by computing power rising to teraops/petaops, requiring data stored in HSMs to rise to petabytes and beyond, data transfer rates with the HSM to rise to gigabytes/s and higher, and daily throughput with an HSM to rise to 10s of terabytes/day. This paper discusses HPSS architectural, implementation and deployment experiences that contributed to its success in meeting the above orders-of-magnitude scaling targets. We also discuss areas that need additional attention as we continue significant scaling into the future.

1. Introduction

The High Performance Storage System (HPSS) provides scalable hierarchical storage management (HSM), archive, and file system services. Its design, implementation and current dominant use are focused on HSM and archive services. It is also a general-purpose, global, shared, parallel file system, potentially useful in other application domains. When HPSS design and implementation began over a decade ago, scientific computing power and storage capabilities at a site, such as a DOE national laboratory, were measured in a few 10s of gigaops, data archived in HSMs in a few 10s of terabytes at most, data throughput rates to an HSM in a few megabytes/s, and daily throughput with the HSM in a few gigabytes/day. At that time, the DOE national laboratories¹ and IBM HPSS design team recognized that we were headed for a data storage explosion driven by computing power rising to teraops/petaops, requiring data stored in HSMs to rise to petabytes and beyond, data transfer rates with the HSM to rise to gigabytes/s and higher, and daily throughput with an HSM to rise to 10s of terabytes/day. Therefore, we set out to design and deploy a system that would scale and evolve from the base above toward these expected targets. These targets have been successfully met.

¹ Lawrence Livermore (LLNL), Los Alamos (LANL), Lawrence Berkeley-National Energy Research Supercomputer Center (NERSC), Oak Ridge (ORNL), and Sandia (SNL) National Laboratories.

While rapid increases in computational power and memory, storage device capacity, and networking bandwidth have made these increases in storage system capacity and performance possible, this hardware potential cannot be fully realized or exploited without proper attention to software architecture, implementation and deployment. Even assuming new, faster hardware and a properly designed and implemented storage system, successful scaling, particularly for data transfer, is not just a matter of plugging in the new hardware, changing a few configuration settings and running the system. It requires careful attention to all phases of the end-to-end process.

There are many dimensions of scalability to which a storage system architecture and implementation must pay attention. This paper discusses those dimensions and illustrates the architectural approach and some of the implementation choices and deployment experiences that have facilitated achieving scalability in these dimensions. It also discusses some areas where further work is required as the system continues to scale across these dimensions in the future.

Scalable data throughput: This dimension focuses on end-to-end I/O throughput, both for single files and for the aggregate throughput of many simultaneous file transfers or I/O operations.

Scalable storage capacity and storage space management: This dimension includes scaling storage capacity, numbers and types of storage devices, and files and file sizes. It also includes scalable space management for migration and purge of disk cache.

Scalable robustness: This dimension includes the ability of the system (1) to tolerate or recover from hardware failures without loss of user data or system metadata and (2) to maintain the consistency of both user data and system metadata in the face of concurrent accesses during normal operation.

Scalable name service: This dimension for HPSS involves a scalable hierarchical directory service with virtually unlimited numbers of directories and directory entries, and a global name space spanning multiple distributed HPSS systems. It also includes scaling the number of simultaneous directory accesses and access performance.

Scalable numbers of clients: This dimension includes increasing numbers of both end users and internal clients, and their associated concurrent operations.

Scalable deployment across geographical distances and multiple cooperating institutions: This dimension involves distribution of data storage devices and metadata for performance and robustness, and integration of multiple storage systems into a global name space and secure environment.

Scalable storage system management: This dimension enables system administrators to manage and config…
