大数据管理:概念、技术与挑战作者:孟小峰,慈祥,MengXiaofeng,CiXiang作者单位:中国人民大学信息学院北京100872刊名:计算机研究与发展英文刊名:JournalofComputerResearchandDevelopment年,卷(期):2013,50(1)被引用次数:574次参考文献(167条)1.NatureBigData20122.BryantRE;KatzRH;LazowskaEDBig-Datacomputing:Creatingrevolutionarybreakthroughsincommerce,science,andsociety20123.ScienceSpecialonlinecollection:Dealingwithdata20124.AgrawalD;BernsteinP;BertinoEChallengesandopportunitieswithbigdata-AcommunitywhitepaperdevelopedbyleadingresearchersacrosstheUnitedStates20125.ManyikaJ;ChuiM;BrownBBigdata:Thenextfrontierforinnovation,competition,andproductivity20126.WorldEconomicForumBigdata,bigimpact:Newpossibilitiesforinternationaldevelopment20127.BigDataAcrosstheFederalGovernment20128.UNGlobalPulseBigDataforDevelopment:Challenges&Opportunities20129.TimesNYTheageofbigdata201210.GrobelnikMBigdatacomputing:Creatingrevolutionarybreakthroughsincommerce,science,andsociety201211.BarwickHThefourVsofBigData.ImplementingInformationInfrastructureSymposium201212.IBMWhatisbigdata201213.Bigdata201214.HeyT;TansleyS;TolleKTheFourthParadigm:DataintensiveScientificDiscovery200915.LazcrDComputationalsocialscience200916.WattsDJAtwenty-firstcenturyscience2007(7127)17.TheEconomistData,data,everywhereAspecialreportonmanaginginformation201218.KumarRTwocomputationalparadigmforbigdata201219.InformationWeekReportThebigdatamanagementchallenge201220.Storm201221.NeumeyerL;RobbinsB;NairAS4:DistributedStreamComputingPlatform201022.GoodhopeK;KoshyJ;KrepsJBuildingLinkedln'sRealtimeActivityDataPipeline2012(02)23.DeanJ;GhemawatSMapReduce:Simplifieddataprocessingonlargeclusters200424.DasSDataInfrastructureatLinkedIn201125.ScholarSpace201226.HaasLIntegratingExtremelyLargeDataisExtremelyChallenging27.RajaramanA;JeffUllmanMiningofMassiveDatasets201228.ChapmanA;AllenMD;BlausteinBIt'sAbouttheData:ProvenanceasaToolforAssessingDataFitness201229.Hadoop201230.GhemawatS;GobioffH;LeungSTTheGooglefilesystem200331.McKusickK;QuinlanSGFS:Evolutiononfast-forward2010(03)32.ChaikenR;JenkinsB;LarsonP-ASCOPE:Easyandefficientparallelprocessingofmassivedatasets2008(02)33.HDFSArchitectureGuide201234.CloudStore201235.BeaverD;KumarS;LiHCFindingaNeedleinHaystack:Facebook'sPhotoStorage201036.TFS201237.FastDFS201238.BrewerEATowardsrobustdistributedsystems(InvitedTalk)200039.ChangF;DeanJ;GhemawatSBigtable:Adistributedstoragesystemforstructureddata200640.DeCandiaG;HastorunD;JampaniMDynamo:Amazon'shighlyavailablekey-valuestore200741.CooperBF;RamakrishnanR;SrivastavaUPNUTS:Yahoo!'shosteddataservingplatform2008(02)42.NOSQLDatabases201243.StrauchCNoSQLDatabases201244.BakerJ;BondC;CorbettJMegastore:ProvidingScalable,HighlyAvailableStorageforInteractiveServices201145.CorbettJC;DeanJ;EpsteinMSpanner:Google'sglobally-distributeddatabase201246.ShuteJ;OanceaM;EllnerSF1:Thefault-tolerantdistributedRDBMSsupportinggoogle'sadbusiness201247.PengD;DabekFLarge-scaleincrementalprocessingusingdistributedtransactionsandnotifications201048.IyerSC;UttsMHelptestsomenext-generationinfrastructure201249.WangHaixunKDDsummerschool,2012.ManagingandMiningBillion-NodeGraphs201250.ITHbase201251.IHbase201252.ZouYongqiang;LiuJia;WangShicaiCCIndex:Acomplementalclusteringindexondistributedorderedtablesformulti-dimensionalrangequeries201053.AgrawalP;SilbersteinA;CooperBFAsynchronousviewmaintenanceforVLSDdatabases200954.WangJinbao;WuSai;GaoHongIndexingmultidimensionaldatainacloudsystem201055.DingLinlin;QiaoBaiyou;WangGuorenAnefficientquad-treebasedindexstructureforclouddatamanagement201156.ZhangXiangyu;AiJing;WangZhongyuanAnefficientmulti-dimensionalindexforclouddatamanagement200957.PapadopoulosA;KatsarosDA-Tree:Distributedindexingofmultidimensionaldataforcloudcomputingenvironments201158.NishimuraS;DasS;AgrawalDMDHBase:Ascalablemulti-dimensionaldatainfrastructureforlocationawareservices201159.MaYouzhong;RaoJia;HuWeisongAnefficientindexformassiveIOTdataincloudenvironment201260.MalewiczG;AusternMH;BikAJCPregel:Asystemforlarge-scalegraphprocessing201061.LeslieGValiant:Abridgingmodelforparallelcomputation1990(08)62.MelnikS;GubarevA;LongJingjingDremel:Interactiveanalysisofweb-scaledatasets2010(01)63.GoogleBigQuery201264.HallA;BachmannO;BüssowRProcessingatrillioncellspermouseclick2012(11)65.IsardM;BudiuM;YuYuanDryad:Distributeddata-parallelprogramsfromsequentialbuildingblocks200766.Cascading201267.GuY;GrossmanRLSectorandsphere:Thedesignandimplementationofahigh-performancedatacloud2009(1897)68.BattréD;EwenS;HueskeFNephele/PACTs:Aprogrammingmodelandexecutionframeworkforweb-scaleanalyticalprocessing201069.GundaPK;RavindranathL;ThekkathCANectar:Automaticmanagementofdataandcomputationindatacenters201070.PopaL;BudiuMDryadInc:Reusingworkinlargescalecomputations200971.BhatotiaP;WiederA;RodriguesRIncoop:MapReduceforincrementalcomputations201172.YanCairong;YangXin;YuZeIncMR:IncrementaldataprocessingbasedonMapReduce201273.OlstonC;ChiouG;ChitnisLNova:ContinuousPig/Hadoopworkflows201174.CondieT;ConwayN;AlvaroPMapReduceOnline201075.ShiYingjie;MengXiaofeng;WangFushengYoucanstopearlywithCOLA:Onlineprocessingofaggregatequeri