片上多核处理器架构

整理文档很辛苦,赏杯茶钱您下走!

免费阅读已结束,点击下载阅读编辑剩下 ...

阅读已结束,您可以下载文档离线阅读编辑

资源描述

QiZhang,USTC片上多核处理器架构QiZhang(张琦)CS,USTCxiaoga@mail.ustc.edu.cnDec.2007QiZhang,USTC2提纲QiZhang,USTC3片上多核处理器IBM2001,双核RISC处理器Power42006,Cell处理器HP2004,PA-RISC8800双核处理器SUN2004,UltraSPARCIV双核处理器AMD2005,Opteron(酷龙,服务器和工作站)2005,Athlon64X2双核系列(速龙,台式机)2007,Barcelona四核(巴塞罗那)INTEL2006,Woodcrest(Xeon5100)2006,Clovertown四核QiZhang,USTC4发展趋势提高处理器性能提高主频更多核心主频的提高带来功耗的提高,传统的体系结构技术已面临瓶颈,纷纷转向多线程和多内核。QiZhang,USTC5片上多核处理器体系结构CMP(ChipMulti-Processor)将多个计算内核集成在一个处理器芯片中,从而提高计算能力同构多核Intel,AMD异构多核Cell(主处理核+协处理核)QiZhang,USTC6核间通信硬件结构必须支持核间通信CMP处理器各核心执行的程序之间需要进行数据共享和同步高效的通信机制是CMP处理器高性能的重要保障主流片上高效通信机制基于总线共享的cache结构基于片上的互连结构QiZhang,USTC7总线共享cache结构每个CPU内核拥有共享的二级或三级cache(lastlevelcache),用于保存比较常用的数据,并通过连接核心的总线进行通信。优点结构简单通信速度高缺点基于总线的结构可扩展性较差QiZhang,USTC8基于片上互连的结构每个CPU核心具有独立的处理单元和cache,各个核心通过交叉开关或片上网络等方式连接在一起,各个核心间通过消息通信。优点可扩展性好数据带宽有保证缺点硬件结构复杂软件改动较大QiZhang,USTC9如何有效地利用多核技术?现状客户端应用程序开发者多年来一直停留在单线程世界,生产“顺序软件”。多核时代到来后软件开发者必须找出新的开发软件的方法,选择程序执行模型。QiZhang,USTC10程序执行模型编译器设计人员与系统实现人员之间的接口编译器设计人员将一种高级语言程序按一种程序执行模型转换成一种目标机器语言程序系统实现人员该程序执行模型在具体目标机器上的有效实现程序执行模型的适用性决定多核处理器能否以最低的代价提供最高的性能QiZhang,USTC11IntelCore微架构QiZhang,USTC12Intel双核QiZhang,USTC13IntelConroeQiZhang,USTC14IntelCore微架构QiZhang,USTC15Intel四核QiZhang,USTC16Intel四核QiZhang,USTC17AMD双核QiZhang,USTC18AMD四核酷龙LargesharedL3cachesharesdatabetweencoresefficientlywhilehelpingreducelatencytomainmemoryDedicatedL1andL2cachepercorehelpsperformanceofvirtualizedenvironmentsandlargedatabasesbyreducingcachepollutionassociatedwithasharedL2cacheTheL1cacheofAMDOpteronprocessorscanhandledoublethenumberofloadspercycleasSecond-GenerationAMDOpteronprocessorstohelpkeepCPUcoresbusyQiZhang,USTC19Cell处理器架构QiZhang,USTC20Tile64QiZhang,USTC21TILE64™ProcessorBlockDiagramQiZhang,USTC22Tile64™ProcessorFamilyTheTILE64™familyofmulticoreprocessorsdeliversimmensecomputeperformancetodrivethelatestgenerationofembeddedapplications.Thisrevolutionaryprocessorfeatures64identicalprocessorcores(tiles)interconnectedwithTilera’siMesh™on-chipnetwork.Eachtileisacompletefull-featuredprocessor,includingintegratedL1&L2cacheandanon-blockingswitchthatconnectsthetileintothemesh.Thismeansthateachtilecanindependentlyrunafulloperatingsystem,ormultipletilestakentogethercanrunamulti-processingoperatingsystemlikeSMPLinux.TheTILE64™processorfamilyslashesboardrealestateandsystemcostbyintegratingacompletesetofmemoryandI/Ocontrollers,thuseliminatingtheneedforanexternalNorthBridgeorSouthBridge.Itdeliversscalableperformance,powerefficiencyandlowprocessinglatencyinanextremelycompactfootprint.WithastandardANSICprogrammingenvironment,developerscanleveragetheirexistingsoftwareinvestmentaswellasutilizethevastbodyofOpenSourcecodeavailable.Tilescanbegroupedintoclusterstoapplytheappropriateamountofhorsepowertoeachapplication.SincemultipleoperatingsysteminstancescanberunontheTILE64™simultaneously,itcanreplacemultipleCPUsubsystemsforboththedataplaneandcontrolplane.QiZhang,USTC23FeaturesofTile64•8X8gridofidentical,generalpurposeprocessorcores(tiles)•3-wayVLIWpipelineforinstructionlevelparallelism•5Mbytesofon-chipCache•192billionoperationspersecond(32-bit)•27Tbpsofon-chipmeshinterconnect•Upto50GbpsofI/ObandwidthQiZhang,USTC24参考文献“多核程序设计”,清华大学出版社谢谢!

1 / 25
下载文档,编辑使用

©2015-2020 m.777doc.com 三七文档.

备案号:鲁ICP备2024069028号-1 客服联系 QQ:2149211541

×
保存成功