《现代计算机体系结构》课程教学课件(英文讲稿)Lecture 05 Core Pipelining

高级计算机体系结构设计及其在数据中心和云计算的应用Lecture 5Core Pipelining
高级计算机体系结构设计及其在数据中心和云计算的应 用 Lecture 5 Core Pipelining

高级计算机体系结构设计及其在数据中心和云计算的应用Before there was pipelining..Single-cycleinsno.(fetch,decodeexec)insn1.(fetch,decode,execMulti-cycleinsno.fetchinsno.decinsn0.execinsn1.fetchinsn1.decinsn1.exectimeSingle-cycle control:hardwired Low CPI (1)- Long clock period (to accommodate slowest instruction)Multi-cycle control:micro-programmed-Shortclockperiod- High CPICan we have both low CPl and short clock period?
高级计算机体系结构设计及其在数据中心和云计算的应 用 Before there was pipelining. • Single-cycle control: hardwired Single-cycle Multi-cycle insn0.(fetch,decode,exec) insn1.(fetch,decode,exec) insn0.fetch insn0.dec insn0.exec insn1.fetch insn1.dec insn1.exec time – Low CPI (1) – Long clock period (to accommodate slowest instruction) • Multi-cycle control: micro-programmed – Short clock period – High CPI • Can we have both low CPI and short clock period?

高级计算机体系结构设计及其在数据中心和云计算的应用PipeliningMulti-cycleinsno.fetchinsno.decinsn0.execinsn1.fetchinsn1.execinsn1.decinsno.fetchinsno.decinsno.execPipelinedinsn1.execinsn1.fetchinsn1.dectimeinsn2.fetchinsn2.decinsn2.execStart with multi-cycle designWhen insnO goes from stage 1 to stage 2... insn1 starts stage 1Each instruction passes through all stages.. but instructions enter and leave at faster rateCan have asmany insns in flight as there are stages
高级计算机体系结构设计及其在数据中心和云计算的应 用 Pipelining • Start with multi-cycle design Multi-cycle insn0.fetch insn0.dec insn0.exec insn1.fetch insn1.dec insn1.exec time Pipelined insn0.fetch insn0.dec insn0.exec insn1.fetch insn1.dec insn1.exec insn2.fetch insn2.dec insn2.exec • When insn0 goes from stage 1 to stage 2 . insn1 starts stage 1 • Each instruction passes through all stages . but instructions enter and leave at faster rate Can have as many insns in flight as there are stages

高级计算机体系结构设计及其在数据中心和云计算的应用PipelineExamples11addresshit?Stage delay = nBandwidth = ~(1/n)Dhit?addressStage delay =n/2Bandwidth = ~(2/naddresshit?Stage delay = n/3一Bandwidth = ~(3/n)Increasesthroughput at the expense of latency
高级计算机体系结构设计及其在数据中心和云计算的应 用 Pipeline Examples address hit? = = = = address = hit? = Increases throughput at the expense of latency = = = address hit? = = = =

高级计算机体系结构设计及其在数据中心和云计算的应用Processor PipelineReviewDecodeFetchExecuteMemory(Write-back)RegALUI-cacheD-cacheFile
高级计算机体系结构设计及其在数据中心和云计算的应 用 Processor Pipeline Review +4 Fetch Decode Memory (Write-back) Execute I-cache Reg File PC +4 ALU D-cache

高级计算机体系结构设计及其在数据中心和云计算的应用Stage 1: Fetch. Fetch an instruction from memory every cycle- UsePCtoindexmemory-IncrementPC(assumenobranchesfornow)Write state to the pipeline register (IF/ID)- The next stage will read this pipeline register
高级计算机体系结构设计及其在数据中心和云计算的应 用 Stage 1: Fetch • Fetch an instruction from memory every cycle – Use PC to index memory – Increment PC (assume no branches for now) • Write state to the pipeline register (IF/ID) – The next stage will read this pipeline register

高级计算机体系结构设计及其在数据中心和云计算的应用Stage 1: Fetch Diagramtarget.MU十I+OdaPCosInstructionSCacheerIF / IDPipelineregister
高级计算机体系结构设计及其在数据中心和云计算的应 用 Stage 1: Fetch Diagram 1 + M U X PC + 1 Decode target Instruction bits IF / ID Pipeline register PC Instruction Cache en en Decode

高级计算机体系结构设计及其在数据中心和云计算的应用Stage 2: DecodeDecodesopcodebits- Set up Control signals forlater stagesReadinputoperandsfromregisterfile-Specified bydecodedinstructionbitsWrite state to the pipeline register (ID/EX)-Opcode- Register contents- PC+1 (even though decode didn't use it)- Control signals (from insn) for opcode and destReg
高级计算机体系结构设计及其在数据中心和云计算的应 用 Stage 2: Decode • Decodes opcode bits – Set up Control signals for later stages • Read input operands from register file – Specified by decoded instruction bits • Write state to the pipeline register (ID/EX) – Opcode – Register contents – PC+1 (even though decode didn’t use it) – Control signals (from insn) for opcode and destReg

高级计算机体系结构设计及其在数据中心和云计算的应用Stage 2: Decode DiagramtargetI+I+OdregAearegBeeiRegisterFiledestRegdataSaeesIF / IDID/EXPipelineregisterPipelineregister
高级计算机体系结构设计及其在数据中心和云计算的应 用 Stage 2: Decode Diagram regA contents Register File regA regB PC + 1 PC + 1 Fetch Execute destReg target ID / EX Pipeline register regB contents Register File en Instruction bits IF / ID Pipeline register Control signals Fetch Execute destReg data

高级计算机体系结构设计及其在数据中心和云计算的应用Stage 3: Execute.PerformALUoperations- Calculate result of instruction·Control signals select operation·Contentsof regAusedas oneinput: Either regB or constant offset (from insn) used as second input-CalculatePC-relativebranchtargetPC+1+(constantoffset) Write state to the pipeline register (EX/Mem)- ALU result, contents of regB, and PC+1+offset- Control signals (from insn) for opcode and destReg
高级计算机体系结构设计及其在数据中心和云计算的应 用 Stage 3: Execute • Perform ALU operations – Calculate result of instruction • Control signals select operation • Contents of regA used as one input • Either regB or constant offset (from insn) used as second input – Calculate PC-relative branch target • PC+1+(constant offset) • Write state to the pipeline register (EX/Mem) – ALU result, contents of regB, and PC+1+offset – Control signals (from insn) for opcode and destReg
按次数下载不扣除下载券;
注册用户24小时内重复下载只扣除一次;
顺序:VIP每日次数-->可用次数-->下载券;
- 《现代计算机体系结构》课程教学课件(英文讲稿)Lecture 04 Memory Data Prefetching.pdf
- 《现代计算机体系结构》课程教学课件(英文讲稿)Lecture 06 Scoreboarding and Tomasulo.pdf
- 《现代计算机体系结构》课程教学课件(英文讲稿)Lecture 08 Instruction Fetch and Branch Predictioin.pdf
- 《现代计算机体系结构》课程教学课件(英文讲稿)Lecture 07 Instruction Decode.pdf
- 《现代计算机体系结构》课程教学课件(英文讲稿)Lecture 09 Case Study- Jave Branch Prediction Optimization.pdf
- 《现代计算机体系结构》课程教学课件(英文讲稿)Lecture 13 An Introduction to Cloud Data Centers.pdf
- 《现代计算机体系结构》课程教学课件(英文讲稿)Lecture 10 Out of Order and Speculative Execution.pdf
- 《现代计算机体系结构》课程教学课件(英文讲稿)Lecture 11 Multi-core and Multi-threading.pdf
- 《现代计算机体系结构》课程教学课件(英文讲稿)Lecture 14 Towards Renewable Energy Powered Sustainable and Green Cloud Datacenters.pdf
- 《现代计算机体系结构》课程教学课件(英文讲稿)Lecture 15 GPGPU Architecture and Programming Paradigm.pdf
- 《现代计算机体系结构》课程教学课件(英文讲稿)Lecture 12 Shared Memory Multiprocessor.pdf
- 《现代计算机体系结构》课程教学课件(留学生版)Lecture 1 Instruction Set Architecture(Introduction).pdf
- 《现代计算机体系结构》课程教学课件(留学生版)Lecture 0 Introduction and Performance Evaluation.pdf
- 《现代计算机体系结构》课程教学课件(留学生版)Lecture 3 Pipelining.pdf
- 《现代计算机体系结构》课程教学课件(留学生版)Lecture 2 Instruction Set Architecture(Microarchitecture Implementation).pdf
- 《现代计算机体系结构》课程教学课件(留学生版)Lecture 7 Multiprocessors.pdf
- 《现代计算机体系结构》课程教学课件(留学生版)Lecture 4 Spectualtive Execution.pdf
- 《现代计算机体系结构》课程教学课件(留学生版)Lecture 6 Memory Hierarchy and Cache.pdf
- 《现代计算机体系结构》课程教学课件(留学生版)Lecture 5 Out of Order Execution.pdf
- 武汉理工大学:《模式识别》课程教学资源(PPT课件)第4章 基于统计决策的概率分类法.ppt
- 《现代计算机体系结构》课程教学课件(英文讲稿)Lecture 02 Memory Hierarchy and Caches.pdf
- 《现代计算机体系结构》课程教学课件(英文讲稿)Lecture 03 Main Memory and DRAM.pdf
- 《现代计算机体系结构》课程教学课件(英文讲稿)Lecture 01 Introduction and Performance Evaluation-new.pdf
- 东北大学:某学院计算机科学与技术专业《智能信息系统开发》课程教学大纲.pdf
- 东北大学:某学院计算机科学与技术专业《软件工程综合实践》课程教学大纲.pdf
- 东北大学:某学院计算机科学与技术专业《创新创业设计基础》课程教学大纲.pdf
- 东北大学:某学院计算机科学与技术专业《工程领导力》课程教学大纲.pdf
- 东北大学:某学院计算机科学与技术专业《高等数学建模》课程教学大纲(二).pdf
- 东北大学:某学院计算机科学与技术专业《数据库原理》课程教学大纲.pdf
- 东北大学:某学院计算机科学与技术专业《物理建模》课程教学大纲 A.pdf
- 东北大学:某学院计算机科学与技术专业《Java程序设计基础》课程教学大纲.pdf
- 东北大学:某学院计算机科学与技术专业《算法设计与分析》课程教学大纲.pdf
- 东北大学:某学院计算机科学与技术专业《Linux操作系统与内核分析》课程教学大纲.pdf
- 东北大学:某学院计算机科学与技术专业《计算机体系结构》课程教学大纲.pdf
- 东北大学:某学院计算机科学与技术专业《计算机网络组网技术》课程教学大纲.pdf
- 东北大学:某学院计算机科学与技术专业《离散数学》课程教学大纲.pdf
- 东北大学:某学院计算机科学与技术专业《Web开发与应用》课程教学大纲.pdf
- 东北大学:某学院计算机科学与技术专业《信息安全基础》课程教学大纲.pdf
- 东北大学:某学院计算机科学与技术专业《工程经济学》课程教学大纲.pdf
- 东北大学:某学院计算机科学与技术专业《软件体系结构》课程教学大纲.pdf