Performance Eric McCreath Increasing Word Size A simple way of - PowerPoint PPT Presentation

Oct 20, 2023 •325 likes •429 views

Performance Eric McCreath Increasing Word Size A simple way of improving performance is to increase the data word size. This means that each instruction operates on a larger amount of data. This will involve more gates within the CPU. Also it

Performance Eric McCreath
Increasing Word Size A simple way of improving performance is to increase the data word size. This means that each instruction operates on a larger amount of data. This will involve more gates within the CPU. Also it means some overhead when you wish to operate on data which is smaller than the word size. 2
On Chip Caches CPUs have moved caches onto the CPU die which enables the CPU to be physically closer to the cache. This reduces latency. 3
Pipelining The execution of 1 instruction normally involves a number of stages. These stages are generally independent of each other and work on different parts of the CPU. e.g. while the CPU is executing one instruction it can be fetching the next. This is a little like a factory assembly line. So although it may take a number of clock cycles to execute one instruction one instruction can be started on every clock cycle. Execute Instruction Instruction Write Decode Fetch Back 4
Pipelining http://en.wikipedia.org/wiki/Instruction_pipeline 5
Superscale Superscale architectures involve duplicating functional units within the cpu and then starting more than one instruction on the same clock cycle in the pipeline. This enables a larger throughput of instructions. http://en.wikipedia.org/wiki/Superscalar 6
Outer of order execution Sometimes instructions will require data from memory before they can execute, this will stall the pipeline. This can slow the CPU down greatly. The "Outer of order execution" approach loads the next few instructions and starts executing the instruction that has the required data, this means instructions may be executed "out of order". Often there is dependencies between instructions, the CPU must be mindful of these. 7
Multi-Threading CPUs can maintain the programming context of multiple threads (so duplication of state and register information), without the duplication of processing units, caches, TLBs, etc, this enables multiple threads to be executed within the one core. This can hide latency. So while one thread is waiting on a result another can be executing. Switching between threads is very cheap as it is all done in hardware. From the programers perspective it just looks like you have a SMP ( Symmetric Multiprocessing) system. 8
Multi-core The CPU can be duplicated, so effectively you have a number of CPUs which share the same memory. Often they will also share an L2 Cache. 9

Recommend

Performance and Scalability (Chapter 11) Performance and Scalability Performance: How long

Performance and Scalability (Chapter 11) Performance and Scalability Performance: How long is the latency? Scalability: Do we get higher throughput if we add more resources? Performance and Scalability Performance: How long is the

210 views • 17 slides

March 2019 CONTENTS Page Combined Partner Performance 1 Breckland Performance Reports 2-6

Q4 2018-2019 Performance report March 2019 CONTENTS Page Combined Partner Performance 1 Breckland Performance Reports 2-6 East Cambridgeshire Performance Reports 7-11 Fenland Performance Reports 12-16 Forest Heath Performance Reports

612 views • 38 slides

Performance Bas Performance Bas Performance Bas Performance Bas ed ed ed ed Methodology for

Performance Bas Performance Bas Performance Bas Performance Bas ed ed ed ed Methodology for Tracing the Methodology for Tracing the Methodology for Tracing the Methodology for Tracing the Res Res Res Res pons pons pons pons e of

1.16k views • 62 slides

Verification Verification, Performance Performance Analysis Performance Performance Analysis

Verification Verification, Performance Performance Analysis Performance Performance Analysis Analysis and Analysis and Synthesis Synthesis of Embedd f E E b dd d S b dded Sys ystems t ems Kim G. Larsen Kim G. Larsen Aalborg

1.44k views • 116 slides

2019 Performance Audit Workforce Performance Management 3/19/2020 Why we are here FAC

2019 Performance Audit Workforce Performance Management 3/19/2020 Why we are here FAC requested a performance audit of the Agencys workforce performance management Agency growth requires a robust performance management program

414 views • 13 slides

What is a performance evaluation? Performance Management v. Performance Evaluation Evaluation

3/16/2015 Employee Performance Evaluations and Their Importance Presented by Summer Randall March 18, 2015 What is a performance evaluation? Performance Management v. Performance Evaluation Evaluation Management One time event Ongoing

175 views • 6 slides

PERFORMANCE MANAGEMENT Presentation Outline Performance Management definition and rationale.

PERFORMANCE MANAGEMENT Presentation Outline Performance Management definition and rationale. The Performance Management Cycle. Overview of the JSC Performance Management System (PMS) Goal Setting and Performance Contracting

406 views • 28 slides

Lecture: Metrics to Evaluate Performance Topics: Benchmark suites, Performance equation,

Lecture: Metrics to Evaluate Performance Topics: Benchmark suites, Performance equation, Summarizing performance with AM, GM, HM Video 1: Using AM as a performance summary Video 2: GM, Performance Equation Video 3: AM vs. HM vs. GM

341 views • 15 slides

Using AI to solve performance problems Salesforce Performance Engineering Jasmin Nakic | Jackie

Using AI to solve performance problems Salesforce Performance Engineering Jasmin Nakic | Jackie Chu June, 2018 Salesforce Performance Engineering Jasmin Nakic Jackie Chu Lead Performance Engineer Lead Performance Engineer Forward-Looking

807 views • 37 slides

Getting the Performance Out Of Getting the Performance Out Of High Performance Computing High

Getting the Performance Out Of Getting the Performance Out Of High Performance Computing High Performance Computing Jack Dongarra I nnovative Computing Lab University of Tennessee and Computer Science and Math Division Oak Ridge Nat ional

380 views • 13 slides

PERFORMANCE MANAGEMENT SYSTEMS CHAPTER III PERFORMANCE APPRAISAL PERFORMANCE MANAGEMENT SYSTEMS

PERFORMANCE MANAGEMENT SYSTEMS CHAPTER III PERFORMANCE APPRAISAL PERFORMANCE MANAGEMENT SYSTEMS CHAPTER III PERFORMANCE APPRAISAL SCOPE: Meaning & objectives, Uses & benefits, Process, Methods, errors of

956 views • 62 slides

PERFORMANCE APPRAISAL SYSTEMS CHAPTER VII REWARD FOR PERFORMANCE PERFORMANCE APPRAISAL SYSTEMS

PERFORMANCE APPRAISAL SYSTEMS CHAPTER VII REWARD FOR PERFORMANCE PERFORMANCE APPRAISAL SYSTEMS CHAPTER VII REWARD FOR PERFORMANCE Objectives: Understand 1. Different reward systems & their role in organization. 2. Importance of

510 views • 26 slides

PERFORMANCE MANAGEMENT SYSTEMS CHAPTER VI PAY FOR PERFORMANCE PERFORMANCE MANAGEMENT SYSTEMS

PERFORMANCE MANAGEMENT SYSTEMS CHAPTER VI PAY FOR PERFORMANCE PERFORMANCE MANAGEMENT SYSTEMS CHAPTER VI PAY FOR PERFORMANCE Objectives: 1. Explain different types of non-traditional pay systems. 2. Understand concept of gain sharing.

395 views • 26 slides

IN5060 Performance in distributed systems autumn course What is performance? Stage performance

IN5060 Performance in distributed systems autumn course What is performance? Stage performance Download performance by position World Opera Production Dec 2011 @ Troms HTTP Adaptive Streaming measured on Bygdy Ferry, 2011 Download

1.27k views • 92 slides

CPU Performance Lecture 8 CAP 3103 06-11-2014 1.6 Performance Defining Performance Which

CPU Performance Lecture 8 CAP 3103 06-11-2014 1.6 Performance Defining Performance Which airplane has the best performance? Boeing 777 Boeing 777 Boeing 747 Boeing 747 BAC/Sud BAC/Sud Concorde Concorde Douglas Douglas DC-

319 views • 19 slides

High Performance Systems EuroMPI 2015 Objectives Yet another performance analysis tool

Tutorial 1: Performance analysis for High Performance Systems EuroMPI 2015 Objectives Yet another performance analysis tool Developping performance analysis features for your application/library 2 EuroMPI 2015 Performance analysis for

632 views • 30 slides

CS 35101 Computer Architecture Spring 2008 Week 10: Chapter 5.1-5.3 Materials adapated from

CS 35101 Computer Architecture Spring 2008 Week 10: Chapter 5.1-5.3 Materials adapated from Mary Jane Irwin (www.cse.psu.edu/~mji) and Kevin Schaffer [ adapted from D. Patterson slides ] CS 35101 Ch 5.1 Steinfadt, SP08 KSU Heads Up

856 views • 32 slides

4. Performance Analysis of Parallel Programs 4.1 Performance Evaluation of Computer User

4. Performance Analysis of Parallel Programs 4.1 Performance Evaluation of Computer User criteria: - Small response times Computing center criteria: - High throughputs 4.1.1 Evaluation of CPU Performance 4.1.1 Evaluation of CPU Performance

478 views • 24 slides

DUNE DAQ Data format inside FPGA David Cussans 14 th June 2018 Introduction Format for

DUNE DAQ Data format inside FPGA David Cussans 14 th June 2018 Introduction Format for data inside FPGAs Part of definition of processing blocks. Want to run logic ~ 100MHz 400MHz Reuse gates multiple times in single

299 views • 8 slides

CS31001 COMPUTER ORGANIZATION AND ARCHITECTURE Debdeep Mukhopadhyay, CSE, IIT Kharagpur

CS31001 COMPUTER ORGANIZATION AND ARCHITECTURE Debdeep Mukhopadhyay, CSE, IIT Kharagpur Instruction Execution Steps: The Multi Cycle Circuit 1 The Micro Mips ISA The Instruction Format op rs rt rd sh fn 6 bits 5 bits 5 bits 5

314 views • 27 slides

1 Response Time Det tar 4 mnader att odla fram en tomat How long does it take for my job

Foto: Hughes Leglise-Bataille some rights reserved How do we define (speed) performance ? Response time (aka execution time) the time between the start and the Thus, to maximize completion of a task performance, need to Important to

66 views • 5 slides

Lecture 10: Processor design pipelining Overlapping the execution of instructions

Lecture 10: Processor design pipelining Overlapping the execution of instructions Pipeline hazards Different types How to remove them Inf2C Computer Systems - 2011-2012 1 Pipelining Classic case: make all instructions

337 views • 15 slides

PATMOS 2010 An On-Chip Flip-flop Characterization Circuit Andrea Veggetti (ST Agrate) Abhishek

PATMOS 2010 An On-Chip Flip-flop Characterization Circuit Andrea Veggetti (ST Agrate) Abhishek Jain (ST Noida) Dennis Crippa (ST Agrate) Pier Luigi Rolandi (ST Agrate) Agenda Motivation Flip-flop Characterization Parameters

353 views • 12 slides

CPSC 121: Models of Computation Module 9: Sequential Circuits Module 9: Sequential Circuits By

CPSC 121: Models of Computation Module 9: Sequential Circuits Module 9: Sequential Circuits By the start of class, you should be able to Trace the operation of a DFA (deterministic finite- state automaton) represented as a diagram on an input,

613 views • 33 slides