Cache Coherency and Memory Consistency

Why On-Chip Cache Coherence is here to stay - Motivation
There is skepticism about the scalability of cache coherence:
● Some argue:
  ○ Availability of other paradigms such as message passing and incoherent scratchpad memories
  ○ Some programs do not scale with coherency.

Contribution
● Addresses various concerns with in-depth analysis of each.
● Provides substantial reasons to support the continued use of coherency models.
  ○ “... we find no compelling reason to abandon coherence”
  ○ “performance generally superior to what is achievable with software-implemented coherence”
  ○ backward compatible
● Excellent arguments in favor of coherency - consistently refuting possible reasons why on-chip coherency cannot scale
  ○ traffic
  ○ storage cost
  ○ maintaining inclusion
  ○ latency
  ○ energy
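
The storage-cost concern listed above can be made concrete with a back-of-the-envelope calculation. The sketch below assumes a flat full-map directory (one sharer bit per core for every tracked block) and 64-byte blocks. Both are illustrative assumptions rather than figures from the slides, and `full_map_overhead` is a hypothetical helper name.

```python
def full_map_overhead(cores: int, block_bytes: int = 64) -> float:
    """Extra storage as a fraction of block size, assuming one
    sharer bit per core is kept for every tracked block."""
    if cores <= 0 or block_bytes <= 0:
        raise ValueError("cores and block_bytes must be positive")
    return cores / (block_bytes * 8)


if __name__ == "__main__":
    # Print the assumed overhead for a few core counts.
    for n in (16, 64, 256, 1024):
        print(f"{n:5d} cores: {full_map_overhead(n):.1%} overhead")
```

Under these assumptions the overhead grows linearly with the core count, which is the naive scaling behind the storage-cost objection.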

Merits
● Uses practical examples to support these arguments
● If multiple scenarios exist, the paper accounts for them.
● Convincing and thorough on the cases covered

Failings
● Lacks hardware implementations to support arguments
● Does not account for scalability of supporting hardware, though the argument is that scalability concerns will come into play from other issues first
● Does not account for multi-chip coherence
● Could have spent more time discussing the alternatives to “on-chip” coherence.

Questions
● Does the paper hold true today? 8 years later, do you still agree with the authors?
● Have the authors done anything since to address some of these failings?

Token Coherence: Decoupling Performance and Correctness - Motivation
● Snooping requires total ordering and is not scalable due to bus bandwidth limitations.
● Directory based coherence adds indirection, increases latency due to added communication.
● Coherence is not scalable

Contribution
● TokenB - a new token coherence protocol
● Idea of separating protocol into two, one designed for performance and one designed to ensure correctness
  ○ performance for the common case
  ○ guaranteed correctness for the worst case
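
The correctness half of this split rests on token counting: each block has a fixed number of tokens, a holder may read with at least one token, and may write only when it holds all of them. The slides do not spell these rules out, so the sketch below is a minimal illustration of them and not TokenB itself. The names `Holder`, `transfer`, `conserved` and the value of `TOTAL_TOKENS` are hypothetical, and the sketch models no performance protocol, data transfer, or starvation avoidance.

```python
from dataclasses import dataclass

TOTAL_TOKENS = 4  # assumed fixed token count for one block


@dataclass
class Holder:
    """A cache or memory controller holding tokens for one block."""
    name: str
    tokens: int = 0

    def can_read(self) -> bool:
        # Reading requires at least one token.
        return self.tokens >= 1

    def can_write(self) -> bool:
        # Writing requires every token, so no other holder can read.
        return self.tokens == TOTAL_TOKENS


def transfer(src: Holder, dst: Holder, count: int) -> None:
    """Move tokens between holders; tokens are never created or destroyed."""
    if count < 0 or count > src.tokens:
        raise ValueError("cannot send more tokens than are held")
    src.tokens -= count
    dst.tokens += count


def conserved(holders) -> bool:
    """Check that the tokens across all holders still sum to the total."""
    return sum(h.tokens for h in holders) == TOTAL_TOKENS
```

Because `transfer` only moves tokens, a writer and a reader can never coexist, whatever order the requests arrive in. That property is what allows the performance side to use an unordered interconnect.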

Merits
● Describes novel, correct, and performant principles for improving cache coherence protocols
● Allows for use of an unordered interconnect to serve cache-to-cache misses

Failings
● “correctness substrate” has not been implemented in hardware
● Efficiency arguments not fully convincing
● Broadcast required for implementation
● Cost of torus interconnect not justified

Questions
● Are the additional hardware costs worth the benefits? If so, why isn’t this protocol widely implemented?
● Does the use of a modified broadcast network imply that this new protocol is about as unscalable as the ones that it was trying to replace?
