Understanding the propagation of hard errors to software and - PowerPoint PPT Presentation

Apr 08, 2023 •305 likes •414 views

Faculty of Computer Science Institute of Systems Architecture, Operating Systems Group Understanding the propagation of hard errors to software and implications for resilient system design M. Li, P. Ramachandran, S. Sahoo, S. Adve, V. Adve, Y.

Faculty of Computer Science Institute of Systems Architecture, Operating Systems Group Understanding the propagation of hard errors to software and implications for resilient system design M. Li, P. Ramachandran, S. Sahoo, S. Adve, V. Adve, Y. Zhou presented by Bjoern Doebel
The old litany • Shrinking feature sizes increase – Susceptibility to radiation – Manufacturing errors – Wear-off – Heat-induced errors • Also: DVFS influences error rates • Need hardware/software measures – Spend as few (additional) resources as possible – Require understanding of how hardware errors manifest
Design Goals • Symptom-based vs. fault-based detection • Don't handle masked faults. • Optimize for the common case. • Keep things customizable • Leverage existing features instead of adding new ones.
Fault injection experiments • Target arch: SPARC v9, Solaris, SPEC benchmarks • Environment: Simics + GEMS – Run in parallel for 10,000,000 cycles – Simics-only afterwards • Inject hard (stuck, bridging) errors • Fault injection in: – Instruction decoder – ALU – Register bus – Physical register file – Reorder buffer – Register Alias Table – Address generation unit – FPU
Symptom-based fault detection • Fatal hardware traps • Abnormal application exit / OS crash • Application/OS hangs – Branch counting • Excessive OS activity – Observation: normal OS activity <10,000 cycles
Initial results
Fatal traps
Fault detection latency
But what about soft errors? • [Saggese2005]: “An experimental study of soft errors in microprocessors” – 53% of injected faults have no effect – 23% crash application – 13% silent data corruption – 12% incomplete execution
Discussion • SPARC vs x86 – Does the max. 10,000 cycles in kernel hold for Linux/x86? Is there an upper bound? – Fewer illegal instructions – No misaligned memory accesses • “I already have all those expensive checkpoint/rollback features in my system, so no need to build something new.”

Recommend

Basic Errors Compiling in Unix Syntax errors Common Errors, and Debugging Run-Time errors

Basic Errors Compiling in Unix Syntax errors Common Errors, and Debugging Run-Time errors CS-121 Logic errors Syntax Errors Syntax Errors An error in which a C++ grammar rule has been violated. Are flagged at compile time Missing ;

338 views • 5 slides

PLANT PROPAGATION An Overview of Plant Propagation Methods Two Techniques of Stem Cutting

PLANT PROPAGATION An Overview of Plant Propagation Methods Two Techniques of Stem Cutting Propagation Types of Propagation Sexual Propagation: Asexual Propagation: Seed Division & Separation Cuttings Grafting & Budding

624 views • 60 slides

Unified error reporting -- A worthy goal? Andi Kleen, Intel Corporation Sep 2009

Unified error reporting -- A worthy goal? Andi Kleen, Intel Corporation Sep 2009 andi@firstfloor.org errors standardized errors machine checks pci-express errors platform errors thermal errors APEI storage errors IO errors SMART events

518 views • 21 slides

Introduction Detecting Errors in Effects of Annotation Errors Detecting Errors in Corpus

Detecting Errors in Introduction Detecting Errors in Effects of Annotation Errors Detecting Errors in Corpus Annotation Corpus Annotation Corpus Annotation Detmar Meurers Detmar Meurers Detmar Meurers University of T ubingen

134 views • 12 slides

THE AMATEURS FRIEND OR Enemy A short course on Propagation Propagation What is it? What

PROPAGATION THE AMATEURS FRIEND OR Enemy A short course on Propagation Propagation What is it? What causes it? How does it effect HF Communications How do we understand the charts Were do we find the propagation information What is it?

978 views • 28 slides

1 How to deal with Radio Propagation How to deal with Radio Propagation Where are you from?

Lecture II Agenda Lecture II Agenda Radio Propagation Physical of radio propagation Two types of propagation models Wireless Multimedia System Outdoor vs. Indoor Radio Propagation Model How to do

538 views • 15 slides

Physical of radio propagation Two types of propagation models

Lecture II Agenda Lecture II Agenda Radio Propagation Physical of radio propagation Two types of propagation models Wireless Multimedia System Outdoor vs. Indoor Radio Propagation Model How to do

633 views • 14 slides

ELO TRANSLATION PROJECT SARAH **** SOME VOCAB Errors Logic Errors Runtime Errors

ELO TRANSLATION PROJECT SARAH **** SOME VOCAB Errors Logic Errors Runtime Errors Strings ex. This is a string Data Structure a particular way of organizing data for computers Dictionary type of data

454 views • 42 slides

Treasurers Institute Sun, Nov. 17, 2019 Property Tax Errors Property Tax Errors Property Tax

Treasurers Institute Sun, Nov. 17, 2019 Property Tax Errors Property Tax Errors Property Tax Errors Property Tax Errors Historically, DCCA presented the Tax Levy training. Last Dept of Revenue Property Tax Rate and Levy Manual Dated Dec

962 views • 57 slides

NMVTIS INFORMATION FOR TACA MARCH 2019 NMVTIS ERRORS Odometer Reading Discrepancies

NMVTIS INFORMATION FOR TACA MARCH 2019 NMVTIS ERRORS Odometer Reading Discrepancies FY 2017 10,077 errors FY2018 7,535 errors Title Discrepancies FY 2017 62,193 errors Based on FY 2018 59,249 errors

337 views • 6 slides

GENIE Systematic Errors GENIE Systematic Errors GENIE Systematic Errors Hugh Gallagher, Tufts

GENIE Systematic Errors GENIE Systematic Errors GENIE Systematic Errors Hugh Gallagher, Tufts University July 11, 2016 - NUTUNE 16, U. Liverpool OUTLINE 1) History/Context 2) Neutrino-Nucleon Interaction Modeling Free nucleon cross

905 views • 43 slides

Unforced Errors Unforced Errors My mother taught me that in polite society, we do not talk

Unforced Errors Unforced Errors My mother taught me that in polite society, we do not talk about: Unforced Errors My mother taught me that in polite society, we do not talk about: politics, Unforced Errors My mother taught me that in

1.01k views • 82 slides

Exceptions Introduction to Computing Using Python Types of errors We saw different types of

Introduction to Computing Using Python Exceptions Introduction to Computing Using Python Types of errors We saw different types of errors in this course There are basically two types of errors: syntax errors erroneous state errors

1.06k views • 85 slides

Geometric Sound Transmission Micah Taylor Overview Geometric propagation Very fast Can be

Geometric Sound Transmission Micah Taylor Overview Geometric propagation Very fast Can be used realtime Several propagation methods Reflection Diffraction Transmission Propagation Reflection Primary propagation method Causes echos

285 views • 11 slides

Lecture no: 2 Short on dB calculations Basics about antennas Propagation mechanisms

RADIO SYSTEMS ETI 051 Contents Lecture no: 2 Short on dB calculations Basics about antennas Propagation mechanisms Free space propagation Reflection and transmission Propagation Propagation over ground plane

456 views • 12 slides

Amateur Radio License Propagation and Antennas Todays Topics Propagation Antennas

Amateur Radio License Propagation and Antennas Todays Topics Propagation Antennas Propagation Modes Ground wave Low HF and below, ground acts as waveguide AM radio Line-of-Sight (LOS) VHF and above, radio waves only

871 views • 46 slides

CSCI 5832 Natural Language Processing Lecture 1 Jim Martin 1/18/08 1 Today 1/15 An

CSCI 5832 Natural Language Processing Lecture 1 Jim Martin 1/18/08 1 Today 1/15 An exercise Overview of the field of NLP Administrivia Course topics Commercial relevance 2 1/18/08 Whats this story about? 2 speech 1

383 views • 11 slides

Healthy New YOU! St. Pius X Church welcomes you, to renew all things in Christ Week 3

Healthy New YOU! St. Pius X Church welcomes you, to renew all things in Christ Week 3 Spiritual Exercise The Litany of Humility O Jesus! meek and humble of heart, Hear me. From the desire of being esteemed, Deliver me, Jesus. From

459 views • 34 slides

All Saints Day Welcome Home Gathering Song For All the Saints For all the saints Who from their

Empowered by the Eucharist Today we celebrate All Saints Day Welcome Home Gathering Song For All the Saints For all the saints Who from their labors rest Who Thee by faith Before the world confessed Thy name O Jesus Be forever blest

407 views • 20 slides

Deep Hough Voting for 3D Object Detection in Point Clouds Charles Qi ( ) GAMES

Deep Hough Voting for 3D Object Detection in Point Clouds Charles Qi ( ) GAMES Webinar December 5th, 2019 Joint work with Or Litany, Kaiming He, Leonidas Guibas. ICCV 2019. 3D object detection Estimate oriented 3D bounding boxes and

363 views • 32 slides

Literary Criticism Overview revised 08.22.12 || English 1302: Composition & Rhetoric II || D.

Literary Criticism Overview revised 08.22.12 || English 1302: Composition & Rhetoric II || D. Glen Smith, instructor Six Types of Analysis [See Portable Legacies , page 35-41 Forms of the Essay about Literature for more info.] 1.

451 views • 12 slides

Digital humanities: modeling semi-structured data from traditional scholarship Tom Lippincott

Digital humanities: modeling semi-structured data from traditional scholarship Tom Lippincott IntroHLT Fall 2019 Human Language Technology Center of Excellence Center for Language and Speech Processing 1 Outline Intro: A few thoughts on

1.17k views • 86 slides

The Power of an Agile Mindset in Determining Quality Linda Rising linda@lindarising.org

The Power of an Agile Mindset in Determining Quality Linda Rising linda@lindarising.org www.lindarising.org @RisingLinda Disclaimer: This provocative presentation is ideally the beginning of a conversation. It won't take long for me to

502 views • 36 slides

Paper Summaries Any takers? The Renderman Shading Language Announcement Logistics

Paper Summaries Any takers? The Renderman Shading Language Announcement Logistics SIGGRAPH animation screenings Checkpoint 3 Every Monday Due last Monday (still working on it) 12:30pm -- 2pm Checkpoint 4 Due Monday

508 views • 12 slides

Understanding the propagation of hard errors to software and - PowerPoint PPT Presentation

Faculty of Computer Science Institute of Systems Architecture, Operating Systems Group Understanding the propagation of hard errors to software and implications for resilient system design M. Li, P. Ramachandran, S. Sahoo, S. Adve, V. Adve, Y.

Basic Errors Compiling in Unix Syntax errors Common Errors, and Debugging Run-Time errors

PLANT PROPAGATION An Overview of Plant Propagation Methods Two Techniques of Stem Cutting

Unified error reporting -- A worthy goal? Andi Kleen, Intel Corporation Sep 2009

Introduction Detecting Errors in Effects of Annotation Errors Detecting Errors in Corpus

THE AMATEURS FRIEND OR Enemy A short course on Propagation Propagation What is it? What

1 How to deal with Radio Propagation How to deal with Radio Propagation Where are you from?

Physical of radio propagation Two types of propagation models

ELO TRANSLATION PROJECT SARAH **** SOME VOCAB Errors Logic Errors Runtime Errors

Treasurers Institute Sun, Nov. 17, 2019 Property Tax Errors Property Tax Errors Property Tax

NMVTIS INFORMATION FOR TACA MARCH 2019 NMVTIS ERRORS Odometer Reading Discrepancies

GENIE Systematic Errors GENIE Systematic Errors GENIE Systematic Errors Hugh Gallagher, Tufts

Unforced Errors Unforced Errors My mother taught me that in polite society, we do not talk

Exceptions Introduction to Computing Using Python Types of errors We saw different types of

Geometric Sound Transmission Micah Taylor Overview Geometric propagation Very fast Can be

Lecture no: 2 Short on dB calculations Basics about antennas Propagation mechanisms

Amateur Radio License Propagation and Antennas Todays Topics Propagation Antennas

CSCI 5832 Natural Language Processing Lecture 1 Jim Martin 1/18/08 1 Today 1/15 An

Healthy New YOU! St. Pius X Church welcomes you, to renew all things in Christ Week 3

All Saints Day Welcome Home Gathering Song For All the Saints For all the saints Who from their

Deep Hough Voting for 3D Object Detection in Point Clouds Charles Qi ( ) GAMES

Literary Criticism Overview revised 08.22.12 || English 1302: Composition & Rhetoric II || D.

Digital humanities: modeling semi-structured data from traditional scholarship Tom Lippincott

The Power of an Agile Mindset in Determining Quality Linda Rising linda@lindarising.org

Paper Summaries Any takers? The Renderman Shading Language Announcement Logistics

Sambuz

Useful Links

Newsletter

Mail Us

Understanding the propagation of hard errors to software and - PowerPoint PPT Presentation

Faculty of Computer Science Institute of Systems Architecture, Operating Systems Group Understanding the propagation of hard errors to software and implications for resilient system design M. Li, P. Ramachandran, S. Sahoo, S. Adve, V. Adve, Y.

Basic Errors Compiling in Unix Syntax errors Common Errors, and Debugging Run-Time errors

PLANT PROPAGATION An Overview of Plant Propagation Methods Two Techniques of Stem Cutting

Unified error reporting -- A worthy goal? Andi Kleen, Intel Corporation Sep 2009

Introduction Detecting Errors in Effects of Annotation Errors Detecting Errors in Corpus

THE AMATEURS FRIEND OR Enemy A short course on Propagation Propagation What is it? What

1 How to deal with Radio Propagation How to deal with Radio Propagation Where are you from?

Physical of radio propagation Two types of propagation models

ELO TRANSLATION PROJECT SARAH **** SOME VOCAB Errors Logic Errors Runtime Errors

Treasurers Institute Sun, Nov. 17, 2019 Property Tax Errors Property Tax Errors Property Tax

NMVTIS INFORMATION FOR TACA MARCH 2019 NMVTIS ERRORS Odometer Reading Discrepancies

GENIE Systematic Errors GENIE Systematic Errors GENIE Systematic Errors Hugh Gallagher, Tufts

Unforced Errors Unforced Errors My mother taught me that in polite society, we do not talk

Exceptions Introduction to Computing Using Python Types of errors We saw different types of

Geometric Sound Transmission Micah Taylor Overview Geometric propagation Very fast Can be

Lecture no: 2 Short on dB calculations Basics about antennas Propagation mechanisms

Amateur Radio License Propagation and Antennas Todays Topics Propagation Antennas

CSCI 5832 Natural Language Processing Lecture 1 Jim Martin 1/18/08 1 Today 1/15 An

Healthy New YOU! St. Pius X Church welcomes you, to renew all things in Christ Week 3

All Saints Day Welcome Home Gathering Song For All the Saints For all the saints Who from their

Deep Hough Voting for 3D Object Detection in Point Clouds Charles Qi ( ) GAMES

Literary Criticism Overview revised 08.22.12 || English 1302: Composition &amp; Rhetoric II || D.

Digital humanities: modeling semi-structured data from traditional scholarship Tom Lippincott

The Power of an Agile Mindset in Determining Quality Linda Rising linda@lindarising.org

Paper Summaries Any takers? The Renderman Shading Language Announcement Logistics

Sambuz

Useful Links

Newsletter

Mail Us

Literary Criticism Overview revised 08.22.12 || English 1302: Composition & Rhetoric II || D.