CSR: Core Surprise Removal in Commodity Operating Systems
Noam Shalev (Technion), Hagar Porat (Technion), Idit Keidar (Technion), Yaron Weinsberg (IBM Research), Eran Harpaz (Technion)
April 6, 2016, ASPLOS 2016, Atlanta, Georgia.
Technology scaling
◦ Many-core is here
◦ Machines with a thousand cores are a subject of research [ ]
Technology scaling
◦ Nano-scale phenomena
◦ Hardware reliability decreases [Radetzki et al., 2013]
◦ Faults become more likely
More cores, less reliability
Core failures can no longer be ruled out
What happens today?
A strategy for overcoming core faults: Core Surprise Removal (CSR)
◦ Objective: keep the system alive following a core fault
◦ Easily integrates into existing operating systems
Implementation in the Linux kernel
Use of Hardware Transactional Memory to cope with failures in critical kernel code
A proof of concept on a real system.
Chip multi-processor system
◦ Reliable shared memory
◦ Fault-prone cores
◦ Reliable Failure Detection Unit (FDU) [Weis et al., 2012]: halts execution of the faulty core, flushes its L1 upon failure detection, and reports to the OS.
Fail-stop model
◦ A faulty core stops executing from some point onward
◦ Its registers and buffers are unavailable
◦ Its L1 cache data is flushed upon failure [Giorgi et al., 2014]
(Figure: eight cores with private L1 caches, shared L2 and L3 caches, and reliable shared memory.)
Recovery steps (OS-dependent):
◦ Flag the core as faulty: treat it as offline, and never hot-plug it again
◦ Reset interrupt affinities: handle lost interrupts, migrate the IPI queue
◦ Migrate tasklets and work-queues
◦ Update kernel services: RCU subsystem, performance events, etc.
◦ Terminate the running process and free its resources
◦ Migrate processes.
What about cascading failures?
Recovery pipeline: Mark Faulty → Reset Interrupts → Migrate Tasklets → Migrate Workqueues → Update Services → Close Task → Migrate Processes
(Diagram: recovery operations are split between a tasklet queue and a recovery workqueue. When the FDU triggers, tasklets are queued to mark the core faulty, verify visibility, reset interrupts, and migrate tasklets; the FDU is informed and acknowledged; work is then queued to migrate workqueues, update services, close the task, and migrate processes, and the system resumes.)
Tasklets and work-queues execute the recovery process.
In a cascading-failure case:
◦ The FDU chooses a new core
◦ The third tasklet migrates the remaining operations.
(Diagram: the queued recovery operations resume on the newly chosen core.)
CSR properties:
◦ Designed to integrate into commodity operating systems
◦ No overhead when the system is fault-free, except for the FDU
◦ Tolerates cascading failures
◦ Scalable
Recovery guarantees?
But... how?
Evaluation methodology:
◦ Modified QEMU crashes a random core at a random time, distinguishing between idle, user, and kernel mode
◦ Different workloads: Postmark, Metis, and SPEC CPU2006 benchmarks
◦ Recovery validation: create a file and flush it to disk using sync
Idle-mode success rate: 100%
User-mode success rate: 100%
Meaning the system is protected all the time, except for... kernel mode. Well... it's complicated.
A fault during critical kernel section execution causes a deadlock
◦ We cannot kill kernel space
◦ Reclaim the lock by keeping ownership? No: inconsistent data.
(Diagram: one core fails while holding a lock; Cores #0-#3 contend for it.)
(Charts: per-workload recovery breakdown for Postmark x4, 429.mcf x4, K-means x8/x16, 410.bwaves x4, and 401.bzip2 x4, showing successful recovery between 70% and 88%, with the remaining failures split among scheduler locks, FS/MM locks, and other locks; plus a workload-properties table of user, system, IO-wait, and idle time per benchmark.)
System crashes always happen due to a held lock.
Solution: use Hardware Transactional Memory to execute kernel critical sections
◦ As in TxLinux [Rossbach et al., SOSP '07], but for reliability purposes
◦ No locks are used: prevents deadlocks
◦ Sections execute atomically: prevents inconsistent data
A strategy for overcoming core faults: Core Surprise Removal (CSR)
◦ Objective: keep the system alive following a core fault
◦ Easily integrates into existing operating systems
Implementation in the Linux kernel
Use of Hardware Transactional Memory to cope with failures in critical kernel code
A proof of concept on a real system.
Replace scheduler locks with lock-elision code.
TSX is a best-effort HTM:
◦ Transactions are not guaranteed to commit: retry
◦ Not all instructions can commit transactionally: resort to regular locking
◦ Sections may be too large: split them

Workload     Commit Rate   Energy Saving   Performance Gain
Idle         100%          -               4%
16-threads   99.9%         0%              1%
32-threads   99.9%         3%              3%
64-threads   99.8%         4%              2%
But again... how?
Crash simulation on a real system, executed in kernel mode:

    local_irq_disable();                      /* core becomes unresponsive */
    if (fault_injection() == smp_processor_id())
            while (1);                        /* core "stops" executing    */
Demo: 64-core server, only cores 0-15 shown; 10 tasks are affined to each core.
Load is balanced; the failure is detected; core #13 has no tasks, its tasks having been migrated to core #0.
(Screenshots, cloud setting: the initial correct state vs. after a crash with the original kernel; the real-time clocks read 7:58 and 8:00.)
(Screenshots, cloud setting: the initial correct state vs. after a crash with CSR on the host; the real-time clocks read 7:58 and 8:00.)