Operational Efficiency Hacks
John Allspaw, Operations Engineering, Flickr
Wednesday, April 8, 2009

Why?
As infrastructure grows, try to keep the Humans:Machines ratio from getting out of hand.
Some of the How:
- teach machines to build themselves
- teach machines to watch themselves
- teach machines to fix themselves
- reduce MTTR by streamlining

Automated Infrastructure
- If there is only one thing you do, automatic configuration and deployment management should be it.
- See:
  - Opscode/Chef (http://opscode.com/)
  - Puppet (http://reductivelabs.com/products/puppet/)
  - System Imager/Configurator (http://wiki.systemimager.org)

Configuration Management Codeswarm

Time
Machine time is cheaper than human time. If a failure results in some commands being run to ‘fix’ it, make the machines do it. (i.e., don’t wake people up for stupid things!)

Aggregate Monitoring
Don’t care about single nodes, only care about delta change of metrics/faults
- Warn (email) on X % change
- Page (wake up) on Y % change
High and low water marks for some metrics
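The warn/page rule above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the way percent change is computed between two samples of a cluster-wide metric, and the choice to page when a water mark is crossed are all hypothetical, since the slides do not say how X, Y, or the water marks are evaluated.

```python
from typing import Optional


def percent_change(previous: float, current: float) -> float:
    """Absolute percent change of an aggregate metric between two samples."""
    if previous == 0:
        # Avoid dividing by zero; treat any move away from zero as a 100% change.
        return 0.0 if current == 0 else 100.0
    return abs(current - previous) / abs(previous) * 100.0


def classify(
    previous: float,
    current: float,
    warn_pct: float,                      # the "X %" threshold from the slide
    page_pct: float,                      # the "Y %" threshold from the slide
    low_water: Optional[float] = None,    # optional low water mark
    high_water: Optional[float] = None,   # optional high water mark
) -> str:
    """Return "page", "warn", or "ok" for one cluster-wide metric."""
    # Assumption: crossing a water mark is treated as page-worthy.
    if high_water is not None and current > high_water:
        return "page"
    if low_water is not None and current < low_water:
        return "page"
    change = percent_change(previous, current)
    if change >= page_pct:
        return "page"   # wake someone up
    if change >= warn_pct:
        return "warn"   # send an email
    return "ok"
```

The inputs are meant to be totals or averages across the whole pool (for example, requests per second summed over all web servers), so a single node dropping out only matters if it moves the aggregate.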

Self-Healing
Make service monitoring fix common failure scenarios, notify us later about it.
Daemons/processes run on machines, will take corrective action under certain conditions, and report back with what they did.
Can greatly reduce your mean time to recovery (MTTR)

Basic Apache Example
1. Webserver not running?
2. Under certain conditions, try to start it, and email that this happened. (I’ll read it tomorrow)
3. Won’t start? Assume something’s really wrong, so don’t keep trying (email that, too)
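The three steps above could look roughly like the following Python sketch. It is not the tooling from the talk: `heal_webserver` and its parameters are hypothetical names, the health check, start command, and email sender are passed in as plain callables, and the "certain conditions" from step 2 are left out because the slides do not specify them.

```python
from typing import Callable


def heal_webserver(
    is_running: Callable[[], bool],   # hypothetical health check
    start: Callable[[], None],        # hypothetical "start the webserver" action
    notify: Callable[[str], None],    # hypothetical email sender
    already_failed: bool,             # True if an earlier restart attempt failed
) -> bool:
    """Run one check cycle; return the new value of the 'gave up' flag."""
    if is_running():
        return False  # step 1: healthy, nothing to do, clear the flag
    if already_failed:
        return True   # step 3: a previous start failed, so don't keep trying
    start()           # step 2: try to start it once
    if is_running():
        notify("webserver was down; restarted it")
        return False
    notify("webserver is down and would not start; not retrying")
    return True
```

A caller would run this on a timer and carry the returned flag into the next cycle, so a failed start produces a single email rather than a retry loop.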

MySQL Self-Healing
Some MySQL Issues “fixed” by the machines
- Kill long-running SELECT queries (marked safe to kill)
- Queries not safe to kill are marked by the application as “ NO KILL ” in comments
- Run EXPLAIN on killed queries, and report the results
- Keep track of the query types and databases that need the most killing, produce a “DBs that Suck” report
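As a rough illustration of the selection logic in the list above, the Python sketch below picks kill candidates out of process-list rows and tallies them per database. Everything here is an assumption rather than the tool from the talk: the function names, the time threshold, the exact marker text, and the use of dicts keyed by the column names of MySQL's `SHOW FULL PROCESSLIST` output. Actually issuing `KILL` and `EXPLAIN` against the server is left out.

```python
from collections import Counter
from typing import Dict, List, Tuple

NO_KILL_MARKER = "NO KILL"  # assumed text of the application's opt-out comment


def killable_queries(processlist: List[Dict], max_seconds: int) -> List[Dict]:
    """Return long-running SELECTs that are not marked as unsafe to kill."""
    victims = []
    for row in processlist:
        sql = (row.get("Info") or "").strip()
        if row.get("Command") != "Query":
            continue  # sleeping or replication threads are left alone
        if not sql.upper().startswith("SELECT"):
            continue  # only SELECT statements are candidates
        if NO_KILL_MARKER in sql:
            continue  # the application asked us not to kill this one
        if (row.get("Time") or 0) > max_seconds:
            victims.append(row)
    return victims


def dbs_that_suck(killed: List[Dict]) -> List[Tuple[str, int]]:
    """Count killed queries per database, worst offenders first."""
    return Counter(row.get("db") or "(none)" for row in killed).most_common()
```

The check is deliberately conservative: a statement that begins with any comment before `SELECT` is skipped rather than killed.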

MySQL Self-Healing
Some MySQL Replication issues “fixed” by the machines, by error

Operational Efficiency Hacks, John Allspaw, Operations Engineering, Flickr, Wednesday, April 8, 2009
Who am I?
- Manage the Flickr Operations group
- Wrote a geeky book:
Efficiencies
