Pivotals best practices for achieving sustainable Cloud Operations - PowerPoint PPT Presentation

Pivotal’s best practices for achieving sustainable Cloud Operations Konstantin Semenov - Principal Software Engineer

About myself Started career in software engineering 22 years ago Involved in a diverse range of projects – from database management to 3D modelling Enjoy playing music in my spare time

Agenda Who are Cloud Ops? Pivotal Values Google SRE Extracted Practices Questions

Who are Cloud Ops The reverse of Dev Ops 50% time spent on development Provide feedback to the product teams Develop best practices for operators

Pivotal Core Values eXtreme Programming Test-driven development Small releases ➡ Small updates Pair programming ➡ Pair operations Continuous integration ➡ Continuous upgrade Collective ownership ➡ No superheroes, please Sustainable pace ➡ Sane working hours

HumanOps Humans are part of the system Humans impact systems Humans impact business Human issues count as system issues Escalate to humans as a last resort

HumanOps Human metrics System metrics

Cloud Ops EU Our first experiment – a distributed operations team ! Regularly sharing context within distributed teams is hard

Comic Relief A large-scale tele-marathon in the UK Collected over £82 million in donations in one night Deployed across AWS, vSphere and GCP

Google Partnership We were invited to the CRE trial run Well-aligned with Pivotal principles

Service Levels What does it mean to have 99.9%? What is the SLI/SLO/SLA relationship? How would you choose them?

Service Levels Level Outage per month What does it mean to have 99.9%? 99% 7 hours 99.9% 43.2 minutes 99.95% 21.6 minutes 99.99% 4.32 minutes

Service Levels Loss Issue MTTD MTTR MTBF Impact min/yr Containers run without dependent 0 3 min 90 days 10% 1 services It’s all about risk assessment VM exposed to Internet traffic 120 min 60 min 365 days 10% 18 Set clear expectations of performance Applications can cause collateral 60 min 30 min 90 days 0% 0 damage to log availability Traffic spike prevents mitigation 10 min 60 min 180 days 100% 142

Service Levels It’s all about risk assessment Set clear expectations of performance

Error budget Is usually defined within a 30-day rolling window Helps to prioritise innovation over stability It’s a budget - it is meant to be spent

Pivotal Tracker Web-based project management system Over 100 000 active users Runs on commercial distribution of Cloud Foundry Migrated from AWS to GCP with no downtime

Platform updates Security patches General support timeframe Scheduled nighttime/weekend maintenance windows More error-prone due to human factors No-one to ask for help when something fails

Deployment train Inspired by agile release engineering All pending updates are applied every morning

Train driver Controls what changes board the train, and whether the train is allowed to leave Holds the pager On duty for a week Writes deployment reports

Fire drills Keep the teams in check with certain tooling Can be performed with development teams Are essential for becoming a train driver

Dungeons & Dragons Help develop troubleshooting skills Gets the team more intimately familiar with various parts of the system Are fun!

Toil snake Backup Upgrades Tunnel Backup Upgrades Tunnel Backup Toil reduction prioritisation tool Tunnel Backup Tunnel Backup Clearly indicates the biggest pain

End of General Support Reviewed weekly Feedback to product teams

Bit Rot Indicates how long a component hasn’t been updated Surfaces update issues

Questions? "

References Cloud Foundry Foundation   https://www.cloudfoundry.org/ HumanOps - https://www.humanops.com/ Google SRE - https://landing.google.com/sre/

Thank you! #

Pivotals best practices for achieving sustainable Cloud Operations - PowerPoint PPT Presentation

Pivotals best practices for achieving sustainable Cloud Operations Konstantin Semenov - Principal Software Engineer About myself Started career in software engineering 22 years ago Involved in a diverse range of projects from database

Annual General Meeting May 27, 2015 2014 A PIVOTAL YEAR FOR ADVANTAGE 2 WHY PIVOTAL?

Transforming Nyrstar Port Pirie: reaching a pivotal milestone reaching a pivotal milestone

A pivotal year for Generalized Parton Distributions Pivotal year for GPDs J. Ball, G. Charles,

Spring Cloud, Spring Boot and Netflix OSS http://localhost:4000/decks/cloud-boot-netflix.html

Using Istio and Envoy for Edge Routing in Cloud Foundry Shubha Anjur Tupil Product Manager,

Financial Impacts of Achieving Aggressive Financial Impacts of Achieving Aggressive Financial

Five Steps to Optimization Five Steps to Optimization Beyond Best Practices Beyond Best

Building a Private Cloud Cloud Infrastructure Using Opensource Building a Private Cloud OSCON

KAFKA STREAMS CLOUD MONITORING AWS CLOUD MONITORING AWS APP CLOUD MONITORING AWS HTTP APP

41 1 Sustainable Performance US Dollar Best Trade Best Customer The BIZZ Qatar Corporate

Best Practices: Electronics Cooling Ruben Bons - CD-adapco Best Practices Outline Geometry

Welcome to data visualization best practices in R Nick Strayer Instructor DataCamp

1 Best Practices Conversational UX Design 2 Best Practices Conversational UX Design SET THE

Adapting JDT to the Cloud Alex Boyko Pivotal Jay Arthanareeswaran - IBM John Arthorne - IBM

Further Together: Curated Pairing Culture @Pivotal Neha Batra @nerdneha

Higher Education Trends August 11, 2016 Pivotal Juncture Louis Soares Vice President for

SC Links Integration Point 7 3 rd HiLumi LHC-LARP Daresbury (UK) Sylvain Weisz and

Antiproton production in p-He collisions, and more, at LHCb LHCb on a Space Mission He at 6.5

DO WE DO WE NEED NEED URBAN RBAN PARAMETE PA TERIZATI TION IN HIGH RESO SOLUTI TION

The MJO in Tropical Total Ozone The MJO in Tropical Total Ozone The MJO in Tropical Total Ozone

Social Choice: Single-Peaked Preferences Game Theory Course: Jackson, Leyton-Brown & Shoham

Social Choice Inspired Ordinal Measurement Alexis Tsoukis LAMSADE - CNRS, Universit

Movie Recommender Johannes Kastner, Nemanja Ranitovic, Markus Endres University of Augsburg,

CMU 15-251 Social Choice Teachers: Anil Ada Ariel Procaccia (this time) Social choice theory

Sambuz

Useful Links

Newsletter

Mail Us

Pivotals best practices for achieving sustainable Cloud Operations - PowerPoint PPT Presentation

Pivotals best practices for achieving sustainable Cloud Operations Konstantin Semenov - Principal Software Engineer About myself Started career in software engineering 22 years ago Involved in a diverse range of projects from database

Annual General Meeting May 27, 2015 2014 A PIVOTAL YEAR FOR ADVANTAGE 2 WHY PIVOTAL?

Transforming Nyrstar Port Pirie: reaching a pivotal milestone reaching a pivotal milestone

A pivotal year for Generalized Parton Distributions Pivotal year for GPDs J. Ball, G. Charles,

Spring Cloud, Spring Boot and Netflix OSS http://localhost:4000/decks/cloud-boot-netflix.html

Using Istio and Envoy for Edge Routing in Cloud Foundry Shubha Anjur Tupil Product Manager,

Financial Impacts of Achieving Aggressive Financial Impacts of Achieving Aggressive Financial

Five Steps to Optimization Five Steps to Optimization Beyond Best Practices Beyond Best

Building a Private Cloud Cloud Infrastructure Using Opensource Building a Private Cloud OSCON

KAFKA STREAMS CLOUD MONITORING AWS CLOUD MONITORING AWS APP CLOUD MONITORING AWS HTTP APP

41 1 Sustainable Performance US Dollar Best Trade Best Customer The BIZZ Qatar Corporate

Best Practices: Electronics Cooling Ruben Bons - CD-adapco Best Practices Outline Geometry

Welcome to data visualization best practices in R Nick Strayer Instructor DataCamp

1 Best Practices Conversational UX Design 2 Best Practices Conversational UX Design SET THE

Adapting JDT to the Cloud Alex Boyko Pivotal Jay Arthanareeswaran - IBM John Arthorne - IBM

Further Together: Curated Pairing Culture @Pivotal Neha Batra @nerdneha

Higher Education Trends August 11, 2016 Pivotal Juncture Louis Soares Vice President for

SC Links Integration Point 7 3 rd HiLumi LHC-LARP Daresbury (UK) Sylvain Weisz and

Antiproton production in p-He collisions, and more, at LHCb LHCb on a Space Mission He at 6.5

DO WE DO WE NEED NEED URBAN RBAN PARAMETE PA TERIZATI TION IN HIGH RESO SOLUTI TION

The MJO in Tropical Total Ozone The MJO in Tropical Total Ozone The MJO in Tropical Total Ozone

Social Choice: Single-Peaked Preferences Game Theory Course: Jackson, Leyton-Brown &amp; Shoham

Social Choice Inspired Ordinal Measurement Alexis Tsoukis LAMSADE - CNRS, Universit

Movie Recommender Johannes Kastner, Nemanja Ranitovic, Markus Endres University of Augsburg,

CMU 15-251 Social Choice Teachers: Anil Ada Ariel Procaccia (this time) Social choice theory

Sambuz

Useful Links

Newsletter

Mail Us

Social Choice: Single-Peaked Preferences Game Theory Course: Jackson, Leyton-Brown & Shoham