YES, I test in production. And so should you. By Charity Majors (@mipsytipsy)

@mipsytipsy engineer/cofounder/CEO “the only good diff is a red diff” https://charity.wtf

Testing in production has gotten a bad rap.
• Cautionary Tale
• Punch Line
• Serious Strategy

(I blame this guy)

how they think we are vs. how we should be

Test(n): take measures to check the quality, performance, or reliability.
Prod(n): where your users are.

"Testing in production" should not be used as an excuse to skimp on testing or spend less. I am here to tell you how to test *better*, not to help you half-ass it.

Our idea of what the software development lifecycle even looks like is overdue for an upgrade in the era of distributed systems.

Deploying code is not a binary switch. Deploying code is a process of increasing your confidence in your code.

(Diagram: Development, deploy, Production)

(Diagram: Development, Production, Observability)

why now?

“Complexity is increasing” - Science

LAMP stack => distributed systems
monitoring => observability
known unknowns => unknown unknowns

Your system is never entirely ‘up’. Many catastrophic states exist at any given time.

why does this matter more and more?
• We are all distributed systems engineers now
• the unknowns outstrip the knowns
• and unknowns are untestable

Distributed systems are particularly hostile to being cloned or imitated (or monitored). (clients, concurrency, chaotic traffic patterns, edge cases …)

Distributed systems have an infinitely long list of almost-impossible failure scenarios that make staging environments particularly worthless. This is a black hole for engineering time.

Only production is production. You can ONLY verify the deploy for any env by deploying to that env.

1. Every deploy is a *unique* exercise of your process + code + system.
2. Deploy scripts are production code. If you’re using fabric or capistrano, this means you have fab/cap in production. 😴

Staging is not production.

Why do people sink so much time into staging, when they can’t even tell if their own production environment is healthy or not?

You can catch 80% of the bugs with 20% of the effort. And you should. That energy is better used elsewhere: Production. @caitie’s PWL talk: https://youtu.be/-3tw2MYYT0Q

You need to watch your code run with:
• Real data
• Real users
• Real traffic
• Real scale
• Real concurrency
• Real network
• Real deploys
• Real unpredictabilities

Staging != Prod
• Environmental differences
• Security of user data
• Cost of duplication
• Time/Effort (diminishing returns)
• Uncertainty of user patterns

(Diagram: Development, deploy, Production)

test before prod:
• does it work
• does my code run
• does it fail in the ways i can predict
• does it fail in the ways it has previously failed prod

test in prod:
• behavioral tests
• experiments
• load tests (!!)
• edge cases
• canaries
• weird bugs
• prod data stuff
• rolling deploys
• multi-region

More reasons:
• You are testing DR or chaos engineering
• Beta programs where customers can try new features
• Internal users get new things first
• You have to test with production data
• To lower the risk of deployments, you deploy more frequently
• You need higher concurrency, etc., to repro a bug

test before prod:
• does it work
• does my code run
• does it fail in the ways i can predict
• does it fail in the ways it has previously failed prod
Known unknowns

test in prod:
• behavioral tests
• experiments
• load tests (!!)
• edge cases
• canaries
• weird bugs
• prod data stuff
• rolling deploys
• multi-region
Unknown unknowns (everything else)
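A canary is the easiest item on that list to picture in code. The sketch below is not from the talk: it assumes you can count requests and errors for a baseline cohort and a canary cohort, and the names `CohortStats` and `canary_verdict`, plus the `min_requests` and `tolerance` defaults, are all hypothetical. Real canary analysis usually looks at far more than one error rate.

```python
from dataclasses import dataclass


@dataclass
class CohortStats:
    """Request and error counts for one group of hosts (hypothetical shape)."""
    requests: int
    errors: int

    @property
    def error_rate(self) -> float:
        # Avoid dividing by zero when a cohort has seen no traffic yet.
        return self.errors / self.requests if self.requests else 0.0


def canary_verdict(baseline: CohortStats, canary: CohortStats,
                   min_requests: int = 500, tolerance: float = 0.01) -> str:
    """Return 'wait', 'promote', or 'rollback' for a canary cohort.

    min_requests and tolerance are illustrative defaults, not recommendations.
    """
    if canary.requests < min_requests:
        return "wait"  # not enough real traffic to judge yet
    if canary.error_rate > baseline.error_rate + tolerance:
        return "rollback"  # canary is clearly worse than the old build
    return "promote"
```

The point of the `wait` branch is that the canary only tells you something once real users have actually exercised it.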

test in staging? meh

Risks:
• Expose security vulnerabilities
• Data loss or contamination
• Cotenancy risks
• The app may die
• You might saturate a resource
• No rollback if you make a permanent error
• Chaos tends to cascade
• May cause a user to have a bad experience

also build or use:
• feature flags (LaunchDarkly)
• high cardinality tooling (Honeycomb)
• canary canary canaries, shadow systems (goturbine, linkerd)
• capture/replay for databases (apiary, percona)
• plz dont build your own ffs
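To make the feature flag idea concrete, here is a minimal sketch of a percentage rollout, assuming a string user id and a flag name. It only illustrates the mechanism: `in_rollout` is a hypothetical helper, not the API of LaunchDarkly or any other product, and the advice above about not building your own still stands.

```python
import hashlib


def in_rollout(flag: str, user_id: str, percent: float) -> bool:
    """Decide whether a user sees a flagged feature.

    Hashing flag + user id gives every user a stable bucket from 0 to 9999,
    so the same user gets the same answer on every request, and raising
    `percent` only ever adds users to the rollout.
    """
    digest = hashlib.sha256(f"{flag}:{user_id}".encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:8], "big") % 10_000
    return bucket < percent * 100


# Hypothetical usage: ship dark at 0%, then widen the rollout step by step.
if in_rollout("new-checkout", "user-42", percent=5):
    pass  # serve the new code path
```

Including the flag name in the hash means different flags pick different subsets of users, so the same unlucky 5% are not the first to see every experiment.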

Be less afraid:
• Feature flags
• Robust isolation
• Caps on dangerous behaviors
• Auto scaling or orchestration
• Query limits, auto throttling
• Limits and alarms
• Create test data with a clear naming convention
• Separate credentials
• Be extra wary of testing during peak load hours
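Two of those guardrails, a clear naming convention for test data and a cap on dangerous behavior, fit in a few lines. This is a sketch under stated assumptions: the `prodtest-` prefix, the cap of 100, and the function names are all hypothetical and do not come from the talk.

```python
TEST_PREFIX = "prodtest-"  # hypothetical naming convention for test records
MAX_DELETE = 100           # hypothetical cap on one cleanup run


def make_test_name(label: str) -> str:
    """Name a test record so it is obviously not real user data."""
    return f"{TEST_PREFIX}{label}"


def is_test_record(name: str) -> bool:
    """True only for names that follow the test-data convention."""
    return name.startswith(TEST_PREFIX)


def select_for_cleanup(names: list[str], limit: int = MAX_DELETE) -> list[str]:
    """Pick the records a cleanup job may delete.

    Only records carrying the test prefix are ever selected, and the job
    refuses to run at all if it would touch more than `limit` records.
    """
    doomed = [n for n in names if is_test_record(n)]
    if len(doomed) > limit:
        raise RuntimeError(
            f"refusing to delete {len(doomed)} records (cap is {limit})"
        )
    return doomed
```

Refusing outright, instead of quietly deleting the first 100, is a deliberate choice here: an unexpectedly large batch is a signal that something is wrong and a human should look.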

Failure is not rare.
Practice shipping and fixing lots of small problems.
And practice on your users!!

Failure: it’s “when”, not “if” (lots and lots and lots of “when’s”)

Does everyone …
• know what normal looks like?
• know how to deploy?
• know how to roll back?
• know how to canary?
• know how to debug in production?
Practice!!

Charity Majors @mipsytipsy
