you build it you run it
play

You build it, you run it Matthias Rampke, SoundCloud You build it, - PowerPoint PPT Presentation

You build it, you run it Matthias Rampke, SoundCloud You build it, you run it Operating SoundCloud's microservice architecture GOTO Berlin 2016 Intro: me Who I am and where I work Engineer in Production Engineering (platform, monitoring,


  1. You build it, you run it Matthias Rampke, SoundCloud

  2. You build it, you run it Operating SoundCloud's microservice architecture GOTO Berlin 2016

  3. Intro: me Who I am and where I work Engineer in Production Engineering (platform, monitoring, availability) previously in Systems Engineering (ops remnant catch-all)

  4. Intro: SoundCloud Who I am and where I work a cloud full of sounds 135M tracks, 12M artists, 175M listeners 300+ employees no ops team

  5. Intro: Agenda Where we came from Where we are today Why we did it How you can do it How does this compare to…?

  6. ⋁ Where we came from ⋁

  7. In the beginning … the early days One team One table One codebase

  8. 2009/2010 growing pains 20-50 engineers hired an ops team, 24/7 on-call app team deploys the monolith first separate "micro"services

  9. 2011/2012 the fork in the road more microservices deployment platform SRE/platforms team multiple on-call rotations

  10. 2013-2015 maturing cambrian explosion of microservices feature teams and collectives client specific APIs shared components & libraries continuous delivery

  11. ⋁ Where we are today ⋁

  12. Org chart simplified

  13. Ownership You buildown it, you run it every feature • service • codebase is owned by a team

  14. On Call owners are on call for what they own groups of teams work together to reduce load remove alerts • write documentation

  15. Shared Components avoid shared infrastructure be flexible don't duplicate work

  16. Production Engineering run the systems that run systems monitoring & availability internal consulting

  17. ⋁ Why we did it ⋁

  18. Delivery get more done, consistently autonomy predictability velocity

  19. Personal growth learn something new every day no pure specialists internal mobility

  20. Better systems simple resilient operable

  21. ⋁ How you can do it ⋁

  22. Prerequisites basic automation openness pride trust

  23. Expanding ownership testing & deployment on-call provisioning dependencies

  24. Checks & Balances internal moves escalation paths documentation tooling

  25. Postmortems learn improve commiserate

  26. ⋁ How does this compare to …? ⋁

  27. Site Reliability Engineering as Google describes it no assignment to SWE teams no on-call handoff no deploy blocks

  28. Radical agility as Zalando describe it more shared code more communication infrastructure & core teams

  29. DevOps as described by Etsy no Ops team less shared infrastructure less standardization deploys spread in a different dimension

  30. Slides: https://bit.ly/gotober16-sc Please rate! . soundcloud.com Berlin New York San Francisco London

Recommend


More recommend