Docker Orchestration: Beyond the Basics
Aaron Lehmann, Software Engineer, Docker

About me
• Software engineer at Docker
• Maintainer of the SwarmKit and Docker Engine open source projects
• Focusing on distributed state, task scheduling, and rolling updates

Swarm mode

Swarm mode is Docker’s built-in orchestration
• Docker can orchestrate containers over multiple machines without extra software
• Example: running instances of a web service on several machines

Getting started with swarm mode
• Initialize a new swarm:
    mgr-1$ docker swarm init
• Join an existing swarm:
    worker-1$ docker swarm join --token <token> 192.168.65.2:2377

Swarm mode: Services
• Swarm mode deals with services, not individual containers
• Each service creates one or more replica tasks, which are run as containers
• On a manager, create a new service for a search microservice application:
    mgr-1$ docker service create -p 8080:8080 --name search \
        --replicas 4 searchsvc:v1.0
    mgr-1$ docker service ls
    ID            NAME    REPLICAS  IMAGE           COMMAND
    2xtw9qipmbe9  search  4/4       searchsvc:v1.0
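To see the replica tasks behind a service, the tasks can be listed per service and the replica count adjusted afterwards. The following is a minimal sketch, assuming the `search` service from the slide already exists; the helper name `show_and_scale` and the target of 6 replicas are illustrative choices, not part of the talk.

```shell
# Hypothetical helper: list the tasks of a service, then change its replica count.
# $1 = service name, $2 = desired number of replicas
show_and_scale() {
  # Each row of `docker service ps` is one task (one container) of the service.
  docker service ps "$1"
  # Ask the swarm to converge on the new number of replica tasks.
  docker service scale "$1=$2"
}

# Example (run on a manager): show_and_scale search 6
```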

Swarm mode: Nodes
• Worker nodes just run service tasks
• Manager nodes manage the swarm
    mgr-1$ docker node ls
    ID                           HOSTNAME  STATUS  AVAILABILITY  MANAGER STATUS
    drwxwi4h2fb0tcrwgmpmma2x0 *  mgr-1     Ready   Active        Leader
    1mhtdwhvsgr3c26xxbnzdc3yp    mgr-2     Ready   Active        Reachable
    516pacagkqp2xc3fk9t1dhjor    mgr-3     Ready   Active        Reachable
    9j68exjopxe7wfl6yuxml7a7j    worker-1  Ready   Active
    03g1y59jwfg7cf99w4lt0f662    worker-2  Ready   Active
    dxn1zf6l61qsb1josjja83ngz    worker-3  Ready   Active

Swarm mode topology
[Diagram: three manager nodes above a row of worker nodes; the workers run the Search service and Billing service containers.]

High availability

High availability
• Survive failures of some portion of workers and managers
• If a worker fails, its assigned tasks are rescheduled elsewhere
• What about manager failures?
• Managers are part of a Raft cluster that replicates the state of the swarm

Raft
• Raft is a protocol for maintaining a strongly consistent distributed log
• A way to avoid a single point of failure

Raft concepts
• Quorum: A majority of managers
• Leader: Randomly chosen manager that can add information to the distributed log
• Election: The process of choosing a new leader

High availability
• The leader is the manager that:
  • Makes the scheduling decisions
  • Keeps track of node health
  • Handles API calls
• If the leader fails, another manager is elected in its place
• For Raft to function, more than half the managers (a quorum) must be reachable

How many managers for a swarm?
• A single manager is fine in some scenarios
• Any swarm meant to survive a manager failure should have 3 or 5 managers
• There is no scaling benefit to adding more managers
  • Each one replicates a full copy of the swarm's state

Manager fault tolerance

    Number of managers   Majority   Tolerated failures
    1                    1          0
    2                    2          0
    3                    2          1
    4                    3          1
    5                    3          2
    6                    4          2
    7                    4          3
    8                    5          3
    9                    5          4
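The figures in the fault tolerance table follow from simple integer arithmetic: with n managers, a majority is floor(n/2) + 1, and the swarm tolerates n minus that majority in failures. A small sketch that reproduces the table; the function names `majority` and `tolerated_failures` are illustrative, not Docker commands.

```shell
# Majority (quorum size) for n managers: floor(n / 2) + 1.
majority() {
  echo $(( $1 / 2 + 1 ))
}

# Failures tolerated while still keeping a quorum:
# n - majority(n), which equals floor((n - 1) / 2).
tolerated_failures() {
  echo $(( ($1 - 1) / 2 ))
}

# Print the table for 1 to 9 managers.
printf '%-10s %-10s %s\n' "Managers" "Majority" "Tolerated failures"
n=1
while [ "$n" -le 9 ]; do
  printf '%-10s %-10s %s\n' "$n" "$(majority "$n")" "$(tolerated_failures "$n")"
  n=$(( n + 1 ))
done
```

An even number of managers tolerates no more failures than the odd number just below it, which is why 3 or 5 managers are the usual choices.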

Where to deploy the managers
• Managers must have static IP addresses
• Managers should have very reliable connectivity to each other
• Swarms that span a big geographic area aren't recommended
  • Federation is being looked at as an eventual solution for multi-region deployments
• Spreading managers across a cloud provider's "availability zones" in one region may make sense

Advertised IP addresses
• All managers must be reachable by all other managers
• Managers need to know their own IP addresses so they can tell other managers how to reach them
• The address is autodetected if there is only one network device, or in the process of joining an existing swarm
• If the address can't be autodetected, provide --advertise-addr when running docker swarm init
• Many swarm instability issues are actually caused by managers not being able to communicate
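When a host has several network interfaces, the advertised address can be passed explicitly at initialization. A minimal sketch, assuming the manager's address is the 192.168.65.2 used in the earlier join example; the wrapper name `init_swarm` is hypothetical.

```shell
# Hypothetical wrapper: initialize a swarm with an explicit advertised address.
# $1 = the IP address (or interface name) other nodes should use to reach this manager
init_swarm() {
  docker swarm init --advertise-addr "$1"
}

# Example (run on the first manager): init_swarm 192.168.65.2
```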

What to do if quorum is lost
• Suppose two out of three managers fail
• The swarm won't be able to schedule tasks or perform administrative functions
• You will see timeouts from commands like docker node ls if this happens
• What if these managers are gone forever?
• Running docker swarm init --force-new-cluster on the surviving manager recovers from this state
• This modifies the swarm so that it only has a single manager
• From that point, new managers can be added
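The recovery steps can be sketched as a short sequence run on the surviving manager. This is an illustration under the assumption that the other managers are permanently gone; the function name `recover_quorum` is hypothetical, and the final step only prints the join command for new managers rather than adding them.

```shell
# Hypothetical helper: rebuild a single-manager swarm after quorum loss.
# Run only on the one surviving manager.
recover_quorum() {
  # Re-create the cluster from this manager's copy of the swarm state.
  docker swarm init --force-new-cluster
  # Confirm that commands respond again and inspect the node list.
  docker node ls
  # Print the command that new managers should run to join.
  docker swarm join-token manager
}
```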

Protecting managers from accidental overloading
• By default, managers will be assigned tasks just like workers
• This makes sense on a laptop-scale deployment
• Best practice for serious deployments: avoid running container workloads on managers
• Drain the managers to prevent them from running service tasks:
    mgr-1$ docker node update --availability=drain <manager id>
• Alternatively, set the node.role == worker constraint on all services
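Draining can be applied to every manager in one pass by listing the manager node IDs and updating each. A minimal sketch; the function name `drain_managers` is hypothetical. For the constraint alternative, a service would be created with `--constraint node.role==worker` instead.

```shell
# Hypothetical helper: set every manager node to drain availability.
drain_managers() {
  # --filter role=manager keeps only managers; --quiet prints just the node IDs.
  docker node ls --filter role=manager --quiet | while read -r node_id; do
    docker node update --availability drain "$node_id"
  done
}

# Example (run on a manager): drain_managers
```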

Rolling updates
• Important to avoid downtime during updates
• docker service update is a rolling update by default
• Parameters:
  • Update delay (--update-delay)
  • Update failure action: pause or continue (--update-failure-action)
  • Parallelism (--update-parallelism)

Rolling updates
[Diagram: timeline of a rolling update. Tasks are updated in batches whose size is the parallelism. The stages shown for each batch are prepare new, stop old, start new, health checks, and update delay, after which the next batch begins.]
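The three rolling update parameters can be combined in a single docker service update call. A sketch, assuming the `search` service from earlier; the image tag `searchsvc:v1.1`, the parallelism of 2, and the 10 second delay are illustrative values, and the function name `rolling_update` is hypothetical.

```shell
# Hypothetical helper: roll a service to a new image, two tasks at a time.
# $1 = service name, $2 = new image
rolling_update() {
  docker service update \
    --image "$2" \
    --update-parallelism 2 \
    --update-delay 10s \
    --update-failure-action pause \
    "$1"
}

# Example (run on a manager): rolling_update search searchsvc:v1.1
```

With pause as the failure action, the update stops at the first failed batch, leaving the remaining tasks on the old image.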

Security

Security model
• All swarm connections are encrypted and authenticated with mutual TLS
• Each node is identified by its certificate (CN = node ID)
• The certificate authorizes the node to act as either a worker or manager (OU = swarm-manager or OU = swarm-worker)
• By default, each manager operates as a certificate authority with the same CA key

Security around adding nodes
• How does a new node authenticate itself before having a certificate?
• It presents a join token, which is provided to docker swarm join
• The join token contains a secret that authorizes the new node to receive either a worker or manager certificate
• It also contains a digest of the root CA certificate, for protection against man-in-the-middle attacks
• The node does not use or store the join token after joining

Node joining example: adding a new worker
• On a manager, retrieve the join token:
    mgr-1$ docker swarm join-token worker
    To add a worker to this swarm, run the following command:
        docker swarm join \
        --token SWMTKN-1-5f7umqonkff6je2l1kqpxdsok3bwipn73hlr5dxtvx4lusy809-5yn6jy5zqqq3tnummvq365y7m \
        172.17.0.2:2377
• Run the command on the new worker:
    worker-1$ docker swarm join --token \
        SWMTKN-1-5f7umqonkff6je2l1kqpxdsok3bwipn73hlr5dxtvx4lusy809-5yn6jy5zqqq3tnummvq365y7m \
        172.17.0.2:2377
    This node joined a swarm as a worker.
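The two parts of a join token (the root CA digest and the secret) can be seen by splitting the token on its hyphens. This sketch assumes the layout is prefix, version, CA digest, then secret, which matches the example token above but is not stated on the slides; the function name `split_join_token` is hypothetical. It prints only the length of the secret rather than the secret itself.

```shell
# Hypothetical helper: split a join token into its hyphen-separated fields.
# Assumed layout: SWMTKN-<version>-<root CA digest>-<secret>
split_join_token() {
  IFS=- read -r prefix version ca_digest secret <<EOF
$1
EOF
  echo "prefix=$prefix"
  echo "version=$version"
  echo "ca_digest=$ca_digest"
  # Avoid echoing the secret; report only how long it is.
  echo "secret_length=${#secret}"
}

split_join_token "SWMTKN-1-5f7umqonkff6je2l1kqpxdsok3bwipn73hlr5dxtvx4lusy809-5yn6jy5zqqq3tnummvq365y7m"
```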

Node joining flow
[Diagram: the joining node sends its join token and a certificate request to a manager over TLS with no client certificate, and the manager returns a signed certificate. Node registration and task assignments then take place over mutually authenticated TLS.]

Rotating join tokens
• The join tokens remain valid until they are rotated
• It is good practice to periodically rotate them
• docker swarm join-token --rotate worker generates a new worker token to replace the old one
• docker swarm join-token --rotate manager generates a new manager token to replace the old one
