The Design and Implementation of Open vSwitch



  1. The Design and Implementation of Open vSwitch
     Ben Pfaff∗, Justin Pettit∗, Teemu Koponen∗, Ethan J. Jackson∗, Andy Zhou∗, Jarno Rajahalme∗, Jesse Gross∗, Alex Wang∗, Jonathan Stringer∗, Pravin Shelar∗, Keith Amidon†, Martin Casado∗
     ∗VMware, †Awake Networks

  2. What is Open vSwitch? From openvswitch.org: “Open vSwitch is a production quality, multilayer virtual switch licensed under the open source Apache 2.0 license. It is designed to enable massive network automation through programmatic extension, while still supporting standard management interfaces and protocols (e.g. NetFlow, sFlow, SPAN, RSPAN, CLI, LACP, 802.1ag).”

  3. Where is Open vSwitch Used?
     ● Broad support:
       – Linux, FreeBSD, NetBSD, Windows, ESX
       – KVM, Xen, Docker, VirtualBox, Hyper-V, …
       – OpenStack, CloudStack, OpenNebula, …
     ● Widely used:
       – Most popular OpenStack networking backend
       – Default network stack in XenServer
       – 1,440 hits in Google Scholar
       – Thousands of subscribers to OVS mailing lists

  4. Open vSwitch Architecture
     [Diagram: VMs (VM 1 … VM n) attach to the hypervisor. In userspace, ovs-vswitchd speaks OpenFlow to the controller and OVSDB to ovsdb-server, and talks to the kernel module over Netlink. The kernel module forwards packets between the VMs and the NICs.]

  5. Use Case: Network Virtualization
     [Diagram: an OpenFlow pipeline of Tables 0 through 24. A packet enters at ingress, traverses each table's flow list (Flow 1, Flow 2, …), and exits at egress. Example stages: Physical to Logical, L2 Lookup, …, Logical to Physical.]

  6. Implications for Forwarding Performance
     [Diagram: the same pipeline of Tables 0 through 24, where table i costs k_i hash lookups.]
     100+ hash lookups per packet for tuple space search?
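The lookup cost above comes from tuple space search, where each table groups its flows by wildcard mask ("tuple") and pays one hash lookup per distinct mask. A minimal sketch, with hypothetical field names and flows (a real classifier would also rank matches by priority):

```python
def mask_key(packet, mask):
    """Project a packet onto just the fields a tuple cares about."""
    return tuple(sorted((f, packet[f]) for f in mask))

class TupleSpaceTable:
    def __init__(self):
        # Each distinct wildcard mask ("tuple") gets its own hash table.
        self.tuples = {}  # frozenset(fields) -> {masked key: action}

    def insert(self, match, action):
        mask = frozenset(match)
        self.tuples.setdefault(mask, {})[mask_key(match, mask)] = action

    def lookup(self, packet):
        # One hash lookup per tuple: a table with k_i tuples costs
        # k_i hash lookups per packet.
        for mask, ht in self.tuples.items():
            action = ht.get(mask_key(packet, mask))
            if action is not None:
                return action
        return None

table = TupleSpaceTable()
table.insert({"ip_src": "10.0.0.1"}, "output:1")
table.insert({"eth_dst": "aa:bb:cc:dd:ee:ff", "ip_dst": "10.0.0.2"}, "output:2")

pkt = {"ip_src": "10.0.0.1", "ip_dst": "10.0.0.9",
       "eth_dst": "11:22:33:44:55:66", "tcp_dst": 80}
print(table.lookup(pkt))  # probes both tuples; prints "output:1"
```

With 25 such tables in a pipeline, every packet pays the sum of the per-table tuple counts, which is where the "100+ hash lookups" figure comes from.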

  7. Non-solutions
     ● All of these helped:
       – Multithreading
       – Userspace RCU
       – Batching packet processing
       – Classifier optimizations
       – Microoptimizations
     ● None of it helped enough: % versus x.
     Classification is expensive on general-purpose CPUs!

  8. OVS Cache v1: Microflow Cache
     Microflow:
     ● Complete set of packet headers and metadata
     ● Suitable for hash table
     ● Shaded data below: Eth | IP | TCP | payload
     [Diagram: in the kernel, a hit in the microflow cache forwards the packet directly; a miss goes up to the userspace OpenFlow tables, and (in theory) on to the OpenFlow controller.]
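Because a microflow key covers every header field, the cache can be a plain exact-match hash table. A sketch of the idea, with a hypothetical field set and a stand-in slow path:

```python
def microflow_key(packet):
    # Every header field participates in the key, so two packets share
    # a cache entry only if all of their headers are identical.
    return tuple(sorted(packet.items()))

class MicroflowCache:
    def __init__(self, slow_path):
        self.cache = {}             # exact-match table: key -> actions
        self.slow_path = slow_path  # full OpenFlow pipeline, run on a miss

    def process(self, packet):
        key = microflow_key(packet)
        actions = self.cache.get(key)   # hit: a single hash lookup
        if actions is None:
            actions = self.slow_path(packet)  # miss: traverse all tables
            self.cache[key] = actions         # cache for later packets
        return actions

cache = MicroflowCache(slow_path=lambda pkt: "output:2")
pkt = {"eth_dst": "aa:bb:cc:dd:ee:ff", "ip_src": "10.0.0.1", "tcp_dst": 80}
cache.process(pkt)  # first packet: slow path runs, entry installed
cache.process(pkt)  # later packets: one hash lookup, slow path skipped
```

The design choice is that the fast path does no classification at all; all wildcard matching is pushed to the slow path, executed once per microflow.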

  9. Speedup with Microflow Cache
     [Diagram: the microflow cache (1 hash lookup) sits in front of the pipeline of Tables 0 through 24 (k_0 + k_1 + ∙∙∙ + k_24 hash lookups).]
     From 100+ hash lookups per packet, to just 1!

  10. Microflow Caching in Practice
     ● Tremendous speedup for most workloads
     ● Problematic traffic patterns:
       – Port scans
         ● Malicious
         ● Accidental (!)
       – Peer-to-peer rendezvous applications
       – Some kinds of network testing
     ● All of this traffic has lots of short-lived microflows
       – Fundamental caching problem: low hit rate

  11. Using a More Expensive Cache
     [Diagram: a cache costing k_c hash lookups sits in front of the pipeline of Tables 0 through 24.]
     If k_c << k_0 + k_1 + … + k_24: benefit!

  12. Naive Approach to Populating Cache
     Combine tables 0…24 into one flow table. Easy! Usually, k_c << k_0 + k_1 + … + k_24. But:
     [Diagram: crossing Table 0 (ip_src=a, b, c, d), Table 1 (ip_dst=e, f, g, h), …, Table 24 (eth_dst=i, j, k, m) yields combined flows such as ip_src=a, ip_dst=e, …, eth_dst=i. With n_1, n_2, …, n_24 flows per table, the combined table holds up to n_1 × n_2 × ∙∙∙ × n_24 flows.]
     “Crossproduct Problem”
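The blow-up is easy to see concretely. A toy illustration with made-up matches, mirroring the slide's three tables:

```python
import itertools

# Three small per-table flow lists (hypothetical matches from the slide).
table0 = [{"ip_src": s} for s in ("a", "b", "c", "d")]    # n_1 = 4 flows
table1 = [{"ip_dst": d} for d in ("e", "f", "g", "h")]    # n_2 = 4 flows
table24 = [{"eth_dst": m} for m in ("i", "j", "k", "m")]  # n_24 = 4 flows

# Precomputing one combined table means taking the cross product:
combined = [{**f0, **f1, **f24}
            for f0, f1, f24 in itertools.product(table0, table1, table24)]
print(len(combined))  # 4 * 4 * 4 = 64 combined flows from 12 originals
```

With realistic table sizes the product n_1 × n_2 × ∙∙∙ × n_24 is astronomically larger than the sum, which is why precomputing the combined table is infeasible.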

  13. Lazy Approach to Populating Cache
     Solution: Build cache of combined “megaflows” lazily as packets arrive.
     [Diagram: the megaflow cache holds combined entries such as ip_src=a, ip_dst=f, …, eth_dst=i, populated dynamically only from actually observed packets.]
     Same (or better!) table lookups as naive approach. Traffic locality yields practical cache size.
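The lazy scheme can be sketched as follows: on a miss, the slow path runs the full pipeline, records which fields it actually consulted, and installs a wildcarded entry covering only those fields. Field names and the pipeline are hypothetical, and the linear scan stands in for the real megaflow classifier:

```python
class MegaflowCache:
    def __init__(self, pipeline):
        self.entries = []         # (matched fields, masked key, actions)
        self.pipeline = pipeline  # returns (actions, fields consulted)

    def lookup(self, packet):
        for fields, key, actions in self.entries:
            if tuple(packet[f] for f in fields) == key:
                return actions
        return None

    def process(self, packet):
        actions = self.lookup(packet)
        if actions is None:
            # Slow path: traverse tables 0..24, noting consulted fields,
            # then install a megaflow wildcarding everything else.
            actions, fields = self.pipeline(packet)
            self.entries.append((fields, tuple(packet[f] for f in fields),
                                 actions))
        return actions

# Hypothetical pipeline that only ever examines ip_src.
def pipeline(packet):
    return ("output:1" if packet["ip_src"] == "a" else "drop"), ("ip_src",)

cache = MegaflowCache(pipeline)
cache.process({"ip_src": "a", "tcp_dst": 80})   # miss: one megaflow installed
# A different TCP port hits the same megaflow; no slow-path call needed.
print(cache.process({"ip_src": "a", "tcp_dst": 443}))  # prints "output:1"
```

Because the entry wildcards tcp_dst, one megaflow covers every microflow that differs only in fields the pipeline never looked at, which is what keeps the cache size practical.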

  14. OVS Cache v2: “Megaflow” Cache
     [Diagram: in the kernel, a hit in the megaflow cache forwards the packet directly; a miss goes up to the userspace OpenFlow tables.]

  15. Making Megaflows Better
     ● Megaflows are more effective when they match fewer fields.
       – Megaflows that match TCP ports are almost like microflows!
       – Described approach matches every field that appears in any flow table
     ● Requirements:
       – online
       – fast
     ● Contribution: Megaflow generation improvements (Section 5).

  16. Megaflow vs. Microflow Cache Performance
     ● Microflow cache:
       – k_0 + k_1 + ∙∙∙ + k_24 lookups for first packet in microflow
       – 1 lookup for later packets in microflow
     ● Megaflow cache:
       – k_c lookups for (almost) every packet
     ● k_c > 1 is normal, so megaflows perform worse in common case!
     ● Best of both worlds would be:
       – k_c lookups for first packet in microflow
       – 1 lookup for later packets in microflow

  17. OVS Cache v3: Dual Caches
     [Diagram: in the kernel, a packet first probes the microflow cache (μ hit); on a miss it probes the megaflow cache (M hit); only a miss in both goes up to the userspace OpenFlow tables.]
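The dual-cache arrangement can be sketched as an exact-match microflow cache consulted first, falling back to the megaflow cache. Toy code, not the OVS kernel datapath; the megaflow lookup is a stand-in costing k_c hash lookups:

```python
class DualCache:
    def __init__(self, megaflow_lookup):
        self.microflow = {}                     # exact key -> actions
        self.megaflow_lookup = megaflow_lookup  # k_c lookups per call

    def process(self, packet):
        key = tuple(sorted(packet.items()))
        actions = self.microflow.get(key)       # 1 hash lookup (μ hit)
        if actions is None:
            actions = self.megaflow_lookup(packet)  # k_c lookups (M hit)
            self.microflow[key] = actions       # seed the microflow cache
        return actions

calls = []
def megaflow_lookup(packet):
    calls.append(packet)       # count how often the megaflow layer runs
    return "output:3"

cache = DualCache(megaflow_lookup)
pkt = {"ip_src": "10.0.0.1", "tcp_dst": 80}
cache.process(pkt)  # first packet: falls through to the megaflow cache
cache.process(pkt)  # later packets: microflow hit, megaflow not consulted
print(len(calls))   # prints 1
```

This matches the "best of both worlds" cost from the previous slide: k_c lookups for the first packet of a microflow, one lookup for each later packet.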

  18. Parting Thoughts
     ● Architectural tension: expressibility vs. performance
     ● OpenFlow is expressive but troublesome to make fast on x86
       – Performance requirements can make applications avoid OpenFlow
     ● Caching provides OVS with expressibility and performance
       – Applications can freely evolve decoupled from performance
       – Specialized code would be slower!
     ● Starting from a more general problem produced better results
