NetSlices: Scalable Mul/-Core Packet Processing in User-Space - PowerPoint PPT Presentation

NetSlices: ¡Scalable ¡Mul/-‑Core ¡Packet ¡ Processing ¡in ¡User-‑Space ¡ Tudor ¡Marian, ¡Ki ¡Suh ¡Lee, ¡Hakim ¡Weatherspoon ¡ Cornell ¡University ¡ ¡ Presented ¡by ¡Ki ¡Suh ¡Lee ¡

Packet ¡Processors ¡ • Essen/al ¡for ¡evolving ¡networks ¡ – Sophis/cated ¡func/onality ¡ – Complex ¡performance ¡enhancement ¡protocols ¡

Packet ¡Processors ¡ • Essen/al ¡for ¡evolving ¡networks ¡ – Sophis/cated ¡func/onality ¡ – Complex ¡performance ¡enhancement ¡protocols ¡ • Challenges: ¡ High-‑performance ¡and ¡ flexibility ¡ – 10GE ¡and ¡beyond ¡ – Tradeoffs ¡

SoMware ¡Packet ¡Processors ¡ • Low-‑level ¡(kernel) ¡vs. ¡High-‑level ¡(userspace) ¡ • Parallelism ¡ in ¡userspace: ¡Four ¡major ¡difficul/es ¡ – Overheads ¡& ¡Conten/on ¡ – Kernel ¡network ¡stack ¡ – Lack ¡of ¡control ¡over ¡hardware ¡resources ¡ – Portability ¡

Overheads ¡& ¡Conten/on ¡ NIC ¡ • Cache ¡coherence ¡ • Memory ¡Wall ¡ ¡ • Slow ¡cores ¡vs. ¡Fast ¡NICs ¡ CPU ¡ Memory ¡ Memory ¡

Kernel ¡network ¡stack ¡& ¡HW ¡control ¡ • Raw ¡socket: ¡ all ¡traffic ¡from ¡ all ¡NICs ¡to ¡user-‑space ¡ • Too ¡general, ¡hence ¡complex ¡network ¡stack ¡ • Hardware ¡and ¡soMware ¡are ¡loosely ¡coupled ¡ • Applica/ons ¡have ¡no ¡control ¡over ¡resources ¡ Applica/on ¡ Applica/on ¡ Network ¡ Network ¡ Network ¡ Applica/on ¡ Network ¡ Stack ¡ Stack ¡ Stack ¡ Applica/on ¡ Stack ¡ Applica/on ¡ Network ¡ Network ¡ Network ¡ Network ¡ Applica/on ¡ Stack ¡ Stack ¡ Network ¡ Stack ¡ Stack ¡ Applica/on ¡ Applica/on ¡ Stack ¡ Applica/on ¡ Raw ¡socket ¡ Network ¡ Stack ¡

Portability ¡ • Hardware ¡dependencies ¡ • Kernel ¡and ¡device ¡driver ¡modifica/ons ¡ – Zero-‑copy ¡ – Kernel ¡bypass ¡

Outline ¡ • Difficul/es ¡in ¡building ¡packet ¡processors ¡ • NetSlice ¡ • Evalua/on ¡ • Discussions ¡ • Conclusion ¡

NetSlice ¡ • Give ¡power ¡to ¡the ¡applica/on ¡ – Overheads ¡& ¡Conten/on ¡ – Lack ¡of ¡control ¡over ¡hardware ¡resources ¡ • Spa/al ¡par//oning ¡exploi/ng ¡NUMA ¡architecture ¡ – Kernel ¡network ¡stack ¡ • Streamlined ¡path ¡for ¡packets ¡ – Portability ¡ • No ¡zero-‑copy, ¡kernel ¡& ¡device ¡driver ¡modifica/ons ¡

NetSlice ¡Spa/al ¡Par//oning ¡ • Independent ¡(parallel) ¡execu/on ¡contexts ¡ – Split ¡each ¡Network ¡Interface ¡Controller ¡(NIC) ¡ • One ¡NIC ¡queue ¡per ¡NIC ¡per ¡context ¡ – Group ¡and ¡split ¡the ¡CPU ¡cores ¡ – Implicit ¡resources ¡(bus ¡and ¡memory ¡bandwidth) ¡ Temporal ¡par//oning ¡ Spa/al ¡par//oning ¡ (/me-‑sharing) ¡ (exclusive-‑access) ¡

NetSlice ¡Spa/al ¡Par//oning ¡Example ¡ • 2x ¡quad ¡core ¡Intel ¡Xeon ¡X5570 ¡(Nehalem) ¡ – Two ¡simultaneous ¡hyperthreads ¡– ¡OS ¡sees ¡16 ¡CPUs ¡ ¡ ¡ – Non ¡Uniform ¡Memory ¡Access ¡(NUMA) ¡ • QuickPath ¡point-‑to-‑point ¡interconnect ¡ – Shared ¡L3 ¡cache ¡

Streamlined ¡Path ¡for ¡Packets ¡ • Inefficient ¡conven/onal ¡network ¡stack ¡ – One ¡network ¡stack ¡“to ¡rule ¡them ¡all” ¡ ¡ – Performs ¡too ¡many ¡memory ¡accesses ¡ – Pollutes ¡cache, ¡context ¡switches, ¡synchroniza/on, ¡ system ¡calls, ¡blocking ¡API ¡ Heavyweight ¡ Network ¡ Stack ¡

Portability ¡ • No ¡zero-‑copy ¡ – Tradeoffs ¡between ¡portability ¡and ¡performance ¡ – NetSlices ¡achieves ¡both ¡ • No ¡hardware ¡dependency ¡ • A ¡run-‑/me ¡loadable ¡kernel ¡module ¡ ¡

NetSlice ¡API ¡ • Expresses ¡fine-‑grained ¡hardware ¡control ¡ • Flexible: ¡based ¡on ¡ ioctl � • Easy: ¡ open, read, write, close ¡ 1: ¡ ¡#include ¡"netslice.h" ¡ 19: ¡for ¡(;;) ¡{ ¡ 2: ¡ ¡ 20: ¡ ¡ ¡ssize_tcnt, ¡wcnt ¡= ¡0; ¡ 3: ¡structnetslice_rw_mul/ ¡{ ¡ 21: ¡ ¡ ¡if ¡((cnt ¡= ¡read(fd, ¡iov, ¡IOVS)) ¡< ¡0) ¡ 4: ¡ ¡ ¡int ¡flags; ¡ 22: ¡ ¡ ¡ ¡ ¡ ¡ ¡EXIT_FAIL_MSG("read"); ¡ 5: ¡} ¡rw_mul/; ¡ 23: ¡ 6: ¡ 24: ¡ ¡ ¡for ¡(i ¡= ¡0; ¡i<cnt; ¡i++) ¡ 7: ¡structnetslice_cpu_mask ¡{ ¡ 25: ¡ ¡ ¡ ¡ ¡ ¡ ¡/* ¡iov_rlen ¡marks ¡bytes ¡read ¡*/ ¡ 8: ¡ ¡ ¡cpu_set_tk_peer, ¡u_peer; ¡ 26: ¡ ¡ ¡ ¡ ¡ ¡ ¡scan_pkg(iov[i].iov_base, ¡iov[i].iov_rlen); ¡ 9: ¡} ¡mask; ¡ 27: ¡ ¡ ¡do ¡{ ¡ 10: ¡ 28: ¡ ¡ ¡ ¡ ¡ ¡ ¡size_twr_iovs; ¡ 11: ¡fd ¡= ¡open("/dev/netslice-‑1", ¡O_RDWR); ¡ 29: ¡ ¡ ¡ ¡ ¡ ¡ ¡/* ¡write ¡iov_rlen ¡bytes ¡*/ ¡ 12: ¡ 30: ¡ ¡ ¡ ¡ ¡ ¡ ¡wr_iovs ¡= ¡write(fd, ¡&iov[wcnt], ¡cnt-‑wcnt); ¡ 13: ¡rw_mul/.flags ¡= ¡MULTI_READ ¡| ¡MULTI_WRITE; ¡ 31: ¡ ¡ ¡ ¡ ¡ ¡ ¡if ¡(wr_iovs< ¡0) ¡ 14: ¡ioctl(fd, ¡NETSLICE_RW_MULTI_SET, ¡&rw_mul/); ¡ 32: ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡EXIT_FAIL_MSG("write"); ¡ 15: ¡ioctl(fd, ¡NETSLICE_CPUMASK_GET, ¡&mask); ¡ 33: ¡ ¡ ¡ ¡ ¡ ¡ ¡wcnt ¡+= ¡wr_iovs; ¡ 16: ¡sched_setaffinity(getpid(), ¡sizeof(cpu_set_t), ¡ ¡ 34: ¡ ¡ ¡} ¡while ¡(wcnt<cnt); ¡ 17: ¡ ¡ ¡&mask.u_peer); ¡ 35: ¡} ¡ 18 ¡ ¡

NetSlice ¡Evalua/on ¡ • Compare ¡against ¡state-‑of-‑the-‑art ¡ – RouteBricks ¡in-‑kernel, ¡Click ¡& ¡pcap-‑mmap ¡user-‑space ¡ • Addi/onal ¡baseline ¡scenario ¡ – All ¡traffic ¡through ¡single ¡NIC ¡queue ¡(receive-‑livelock) ¡ • What ¡is ¡the ¡basic ¡forwarding ¡performance? ¡ – How ¡efficient ¡is ¡the ¡streamlining ¡of ¡one ¡NetSlice? ¡ • How ¡is ¡NetSlice ¡scaling ¡with ¡the ¡number ¡of ¡cores? ¡

Experimental ¡Setup ¡ • R710 ¡packet ¡processors ¡ ¡ – dual ¡socket ¡quad ¡core ¡2.93GHz ¡Xeon ¡X5570 ¡ (Nehalem) ¡ – 8MB ¡of ¡shared ¡L3 ¡cache ¡and ¡12GB ¡of ¡RAM ¡ • 6GB ¡connected ¡to ¡each ¡of ¡the ¡two ¡CPU ¡sockets ¡ • Two ¡Myri-‑10G ¡NICs ¡ • R900 ¡client ¡end-‑hosts ¡ – four ¡socket ¡2.40GHz ¡Xeon ¡E7330 ¡(Penryn) ¡ – 6MB ¡of ¡L2 ¡cache ¡and ¡32GB ¡of ¡RAM ¡

Simple ¡Packet ¡Rou/ng ¡ • End-‑to-‑end ¡throughput, ¡MTU ¡(1500 ¡byte) ¡ packets ¡ 12000 ¡ best ¡configura/on ¡ 9.7 ¡ 9.7 ¡ 9.7 ¡ 10000 ¡ receive-‑livelock ¡ Throughput ¡(Mbps) ¡ 74% ¡of ¡ ¡ 7.6 ¡ 7.5 ¡ kernel ¡ 8000 ¡ 5.6 ¡ 6000 ¡ 4000 ¡ 1/11 ¡of ¡ 2.3 ¡ 2.3 ¡ NetSlice ¡ 2000 ¡ 0 ¡ kernel ¡ RouteBricks ¡ NetSlice ¡ pcap ¡ pcap-‑mmap ¡ Click ¡user-‑ space ¡

Linear ¡Scaling ¡with ¡CPUs ¡ • IPsec ¡with ¡128 ¡bit ¡key—typically ¡used ¡by ¡VPN ¡ – AES ¡encryp/on ¡in ¡Cipher-‑block ¡Chaining ¡mode ¡ 10000 ¡ 9.2 ¡ RouteBricks ¡ 8.5 ¡ 9000 ¡ NetSlice ¡ 8000 ¡ pcap ¡ Throughput ¡(Mbps) ¡ 7000 ¡ pcap-‑mmap ¡ 6000 ¡ Click ¡user-‑space ¡ 5000 ¡ 4000 ¡ 3000 ¡ 2000 ¡ 1000 ¡ 0 ¡ 2 ¡ 4 ¡ 6 ¡ 8 ¡ 10 ¡ 12 ¡ 14 ¡ 16 ¡ # ¡of ¡CPUs ¡used ¡

Outline ¡ • Difficul/es ¡in ¡building ¡packet ¡processors ¡ • Netslice ¡ • Evalua/on ¡ • Discussions ¡ • Conclusion ¡

SoMware ¡Packet ¡Processors ¡ • Can ¡support ¡10GE ¡and ¡more ¡at ¡line-‑speed ¡ – Batching ¡ • Hardware, ¡device ¡driver, ¡cross-‑domain ¡batching ¡ – Hardware ¡support ¡ • Mul/-‑queue, ¡mul/-‑core, ¡NUMA ¡, ¡GPU ¡ – Removing ¡IRQ ¡overhead ¡ – Removing ¡memory ¡overhead ¡ • Including ¡zero-‑copy ¡ – Bypassing ¡kernel ¡network ¡stack ¡

SoMware ¡Packet ¡Processors ¡ Batching ¡ Parallelism ¡ Zero-‑Copy ¡ Portability ¡ Domain ¡ Raw ¡socket ¡ User ¡ RouteBricks ¡ Kernel ¡ PacketShader ¡ User ¡ PF_RING ¡ User ¡ netmap ¡ User ¡ Kernel-‑bypass ¡ User ¡ NetSlice ¡ User ¡

NetSlices: Scalable Mul/-Core Packet Processing in User-Space - PowerPoint PPT Presentation

NetSlices: Scalable Mul/-Core Packet Processing in User-Space Tudor Marian, Ki Suh Lee, Hakim Weatherspoon Cornell University Presented by Ki Suh

Network Slicing Terms and Systems draft-galis-netslices-revised-problem-statement-01

Towards High- -performance performance Towards High Flow- -level Packet Processing level

Fast, Scalable, and Programmable Packet Scheduler in Hardware Vishal Shrivastav Cornell

PACKET PROCESSING ON GPU Elena Agostini SW Engineer, Nvidia Chetan Tekur - Solution

Towards TVF 4 TVF 3 TVF 2 TVF 1 r log(Packet Value) r + D 3 D 3 r + D 3 + D 2 Core-Stateless

Main Use Cases and Gap Analysis for Network Slicing draft-netslices-usecases-01

Main Use Cases and Gap Analysis for Network Slicing draft-netslices-usecases-01

Scalable Interconnection Networks 1 Scalable, High Performance Network At Core of Parallel

DRFQ : Multi-Resource Fair Queueing for Packet Processing Ali Ghodsi 1,3 , Vyas Sekar 2 , Matei

SAX-PAC (Scalable And eXpressive PAcket Classification) Kirill Kogan Purdue University and

Exploiting Order Independence for Scalable and Expressive Packet Classification Author: Kirill

Does your tool support PAPI SDEs yet? 13 th Scalable Tools Workshop Anthony Danalis, Heike

Reliable and Scalable Packet Striping Hari Adiseshu Guru Parulkar George Varghese Washington

A Scalable Processing-in-Memory Accelerator for Parallel Graph Processing Seminar on Computer

Scalable Name-Based Packet Forwarding: From Millions to Billions Tian Song , songtian@bit.edu.cn,

Hybrid cache architecture for high-speed packet processing Z. Liu, K. Zheng and B. Liu Abstract:

Stash Directory: A Scalable Directory for Many-Core

ECE 697J Advanced Topics Advanced Topics ECE 697J in Computer Networks in Computer

OpenCL-Based Design Pattern for Line Rate Packet Processing Jehandad Khan, Peter Athanas

StriD 2 FA: Scalable Regular Expression Matching for Deep Packet Inspection Xiaofei Wang

What is Scalable Data Processing? S CALABLE DATA P ROCES S IN G IN R Michael J. Kane and Simon

Scalable Multi-core Model Checking: Technology & Applications of Brute Force Day I:

GASPP: A GPU-Accelerated Stateful Packet Processing Framework

Introduction to Packet Tracer What is Packet Tracer? Packet Tracer is a protocol simulator

NetSlices: Scalable Mul/-Core Packet Processing in User-Space - PowerPoint PPT Presentation

NetSlices: Scalable Mul/-Core Packet Processing in User-Space Tudor Marian, Ki Suh Lee, Hakim Weatherspoon Cornell University Presented by Ki Suh

Network Slicing Terms and Systems draft-galis-netslices-revised-problem-statement-01

Towards High- -performance performance Towards High Flow- -level Packet Processing level

Fast, Scalable, and Programmable Packet Scheduler in Hardware Vishal Shrivastav Cornell

PACKET PROCESSING ON GPU Elena Agostini SW Engineer, Nvidia Chetan Tekur - Solution

Towards TVF 4 TVF 3 TVF 2 TVF 1 r log(Packet Value) r + D 3 D 3 r + D 3 + D 2 Core-Stateless

Main Use Cases and Gap Analysis for Network Slicing draft-netslices-usecases-01

Main Use Cases and Gap Analysis for Network Slicing draft-netslices-usecases-01

Scalable Interconnection Networks 1 Scalable, High Performance Network At Core of Parallel

DRFQ : Multi-Resource Fair Queueing for Packet Processing Ali Ghodsi 1,3 , Vyas Sekar 2 , Matei

SAX-PAC (Scalable And eXpressive PAcket Classification) Kirill Kogan Purdue University and

Exploiting Order Independence for Scalable and Expressive Packet Classification Author: Kirill

Does your tool support PAPI SDEs yet? 13 th Scalable Tools Workshop Anthony Danalis, Heike

Reliable and Scalable Packet Striping Hari Adiseshu Guru Parulkar George Varghese Washington

A Scalable Processing-in-Memory Accelerator for Parallel Graph Processing Seminar on Computer

Scalable Name-Based Packet Forwarding: From Millions to Billions Tian Song , songtian@bit.edu.cn,

Hybrid cache architecture for high-speed packet processing Z. Liu, K. Zheng and B. Liu Abstract:

Stash Directory: A Scalable Directory for Many-Core

ECE 697J Advanced Topics Advanced Topics ECE 697J in Computer Networks in Computer

OpenCL-Based Design Pattern for Line Rate Packet Processing Jehandad Khan, Peter Athanas

StriD 2 FA: Scalable Regular Expression Matching for Deep Packet Inspection Xiaofei Wang

What is Scalable Data Processing? S CALABLE DATA P ROCES S IN G IN R Michael J. Kane and Simon

Scalable Multi-core Model Checking: Technology &amp; Applications of Brute Force Day I:

GASPP: A GPU-Accelerated Stateful Packet Processing Framework

Introduction to Packet Tracer What is Packet Tracer? Packet Tracer is a protocol simulator

Scalable Multi-core Model Checking: Technology & Applications of Brute Force Day I: