FAWN: A Fast Array of Wimpy Nodes David G. Andersen, Jason Franklin, - PowerPoint PPT Presentation

FAWN: A Fast Array of Wimpy Nodes David G. Andersen, Jason Franklin, Michael Kaminsky * , Amar Phanishayee, Lawrence Tan, Vijay Vasudevan Carnegie Mellon University, * Intel Labs SOSP’09 1 CAS – ICT – Storage System Group

Outline  Introduction  Problems  Designs  FAWN-KV  FAWN-DS  Evaluation  Related Work  Conclusions  Acknowledgments 2 CAS – ICT – Storage System Group

Introduction  Large-scale data-intensive applications are growing in both size and importance.  Common characteristics:  I/O intensive, requiring random access over large datasets;  Massively parallel with thousands of concurrent, mostly- independent operations;  High load requires large clusters to support;  The size of objects stored is typically small. 3 CAS – ICT – Storage System Group

Problems  Small-object random-access workloads are ill- served by conventional disk-based clusters.  DRAM-based clusters are expensive and consume a surprising amount of power. FAWN Flash Performance Energy 4 CAS – ICT – Storage System Group

What is FAWN?  FAWN:  Hardware: a specified wimpy node, embedded CPU as the processor and limited DRAM and flash as the storage medium.  Software: FAWN-KV System, a system that can manage thousands of FAWN nodes efficiently. 5 CAS – ICT – Storage System Group

Why FAWN?  Increasing CPU-I/O Gap  Using wimpy processors selected to reduce I/O-included idle cycles.  CPU power consumption grows super-linearly with speed  Dynamic power scaling on traditional systems is surprisingly inefficient 6 CAS – ICT – Storage System Group

FAWN-KV Architecture-I  Back-end: responsible for serving particular key.  Front-end: Front-end:Back-end = 1:n  Maintain membership list.  Forward requests to back-end node. Ring 7 CAS – ICT – Storage System Group

FAWN-KV Architecture-II Client Back-end Back-end FAWN-DS Front-end Switch …… Back-end Manages back-ends Back-end Routes Requests If the front-end which the client contacted with was not the back-end belonged to, How to deal this scene? 8 CAS – ICT – Storage System Group

FAWN-KV Architecture-III Map Client Back-end table Back-end FAWN-DS Front-end Switch …… Back-end Back-end Front-end 1 、 client aware of the front-end mapping 2 、 front-end cache values. 9 CAS – ICT – Storage System Group

FAWN-KV Architecture-IV  Replication and Consistency  Chain replication: strong consistency. 10 CAS – ICT – Storage System Group

FAWN-KV Architecture-V  Joins and Leaves  Joins:  Key range split;  Data transmission, new vnode should get a copy of the key range;  Update the front-end to valid the new vnode for requests;  Free the space of the vnode witch down from the chain. 11 CAS – ICT – Storage System Group

FAWN-KV Architecture-VI  Phase 1: Datastore pre-copy  E1 sends C1 a copy of the datstore log file.  Phase 2: Chain insertion, log flush and play-forward  Update each node’s neighbor state to add C1 to the chain;  Ensure any in-flight updates sent after the phase 1 completed are flushed to C1. 12 CAS – ICT – Storage System Group

FAWN-DS-I  FAWN-DS  Log-structured key-value store;  Using a in-DRAM hash table to map keys to an offset in the append-only Data Log on flash. i bit 15 bit flash DRAM keyFrag index 160- bit key Log Entry hashtable Key Len Data … 13 15 14 0 Data Log delete valid keyFrag 2 i buckets Inserted values Fragment pnt are appended Offset 13 CAS – ICT – Storage System Group

FAWN-DS-II  Back-end Interface:  Get(key, key_len, &data);  Delete(key, key_len);  Insert(key, key_len, data, length).  Key step of the above:  Find the correct bucket of the key in the Hash index. How to map the key to hash index? 2 160 to 2 i ? 14 CAS – ICT – Storage System Group

FAWN-DS-III  Conflict chain: depth = 8.  Different hash functions: three funcs. h1(key) h2(key) h3(key) … … 15 CAS – ICT – Storage System Group

FAWN-DS-IV  Maintenance: Split, Merge, Compact  Split: triggered by a node addition. H A G B F C D 16 CAS – ICT – Storage System Group

Nodes Stream Data Range-I  Create new Datastore A(dsA);  Scan Datastore B(dsB) and transfer the data in rang A to dsA. Datastore list Scan and split dsB Concurrent inserts dsA 17 CAS – ICT – Storage System Group

Nodes Stream Data Range-II  Create new Datastore A(dsA);  Scan Datastore B(dsB) and transfer the data in rang A to dsA. Datastore list Scan and split dsB unlock lock Concurrent inserts dsA 18 CAS – ICT – Storage System Group

Evaluation  Evaluation Items:  K/V lookup efficiency comparison;  Impact of Ring Membership Changes;  TCO analysis for random read.  Evaluation Hardware:  AMD Geode LX processor, 500MHz;  256 MB DDR SDRAM, 400MHz;  100Mbit/s Ethernet;  4GB Sandisk Extreme IV CF. 19 CAS – ICT – Storage System Group

K/V Lookup Efficient Comparison-I  FAWN-based system over 6x more efficient than the other traditional systems 20 CAS – ICT – Storage System Group

K/V Lookup Efficient Comparison-II 21 CAS – ICT – Storage System Group

Impact of Ring Membership Changes-I 22 CAS – ICT – Storage System Group

Impact of Ring Membership Changes-II 23 CAS – ICT – Storage System Group

TCO Analysis for Random Read-I  TCO = Capital Cost + Power Cost ($0.1/kWh) 24 CAS – ICT – Storage System Group

TCO Analysis for Random Read-II  How many nodes are required for a cluster? 25 CAS – ICT – Storage System Group

TCO Analysis for Random Read-III 26 CAS – ICT – Storage System Group

Related Work  Hardware architecture:  Pairing an array of flash chips and DRAM with low- power CPUs for low-power data intensive computing.  File systems for Flash:  Several file systems, such as JFFS2, are specialized for use on flash.  High-throughput Storage and Analysis:  Some systems like Hadoop, provide bulk throughput for massive datasets with low selectivity. 27 CAS – ICT – Storage System Group

Conclusions  FAWN architecture reduce energy consumption of cluster computing.  FAWN-KV address the challenges of wimpy nodes for a key-value store:  Log-structured , memory efficient datastore;  Efficient replication;  Meets the energy efficiency and performance goals. 28 CAS – ICT – Storage System Group

Acknowledgment  Article Understanding ：  Prof. Xiong  Fengfeng Pan  Zigang Zhang  PPT Production ：  Fengfeng Pan  Biao Ma 29 CAS – ICT – Storage System Group

Thank You! 30 CAS – ICT – Storage System Group

FAWN: A Fast Array of Wimpy Nodes David G. Andersen, Jason Franklin, - PowerPoint PPT Presentation

FAWN: A Fast Array of Wimpy Nodes David G. Andersen, Jason Franklin, Michael Kaminsky * , Amar Phanishayee, Lawrence Tan, Vijay Vasudevan Carnegie Mellon University, * Intel Labs SOSP09 1 CAS ICT Storage System Group Outline

FAWN - a Fast Array of Wimpy Nodes Tomasz Dubrownik University of Warsaw January 12, 2011

FAWN FAST ARRAY OF WIMPY NODES VIRAJ SULE FAWN is a cluster architecture for low-power

FAWN - Fast Array of Wimpy Nodes David G. Andersen et al. Presented by: Ravi Kiran Boggavarapu

CSE 6350 File and Storage System Infrastructure in Data centers Supporting Internet-wide Services

A WIMPy Leptogenesis Miracle Baryogenesis via WIMP freeze-out Brian Shuve with Yanou Cui and

singly linked lists Sept. 18, 2017 1 Recall last lecture: Java array array array array of

A Brief History of Chain Replication Christopher Meiklejohn // @cmeik QCon 2015, November 17th,

Breakfast Menu Breakfast Menu Paper: PopSet Fawn 120g Size: 594 x 420 mm Scale: 40%

Review We can declare an array of any type, even other arrays A 2D array is an array of

Cache Performance 1 C and cache misses (1) int array[1024]; // 4KB array int even_sum = 0,

Habanero Operating Committee January 25 2017 Habanero Overview 1. Execute Nodes 2. Head Nodes

Minimum Number Of Nodes Minimum number of nodes in a binary tree whose height is h. At

Minimum Number Of Nodes Minimum number of nodes in a binary tree whose height is h. At

Minimum Number Of Nodes Minimum number of nodes in a binary tree whose height is h. At

Being a METS Startup Fast Failure; Fast Reward November 2016 Fast Failure; Fast Reward

Ener Energy gy and and Pe Performance Can Can a Wi Wimpy mpy Node Node Cl Clus uster

DU DUNE NE's Hardware Trigger architecture, Su Supern rnova tri rigger Ba Babak k Ab Abi

Flexible Timing Simulation of RISC-V Processors with Sniper Neet eethu B Bal al M Mal ally

Preleminary work in Lyon Florent de Dinechin, Nicolas Brunie Introduction Introduction First

Introduction to Parallel Application Performance Engineering Brian Wylie Jlich Supercomputing

Digital System on Chip (SoC) Computer-Aided Design Flow ELEC 4200 Digital Systems Design

the rdyncall package A n i m p r o v e d f o r e i g n f u n c t i o n i n t e r f a c e f o

Visible Surface Detection (Chapt. 15 in FVD, Chapt. 13 in Hearn & Baker) 1 Given a set

Non Photorealistic Rendering BY DMYTRO TKACHUK COMPUTER GRAPHICS SEMINAR Non photorealistic

FAWN: A Fast Array of Wimpy Nodes David G. Andersen, Jason Franklin, - PowerPoint PPT Presentation

FAWN: A Fast Array of Wimpy Nodes David G. Andersen, Jason Franklin, Michael Kaminsky * , Amar Phanishayee, Lawrence Tan, Vijay Vasudevan Carnegie Mellon University, * Intel Labs SOSP09 1 CAS ICT Storage System Group Outline

FAWN - a Fast Array of Wimpy Nodes Tomasz Dubrownik University of Warsaw January 12, 2011

FAWN FAST ARRAY OF WIMPY NODES VIRAJ SULE FAWN is a cluster architecture for low-power

FAWN - Fast Array of Wimpy Nodes David G. Andersen et al. Presented by: Ravi Kiran Boggavarapu

CSE 6350 File and Storage System Infrastructure in Data centers Supporting Internet-wide Services

A WIMPy Leptogenesis Miracle Baryogenesis via WIMP freeze-out Brian Shuve with Yanou Cui and

singly linked lists Sept. 18, 2017 1 Recall last lecture: Java array array array array of

A Brief History of Chain Replication Christopher Meiklejohn // @cmeik QCon 2015, November 17th,

Breakfast Menu Breakfast Menu Paper: PopSet Fawn 120g Size: 594 x 420 mm Scale: 40%

Review We can declare an array of any type, even other arrays A 2D array is an array of

Cache Performance 1 C and cache misses (1) int array[1024]; // 4KB array int even_sum = 0,

Habanero Operating Committee January 25 2017 Habanero Overview 1. Execute Nodes 2. Head Nodes

Minimum Number Of Nodes Minimum number of nodes in a binary tree whose height is h. At

Minimum Number Of Nodes Minimum number of nodes in a binary tree whose height is h. At

Minimum Number Of Nodes Minimum number of nodes in a binary tree whose height is h. At

Being a METS Startup Fast Failure; Fast Reward November 2016 Fast Failure; Fast Reward

Ener Energy gy and and Pe Performance Can Can a Wi Wimpy mpy Node Node Cl Clus uster

DU DUNE NE's Hardware Trigger architecture, Su Supern rnova tri rigger Ba Babak k Ab Abi

Flexible Timing Simulation of RISC-V Processors with Sniper Neet eethu B Bal al M Mal ally

Preleminary work in Lyon Florent de Dinechin, Nicolas Brunie Introduction Introduction First

Introduction to Parallel Application Performance Engineering Brian Wylie Jlich Supercomputing

Digital System on Chip (SoC) Computer-Aided Design Flow ELEC 4200 Digital Systems Design

the rdyncall package A n i m p r o v e d f o r e i g n f u n c t i o n i n t e r f a c e f o

Visible Surface Detection (Chapt. 15 in FVD, Chapt. 13 in Hearn &amp; Baker) 1 Given a set

Non Photorealistic Rendering BY DMYTRO TKACHUK COMPUTER GRAPHICS SEMINAR Non photorealistic

Visible Surface Detection (Chapt. 15 in FVD, Chapt. 13 in Hearn & Baker) 1 Given a set