Tombolo: Performance Enhancements for Cloud Gateways
Suli Yang, Kiran Srinivasan, Kishore Udayashankar, Swetha Krishnan, Jingxin Feng, Yupu Zhang, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau
Storage is Moving to the Cloud
[Diagram: clients → cloud gateway (NFS servers) → cloud storage]
¡ Cloud storage is widely adopted for its elasticity and agility
¡ Enterprises mostly use it for archival data, but not for expensive primary data
Question
Can cloud gateways support primary enterprise workloads?
Enterprise Workloads
Tier-1 workloads
• Data mining
• Financial databases
• Server virtualization
Tier-2 workloads
• E-mail
• Workgroup files
• Development and test
• File distribution
Tier-3 workloads
• E-mail archive
• File archive
• Backup/DR
What we did
¡ Analyzed two enterprise tier-2 workloads
– Their access patterns work well with cloud gateways
¡ Introduced a new prefetching scheme for cloud gateways
– Leverages I/O history
– Combines sequentiality-based and history-based prefetching
¡ Showed the feasibility of moving tier-2 workloads to the cloud
– Reduced the cache miss ratio to ~6%
– Reduced 90th-percentile tail latency to ~30 ms
Overview
¡ Tier-2 workload characteristics
¡ Prefetching techniques
¡ Evaluation and results
¡ Conclusion
Tier-2 Workload Traces

                 Corporate                             Engineering
Used by          1000 employees in                     500 engineers
                 marketing and finance
Workloads        Office, Access, VM images             Home directories and build data
Dataset Size     3 TB                                  19 TB
Data Read        203.8 GB                              192.1 GB
Data Written     119.9 GB                              87.2 GB
Trace Duration   42 days                               38 days
How big is the working set of data?
Tier-2 Workloads: Working Set Size
[Chart: working set size over time, against dataset sizes of 3 TB (Corp) and 19 TB (Eng)]
Tier-2 workloads have a small working set and can be cached effectively
How predictable are the access patterns?
Tier-2 Workloads: Sequential Run Size
[Chart: distribution of sequential run sizes]
Tier-2 workloads have both sequential and random access patterns
We need a smart prefetching scheme
Prefetching Techniques
Terminology
[Diagram: a run of blocks marked accessed / unaccessed, in cache / to prefetch]
¡ Trigger distance: how close the client may get to the end of the already-prefetched region before the next prefetch is triggered
¡ Prefetch degree: how many blocks are fetched on each prefetch
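The two knobs can be sketched in a toy sequential prefetcher; this is an illustrative model only, and `SequentialStream` and its parameter names are assumptions, not the paper's code:

```python
class SequentialStream:
    """Toy model of one sequential stream with a trigger distance and a
    prefetch degree (illustrative sketch, not the paper's implementation)."""

    def __init__(self, trigger_distance=2, degree=4):
        self.trigger_distance = trigger_distance
        self.degree = degree
        self.prefetched_up_to = -1  # highest block number already in cache

    def on_access(self, block):
        """Return the list of blocks to prefetch after this client access."""
        # Re-trigger once the client is within trigger_distance blocks of
        # the end of the prefetched region.
        if self.prefetched_up_to - block <= self.trigger_distance:
            start = max(block, self.prefetched_up_to) + 1
            to_fetch = list(range(start, start + self.degree))
            self.prefetched_up_to = to_fetch[-1]
            return to_fetch
        return []  # still far from the trigger point; do nothing
```

With a trigger distance of 2 and a degree of 4, accessing block 0 prefetches blocks 1 through 4, and the next prefetch fires only once the client reaches block 2.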
Uniqueness in Cloud Gateways (and the implications)
¡ Long and variable cloud latency:
– dynamically determine the trigger distance
¡ Monetary cost involved:
– reduce prefetch wastage
– dynamically adjust the prefetch degree
Additional complexity and overhead are acceptable given good results
State of the Art: Adaptive Multi-Stream Prefetching [1]
¡ Tracks each identified sequential stream
¡ Adjusts the trigger distance
¡ Adjusts the prefetch degree
Sequential prefetching is not enough. How can we do better?
[1] Gill et al., "AMP: Adaptive Multi-stream Prefetching in a Shared Cache"
History-Based Prefetch
[Figure: probability graph with nodes N1 [15, 25], N2 [26, 30], N3 [75, 90], N4 [0, 1] and edge weights such as 0.3, 0.7, P34, P41]
¡ Leverage I/O history to capture random access patterns
¡ Use a probability graph to represent access history
¡ Traverse the graph to find prefetch candidates
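A minimal sketch of how such a probability graph could be built from the access stream (class and method names are assumptions for illustration):

```python
from collections import defaultdict

class ProbabilityGraph:
    """Toy history graph over block ranges: edge weights estimate the
    probability that range B is accessed right after range A
    (illustrative sketch, not the paper's implementation)."""

    def __init__(self):
        self.follow_counts = defaultdict(lambda: defaultdict(int))
        self.node_counts = defaultdict(int)
        self.last = None  # previously accessed block range

    def record(self, block_range):
        """Feed one access (a (start, end) block range) into the history."""
        if self.last is not None:
            self.follow_counts[self.last][block_range] += 1
        self.node_counts[block_range] += 1
        self.last = block_range

    def probability(self, a, b):
        """Estimated conditional probability of accessing b right after a."""
        if self.node_counts[a] == 0:
            return 0.0
        return self.follow_counts[a][b] / self.node_counts[a]
```

For example, if range [15, 25] is accessed twice and is followed once by [26, 30] and once by [75, 90], both outgoing edges get weight 0.5.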
Challenge: History Graph Too Big
¡ Nodes represent block ranges instead of individual blocks
– Reduces graph size by 99%
¡ Split block ranges based on client accesses
– Allows fine-grained control
¡ Populate the graph only with random accesses
– Reduces graph size by 80%
– Reduces traversal time by 90%
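The range-splitting step can be sketched as follows; `split_range` is an assumed helper name, and the exact splitting policy here is illustrative:

```python
def split_range(node, access):
    """Split block-range `node` around `access` (both inclusive
    (start, end) tuples; `access` must overlap `node`), so the graph can
    track the accessed sub-range at a finer granularity.
    Illustrative sketch, not the paper's implementation."""
    n_start, n_end = node
    # Clip the access to the node's boundaries.
    a_start, a_end = max(access[0], n_start), min(access[1], n_end)
    parts = []
    if n_start < a_start:
        parts.append((n_start, a_start - 1))  # untouched prefix
    parts.append((a_start, a_end))            # the accessed sub-range
    if a_end < n_end:
        parts.append((a_end + 1, n_end))      # untouched suffix
    return parts
```

A node covering blocks 15 through 25, hit by an access to blocks 18 through 20, splits into three nodes so that only the middle one records the access.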
Challenge: Wrongful Prefetch
¡ Balanced expansion instead of BFS or DFS traversal
– Always fetch the blocks most likely to be accessed
¡ Remember wrongfully prefetched and evicted blocks
¡ Use history-based prefetch in conjunction with sequentiality-based prefetch
– Only traverse the graph when the accessed block does not belong to any sequential stream
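Balanced expansion can be sketched with a max-heap keyed on cumulative path probability, so the traversal always expands the most likely candidate regardless of depth; this is a sketch under assumed names, not the paper's code:

```python
import heapq

def balanced_expansion(graph, start, budget):
    """Return up to `budget` prefetch candidates reachable from `start`,
    most likely first. `graph` maps node -> {successor: edge probability}.
    Cumulative probability is the product of edge weights along the path.
    Illustrative sketch of balanced expansion (vs. BFS/DFS)."""
    heap = [(-1.0, start)]        # min-heap on negated cumulative probability
    best = {start: 1.0}           # best cumulative probability seen per node
    visited = set()
    order = []
    while heap and len(order) < budget:
        neg_p, node = heapq.heappop(heap)
        if node in visited or -neg_p < best.get(node, 0.0):
            continue              # already expanded, or stale heap entry
        visited.add(node)
        if node != start:
            order.append(node)
        for succ, p in graph.get(node, {}).items():
            cp = -neg_p * p       # probability of the whole path to succ
            if cp > best.get(succ, 0.0):
                best[succ] = cp
                heapq.heappush(heap, (-cp, succ))
    return order
```

Note how it differs from BFS: a grandchild with cumulative probability 0.63 is fetched before a direct child with probability 0.3.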
Evaluation and Results
Experiment Methodology: Simulation
¡ Replay tier-2 I/O traces
¡ Simulator closely resembles an enterprise storage system
– Log-structured file system
– Caching for data and metadata
– Deduplication engine
¡ Cloud latency distribution drawn from a real cloud backend (S3/CloudFront)
Cache Miss Ratio
[Chart: miss ratios for SEQ, AMP, and GRAPH across cache sizes]
¡ GRAPH consistently outperforms SEQ and AMP
¡ GRAPH captures prefetching opportunities not available to sequential prefetching algorithms
End-to-End I/O Latency

        90th      95th      99th
SEQ     745 ms    1335 ms   2115 ms
AMP     705 ms    1255 ms   2095 ms
GRAPH   33 ms     885 ms    1976 ms

Tail latency: S3 backend, Corp dataset, 90 GB cache
¡ GRAPH reduces tail latency significantly
¡ Good prefetching algorithms can mask cloud latencies even for cache misses
Is It Good Enough?

        90th      95th      99th
SEQ     745 ms    1335 ms   2115 ms
AMP     705 ms    1255 ms   2095 ms
GRAPH   33 ms     885 ms    1976 ms

Tail latency: S3 backend, Corp dataset, 90 GB cache
Modern data centers provide similar guarantees:
• PriorityMeister (2014): 90th-percentile tail latency of 700 ms for an Exchange workload
• Google Cloud (2015): 90th-percentile TTFB (time to first byte) of 52 ms for a VM accessing data hosted in the same region
Question (revisited)
Can cloud gateways support tier-2 enterprise workloads?
Conclusion
¡ Cloud gateways are feasible for tier-2 workloads
¡ The cloud gateway environment is unique: decisions made for traditional storage systems may no longer be valid
¡ Worth re-examining other aspects of cloud gateways
Can cloud gateways support tier-2 enterprise workloads?
        90th      95th      99th
SEQ     745 ms    1335 ms   2115 ms
AMP     705 ms    1255 ms   2095 ms
GRAPH   33 ms     885 ms    1976 ms

• CIFS: tolerates up to 15 seconds of latency in the retrieval path
• PriorityMeister (2014): 90th-percentile tail latency of 700 ms for an Exchange workload
• Google Cloud (2015): 90th-percentile TTFB (time to first byte) of 52 ms for a VM accessing data hosted in the same region
Combine Graph with Sequential Prefetch
¡ If the accessed block belongs to a sequential stream: prefetch sequentially
¡ Otherwise, traverse the graph to find prefetch candidates
¡ Significantly outperforms purely sequential or purely graph-based prefetch
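The dispatch rule above can be sketched as simple glue code; `choose_prefetch` and its stream representation are illustrative assumptions:

```python
def choose_prefetch(block, streams, graph_prefetch):
    """Route a block access to sequential or graph-based prefetch.
    `streams`: mutable list of (start, next_expected) sequential streams.
    `graph_prefetch`: fallback callable returning graph candidates.
    Illustrative sketch of the combined policy, not the paper's code."""
    for i, (start, next_expected) in enumerate(streams):
        if block == next_expected:
            # The access extends a known sequential stream:
            # advance the stream and prefetch sequentially.
            streams[i] = (start, next_expected + 1)
            return ["sequential", block + 1]
    # Random access: consult the history graph instead.
    return ["graph", *graph_prefetch(block)]
```

A real implementation would prefetch more than one block ahead and age out stale streams; the sketch only shows the routing decision.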
Challenge: History Graph Too Big
¡ Use block ranges instead of individual blocks as the unit of access
¡ Balanced expansion: always choose the nodes most likely to be accessed next
– Outperforms BFS or DFS
¡ Set trigger distance and prefetch degree similarly to AMP, but in a graph-aware manner
Probability Graph
[Figure: nodes BR1 [15, 25], BR2 [26, 30], BR3 [75, 90], BR4 [0, 1] with edge weights P12, P13, P34, P41]
¡ Node: a block range (BR) based on client accesses
¡ Edge: <BR1, BR2>, the access pattern of BR1 followed by BR2
¡ Weight: conditional probability of accessing BR2 given an access to BR1
¡ Tier-2 applications require good performance but can tolerate occasional long latencies
– CIFS: tolerates up to 15 seconds of latency in the retrieval path
¡ Modern data centers provide similar guarantees
– PriorityMeister (2014): 90th-percentile tail latency of 700 ms for an Exchange workload
– Google Cloud (2015): 90th-percentile TTFB (time to first byte) of 52 ms for a VM accessing data hosted in the same region
Is this guarantee good enough for tier-2 workloads?
Probability Graph: Traversal
¡ Multiply edge probabilities while traversing
¡ Balanced expansion: always choose the nodes most likely to be accessed next
– Outperforms BFS or DFS
¡ Set trigger distance and prefetch degree similarly to AMP, but in a graph-aware manner
Simulation Setup
¡ Workloads: Corp and Eng traces on a 240 GB dataset
¡ Simulator
Previous Results on Sequentiality-Based Prefetching
[Bar chart: read hit ratio (72%–90%) at 30% cache size for LRU + SEQ, LRU + AMP, SARC + SEQ, SARC + AMP]
First approach: assign likelihood based on probability P
Access Pattern Analysis on Traces
[Two pie charts: access-pattern breakdown into SEQ_ONCE, RAND_ONCE, SEQ_REPEATED, RAND_REPEATED, with and without context info]
Only ~10% of accesses are repeated and random!
Access Pattern Repetition and Cache Hit Ratio

        SEQ_ONCE   RAND_ONCE   SEQ_REPEATED   RAND_REPEATED
TOTAL   21.0%      21.0%       47.0%          10.0%
MISS    12.7%      1.8%        6.8%           0.5%
HIT     1.3%       7.7%        24.6%          5.3%
WRITE   7.8%       11.3%       15.3%          4.8%
Second approach: consider sequentiality when assigning likelihoods

P12 = (# of times BR2 is accessed after BR1) / (# of times BR1 is accessed), if BR2 and BR1 are not sequential
P12 = 1, if BR2 and BR1 are sequential
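The two likelihood rules (empirical conditional probability for non-sequential successors, likelihood 1 when BR2 directly continues BR1) can be sketched as; the function name and sequentiality test are illustrative assumptions:

```python
def edge_likelihood(br1, br2, times_br2_after_br1, times_br1_accessed):
    """Likelihood of prefetching block range br2 after br1.
    br1, br2 are inclusive (start, end) block-range tuples.
    Illustrative sketch of the two rules, not the paper's code."""
    # Sequential successor: br2 starts right where br1 ends.
    if br2[0] == br1[1] + 1:
        return 1.0
    # Non-sequential: empirical conditional probability from history.
    if times_br1_accessed == 0:
        return 0.0
    return times_br2_after_br1 / times_br1_accessed
```

So [26, 30] after [15, 25] always scores 1.0, while a random jump to [75, 90] is weighted by how often that jump was observed.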