A Scheduling Framework that Makes any Disk Schedulers - PowerPoint PPT Presentation

A Scheduling Framework that Makes any Disk Schedulers Non-work-conserving solely based on Request Characteristics Yuehai Xu and Song Jiang Department of Electrical and Computer Engineering Wayne State University Cluster and Internet Computing Laboratory Wayne State University

Disk Performance and Workload Spatial Locality � The disk is cost effective with its ever increasing capacity and peak throughput. � The performance with non-sequential access is critical for the disk to be competitive. – Virtual machine environment – Consolidated storage system � The effective performance depends on exploitation of spatial locality. – This locality is usually exploited statically in the request scheduling. – In this work, we exploit it in both space and time dimensions. 2

Quantifying Request Service Time Logical Block Address (LBA) 3

From 1-D Locality to 2-D Locality T 1 = service_time(pending_request) To exploit the locality, usually select LBA LBA minimal T 1 among pending requests. Disk Head T 1 Current Time Time Time 4

From 1-D Locality to 2-D Locality T 1 = service_time(pending_request) T 2 = wait_time (future_request) T 3 = service_time (future_request) � LBA LBA To exploit 1-D locality, select min( T 1 ) among pending requests. T 2 Disk Head � To exploit 2-D locality, select min( T 1 , T 2 +T 3 ) among pending and T 3 future requests with non-work- T 1 conserving scheduling. Current Time Time Time 5

Challenges of Exploiting 2-D Locality T 1 = service_time(pending_request) T 2 = wait_time (future_request) T 3 = service_time (future_request) LBA LBA � Predicting arrival times and locations of T 2 future requests whose T 2 +T 3 < T 1 ; Disk Head � Determining what request history should T 3 be used for the prediction. T 1 T 2 +T 3 < T 1 Current Time Time Time 6

How does anticipatory handle them? � The anticipatory scheduling (AS) groups requests according to their issuing processes. � AS explicitly tracks request arrival times and locations for LBA LBA each process to make a prediction for the next request. Disk Head Current Time Time Time 7

Anticipatory ’s Limitations � Requests in a local disk region may be issued by different processes. � Maintaining/analyzing long history access statistics can be expensive . � The process information may be unavailable ! (VM, SAN, NFS, LBA LBA and PVFS etc.) Disk Head Time Time 8

Related Approaches � Antfarm infers process information in the virtual machine monitor by tracking activities of processes in VMs [ USENIX ATC’06 ]. – Applicable only to VM. – Guest OS needs to be open for instrumentation. � Hints, such as accessed files’ directory or owner, are used for grouping requests in the NFS servers. [ Cluster’08 ]. – Hints may not be always relevant. � The Linux prefetching policy exploits spatial locality by tracking file access for every processes’ opened file. [ Linux Symposium’04] – File abstraction may not be available to the disk schedulers. – Its efficient tracking and decision making mechanisms can be leveraged. 9

Design Goals of Stream Scheduling � Use only request characteristics, i.e., request arrival times and locations – Process information is not required in any way. � Introduce minimal overhead – Remember minimal history access information – Conduct minimal computation in its locality analysis � Integrate seamlessly with any work-conserving schedulers – Designed as a framework to make them non-work-conserving 10

Design of Stream Scheduling � Group requests into streams so that the intra-stream locality is stronger than the inter-stream locality. � Track judicious scheduling decisions rather than locality metrics – Wait or not wait? (future request vs. pending request) – A stream is a sequence of requests for which judicious decisions are “wait”. � A stream is maintained as Linux prefetching does. – A stream is built up or torn down depending on next judicious decision. 11

Stream Scheduling Illustration LBA LBA T 2 +T 3 < T 1 b b d c c a 4 4 2 2 3 3 1 1 Req 1 has its child (Req 2). The stream length increases to two. Time Time Time period serving other requests Arrival of a request Time period serving this request Completion of a request Link showing relationship between parent request and child request

Maintenance of Streams � A stream grows when a completed request sees its child. – Determining existence of a child is independent of actual scheduling. – A stream is established when its length exceeds a threshold. – An established stream leads to non-work-conserving scheduling. � The scheduler stops serving a stream when – the stream is broken; or – the time slice allocated to the stream runs out; or – an urgent request appears. � To maintain a stream, only current stream lengths need to be remembered. – The cost is trivial ! � We have design of stream scheduling for the disk array. – It is described in the paper. 13

Experiment Settings � Software settings – Stream Scheduling (SS) is prototyped in Linux kernel 2.6.31.3 using Deadline as its work-conserving component. – The default stream length threshold is 4 . – The default stream time slice is 124ms . � Hardware settings – Intel Core2 Duo with 2GB DRAM memory. – 7200RPM, 500GB Western Digital Caviar Blue SATA II with a 16MB built-in cache. � Adaptation for NCQ – Disk head position is indicated by the last request sent to the disk. 14

Storage without Process Information par-read grep TPC-H PostMark � � par- -read read : four independent processes, each reading a 1GB file using 4KB requests in par parallel. � � Grep Grep : two grep instances, each searching in a Linux directory tree. � � TPC TPC- -H H : three TPC-H instances, each using PostgreSQL as its database server and DBT3 to create its tables. � � PostMark : four PostMark instances, each creating a data set of 10,000 files. PostMark

Storage without Process Information Service Time (ms) Pending Time (ms) Execution Time (s) Execution Time (s) par- -read read : four independent processes, each reading a 1GB file using 4KB par requests in parallel.

Storage with Inadequate Process Information � � multi- -threads: threads: four processes, each forking two threads for reading files multi with periodic synchronization between them. � � mpi- -io io- -test test: : four mpi-io-test program instances running on PVFS2 where mpi files are striped over eight data servers. � � ProFTPD: : a ProFTPD FTP server on each Xen VM supporting four clients ProFTPD to simultaneously download four 300MB files. � � TPC- -H: H: three TPC-H instances on each Xen VM. TPC

Conclusions � The stream scheduling framework turns any disk scheduler into a non-work-conserving one. – Process information is not required in the scheduling. – Both time and space overheads are low. � The framework can be extended to disk arrays to recover and exploit the locality weakened by file striping. � Experiments on its Linux prototype show significantly improved performance for representative benchmarks. 18

A Scheduling Framework that Makes any Disk Schedulers - PowerPoint PPT Presentation

A Scheduling Framework that Makes any Disk Schedulers Non-work-conserving solely based on Request Characteristics Yuehai Xu and Song Jiang Department of Electrical and Computer Engineering Wayne State University Cluster and Internet Computing

Disk Management Disk Structure Disk Scheduling RAID Disk Block Management

CPSC 410/611: Disk Management Disk Structure Disk Scheduling RAID Disk Block

CPSC 410/611: Disk Management Disk Structure Disk Scheduling RAID

Disk Storage Disk Storage Different types of disk storage: The smallest addressable unit

CPU Scheduling Schedulers Structure of a CPU scheduler Criteria for scheduling

CPSC 410/ 611: Week 9 Disk St ruct ure Disk Scheduling RAI D Disk Block

Brett Ayoob, PSP Best Practices for CPM Schedulers // Introduction The Corporate Teams Plan

CPU Scheduling Schedulers in the OS Structure of a CPU Scheduler Scheduling =

CPU Scheduling Schedulers in the OS Structure of a CPU Scheduler Scheduling = Selection

1 2 Single Disk (a) Side view of a magnetic disk. (b) Top view of a magnetic disk. 3

Today How is data saved in the hard disk? Magnetic disk Disk speed parameters Disk

Chapter 14: Mass-Storage Systems Disk Structure Disk Scheduling Disk Management

Chapter 14: Mass-Storage Systems ! Disk Structure ! Disk Scheduling ! Disk Management ! Swap-Space

Module 13: Secondary-Storage Structure Disk Structure Disk Scheduling Disk Management

Chapter 14: Mass-Storage Systems Disk Structure Disk Scheduling Disk Management

Chapter 14: Mass-Storage Systems ! Disk Structure ! Disk Scheduling ! Disk Management ! Swap-Space

Apache FTP Server integration Yoann Canal - twitter.com/y_canal Sophiacom Agenda Apache FTP

Learn with Enfocus Basic Use Cases in Switch 10 November 18 & 21, 2011 Bert van Rooijen,

TCP Behavior across Multihop Wireless Networks and the Wired Internet Kaixin Xu, Sang Bae, Mario

PWG Quarterly June 2008 Projector & Display Management WG Status Rick Landau Dell, CTO

Energy and Green Computing Energy cost has become a major factor in the total cost of

Downloading data using curl DATA P ROCES S IN G IN S H ELL Susan Sun Data Person What is

Some Internet Measuremen t Thoughts Richard Barnes BBN Technologies RIPE MAT Co - Chair

Dr. Mais Nijim 1 20.07.2010 Motivation Introduction Related Work 2 20.07.2010

Sambuz

Useful Links

Newsletter

Mail Us

A Scheduling Framework that Makes any Disk Schedulers - PowerPoint PPT Presentation

A Scheduling Framework that Makes any Disk Schedulers Non-work-conserving solely based on Request Characteristics Yuehai Xu and Song Jiang Department of Electrical and Computer Engineering Wayne State University Cluster and Internet Computing

Disk Management Disk Structure Disk Scheduling RAID Disk Block Management

CPSC 410/611: Disk Management Disk Structure Disk Scheduling RAID Disk Block

CPSC 410/611: Disk Management Disk Structure Disk Scheduling RAID

Disk Storage Disk Storage Different types of disk storage: The smallest addressable unit

CPU Scheduling Schedulers Structure of a CPU scheduler Criteria for scheduling

CPSC 410/ 611: Week 9 Disk St ruct ure Disk Scheduling RAI D Disk Block

Brett Ayoob, PSP Best Practices for CPM Schedulers // Introduction The Corporate Teams Plan

CPU Scheduling Schedulers in the OS Structure of a CPU Scheduler Scheduling =

CPU Scheduling Schedulers in the OS Structure of a CPU Scheduler Scheduling = Selection

1 2 Single Disk (a) Side view of a magnetic disk. (b) Top view of a magnetic disk. 3

Today How is data saved in the hard disk? Magnetic disk Disk speed parameters Disk

Chapter 14: Mass-Storage Systems Disk Structure Disk Scheduling Disk Management

Chapter 14: Mass-Storage Systems ! Disk Structure ! Disk Scheduling ! Disk Management ! Swap-Space

Module 13: Secondary-Storage Structure Disk Structure Disk Scheduling Disk Management

Chapter 14: Mass-Storage Systems Disk Structure Disk Scheduling Disk Management

Chapter 14: Mass-Storage Systems ! Disk Structure ! Disk Scheduling ! Disk Management ! Swap-Space

Apache FTP Server integration Yoann Canal - twitter.com/y_canal Sophiacom Agenda Apache FTP

Learn with Enfocus Basic Use Cases in Switch 10 November 18 &amp; 21, 2011 Bert van Rooijen,

TCP Behavior across Multihop Wireless Networks and the Wired Internet Kaixin Xu, Sang Bae, Mario

PWG Quarterly June 2008 Projector &amp; Display Management WG Status Rick Landau Dell, CTO

Energy and Green Computing Energy cost has become a major factor in the total cost of

Downloading data using curl DATA P ROCES S IN G IN S H ELL Susan Sun Data Person What is

Some Internet Measuremen t Thoughts Richard Barnes BBN Technologies RIPE MAT Co - Chair

Dr. Mais Nijim 1 20.07.2010 Motivation Introduction Related Work 2 20.07.2010

Sambuz

Useful Links

Newsletter

Mail Us

Learn with Enfocus Basic Use Cases in Switch 10 November 18 & 21, 2011 Bert van Rooijen,

PWG Quarterly June 2008 Projector & Display Management WG Status Rick Landau Dell, CTO