on dynamic resource management contents mechanism using
play

On Dynamic Resource Management Contents Mechanism using Control - PDF document

On Dynamic Resource Management Contents Mechanism using Control Theoretic Approach for Wide-Area Grid Computing Introduction Resources management in Grid computing Modeling as feedback control Hiroyuki Ohsaki Dynamic resource


  1. On Dynamic Resource Management Contents Mechanism using Control Theoretic Approach for Wide-Area Grid Computing � Introduction � Resources management in Grid computing � Modeling as feedback control Hiroyuki Ohsaki � Dynamic resource management mechanism DRM-DC � Performance evaluation using simulation Graduate School of � Conclusion Information Science & Technology Osaka University, Japan 1 2 CCA '05 CCA '05 Background Grid Computing: Current and Future Lightweight Messaging Parallel Compiler � Emergence of Grid computing Small – Integrate geographically distributed resources DRM-DC Network Latency Fast Transport Protocol – Enable large-scale scientific computations MPI Fast Messaging Protocol Job/Task Granularity Firewall/Security � Problems in wide-area Grid computing Cluster RPC – Variation in the amount of available resources Computing � Resources are shared by multiple users GridMPI Grid Computing � e.g., computing resources, network, disk space AccessGrid Globus I/O, GridRPC – Transfer delay between sites is non-negligible Data Grid SETI@Home Large � Round-trip time might be > 100ms LAN, Myrinet Testbed Network the Internet � Dynamic resource management is required Small Network Size Large 3 4 CCA '05 CCA '05 Resource Management in Grid GRAM (Grid Resource Allocation and Computing Management) � A resource management component of Globus toolkit � GRAM building blocks – Client � Create job, and ask gatekeeper for its execution – Gatekeeper � Authenticate user, and create job manager – Job manager � Assign job to local resource manager – Local resource manager � Execute job using site's available resource 5 6 CCA '05 CCA '05

  2. Requirements for Resource Problems in Wide-Area Grid Management in Grid Computing Computing � Objectives � Variation in the amount of available resources – Adjust the amount of jobs injected into a site... – Simple open-loop control is insufficient � According to the amount of available resources – Closed-loop control using feedback information is indispensable � Desired characteristics � Transfer delay between sites is non-negligible – Steady-state performance – Difficult to realize stability and fast convergence � High resource utilization , fast job execution – Feedback delay must be taken account of – Transient-state performance � Fast convergence – Stability, robustness, flexibility... 7 8 CCA '05 CCA '05 Research Objectives Modeling Assumptions � Propose dynamic resource management mechanism � Parameter-sweep applications – DRM-DC (Dynamic Resource Management with Delay – Job manager always has jobs to execute Compensator) � Job = multiple tasks – Feedback-based control with a delay compensator – Task granularity is small – Realize high steady-/ transient-state performance � i.e., task execution time < < transfer delay � Performance evaluation of DRM-DC � Transfer delay is constant – Implement DRM-DC in Simgrid simulator – Network between sites is not heavily congested – Comparison of DRM-DC with PI controller – Show DRM-DC's effectiveness in WAN environment 9 10 CCA '05 CCA '05 Modeling Site (Plant) Modeling Job Manager (Controller) � Can be modeled as a single queue � Can be modeled as a controller Site processes assigned tasks one by one Job manager assigns tasks to each site – – Receive tasks from job manager Receive feedback information from sites 1. 1. Queue them in buffer of local resource manager Adjust the amount of task assignments to sites 2. 2. Process tasks in the buffer one by one Continuous-time system model � 3. � Continuous-time system model Input x(t) : the number of waiting tasks in site – Input u(t) : task injection rate from job manager Output u(t) : task injection rate to site – – Output x(t) : the number of tasks in the buffer – 11 12 CCA '05 CCA '05

  3. Dynamic Resource Management Mechanism DRM-DC Block Diagram of DRM-DC � DRM-DC (Dynamic Resource Management with Delay Compensator) � Controller running on job manager – Input: the number of tasks waiting in site – Output: task injection rate to site � DRM-DC control objective – Keep the number of tasks in the buffer at target level � Avoid overload and realize high utilization � PI (Proportional Integral) controller with a delay compensator (Smith Predictor) 13 14 CCA '05 CCA '05 Simulation Scenario: Performance Evaluation Parameter Configuration � Modified a discrete-time simulator Simgrid � Simulation scenarios – Implement discrete-time version of DRM-DC 1. Varying amount of available resources (controls at every constant interval T) Varying transfer delay between job manager and site 2. � Simulation conditions – Parameter-sweep applications – Number of sites: 10 – Network: LAN (1 [ms]), WAN (100 [ms]) � Performance metrics – Queue dynamics (i.e., the number of tasks in buffer) 15 16 CCA '05 CCA '05 Simulation Result (Queue Dynamics Simulation Result (Rise Time, for Varying Available Resources) Overshoot, Settling Time) m= 1000 [MIPS](t< 5) m= 200 [MIPS](5< t< 10) m= 1500 [MIPS](15< t) DRM-DC shows DRM-DC shows DRM-DC shows better significantly shorter smaller overshoot utilization, faster rise time/settling time convergence 17 18 CCA '05 CCA '05

  4. Conclusion and Future Work � Proposed dynamic resources management mechanism – Designed for wide-area Grid computing – Utilize a delay compensator (Smith predictor) � Verified its effectiveness by simulation – Realize high steady-/transient-state performance – Effective in WAN environment with a large delay � Future work – Improvement of stability, robustness, and flexibility – Implementation in Globus toolkit 19 CCA '05

Recommend


More recommend