ngs resource broker
play

NGS Resource Broker Presented by Mike Mineter Slides from: Matthew - PowerPoint PPT Presentation

http://www.nesc.ac.uk/training http://www.ngs.ac.uk NGS Resource Broker Presented by Mike Mineter Slides from: Matthew Viljoen, STFC RAL Grid Deployment Group, RAL Grid Deployment Group, RAL http://www.grid-support.ac.uk/ Talk Outline Talk


  1. http://www.nesc.ac.uk/training http://www.ngs.ac.uk NGS Resource Broker Presented by Mike Mineter Slides from: Matthew Viljoen, STFC RAL Grid Deployment Group, RAL Grid Deployment Group, RAL http://www.grid-support.ac.uk/

  2. Talk Outline Talk Outline • Introduction & Background • What is a Resource Broker (RB)? • gLite WMS-LB • Future work 2

  3. Introduction Introduction Grid Deployment Group, RAL • NGS Helpdesk • NGS Helpdesk • Services : CA, MyProxy, GSI-SSHTerm – NGS : BDII, RB, Monitoring etc. NGS : BDII, RB, Monitoring etc. – EGEE : GOCDB, UKI ROC etc. http://www.ngs.ac.uk support@grid-support.ac.uk pp @g pp 3

  4. RB Background RB Background – NGS started in 2004 NGS d i 2004 – GT2.4 based middleware – Meanwhile EGEE deployed new Resource Broker – “Workload Management System” – RB always high on NGS wish-list l hi h GS i h li – gLite RB+UI: pre-production from 2007/03 – Core sites RB-compliant – RB: now available for users 4

  5. RB – What is it (and isn't)… [ ] component to allow users to submit jobs and performs all tasks required to [...] component to allow users to submit jobs and performs all tasks required to • • submit them, without exposing the user to the complexities of the Grid ¹ • An interface to Grid resources • It can: – choose the best resource to run your job • It enables: It enables: – resources to scale transparently load balancing – It is not anything to do with the Storage Resource Broker (SRB)! It is not anything to do with the Storage Resource Broker (SRB)! • • 5

  6. Before Before… Nodes User User User User • Direct interaction with nodes Di t i t ti ith d • Need to know resource addresses, capabilities 6

  7. With a Resource Broker... B k User User U U RB RB RB RB Nodes • User doesn’t care where jobs are run • Faster results • Easier, more scalable – get benefit of new nodes , g 7

  8. NGS Resource Broker NGS Resource Broker • Based on gLite WMS-LB from EGEE B d Lit WMS LB f EGEE • Can send jobs to other grids Can send jobs to other grids • Can be used from: – web portals (P-GRADE, in future NGS Portal) – from a User Interface (UI) by command line – from a User Interface (UI) by command line 8

  9. Major components Enabling Grids for E-sciencE “User “User Input “sandbox” interface interface” interface interface” I f I f Information Information i i Output “sandbox” Service Service Resource Resource Broker Broker Job Submi J Author. A th Job Qu &Authen. Publ uery it Event ish Storage Storage Element Element Logging & Logging & Logging & Logging & Computing Computing Computing Computing Job Status Book Book- -keeping keeping Element Element 9 INFSO-RI-508833

  10. Sim ple Workflow Sim ple Workflow 1. Log onto User Interface box f b 2. Write job description in JDL, + required files. Specify resource requirements 3. Submit job with glite-job-submit <RB chooses best resource matching description> 4. Check status with glite-job-status Waiting → Ready → Scheduled → Running → Done Waiting → Ready → Scheduled → Running → Done 5. Retrieve output with glite-job-output 10

  11. Sam ple JDL file Sam ple JDL file Type = "Job"; Type = "Job"; JobType = "Normal"; JobType = "Normal"; yp yp Executable = "/usr/ngs/GAUSSIAN_G03_C02"; Executable = "/usr/ngs/GAUSSIAN_G03_C02"; StdInput = "/usr/local/applications StdInput = "/usr/local/applications chemistry/gaussian/g03_C02/g03/tests/com/test001.com"; chemistry/gaussian/g03_C02/g03/tests/com/test001.com"; StdOutput = "test001.out"; StdOutput = "test001.out"; StdError = "test001.err"; StdError = "test001.err"; OutputSandbox = {"test001.out", "test001.err"}; OutputSandbox = {"test001.out", "test001.err"}; RetryCount = 4; RetryCount = 4; RetryCount = 4; RetryCount = 4; ShallowRetryCount = ShallowRetryCount = - -1; 1; Requirements = Member("GAUSSIAN_G03_C02", Requirements = Member("GAUSSIAN_G03_C02", other.GlueHostApplicationSoftwareRunTimeEnvironment) && other.GlueHostApplicationSoftwareRunTimeEnvironment) && pp pp ) ) Mds Mds- -Computer Computer- -platform == "i686" ; platform == "i686" ; 11

  12. RB Resource m atching RB Resource m atching • To see the resources matching your JDL: $>glite $>glite-job $>glite $>glite job job list job-list list match list-match match match – –-rank sample.jdl rank sample jdl rank sample jdl rank sample.jdl ****************************************************************** ****************************************************************** COMPUTING ELEMENT IDs LIST COMPUTING ELEMENT IDs LIST COMPUTING ELEMENT IDs LIST COMPUTING ELEMENT IDs LIST The following CE(s) matching your job requirements have been The following CE(s) matching your job requirements have been found: found: *CEId* *Rank* *CEId* *CEId* *CEId* *Rank* *R *R k* k* grid grid- -data.man.ac.uk:2119/jobmanager data.man.ac.uk:2119/jobmanager- -pbs pbs- -router 0 router 0 grid grid- -data.rl.ac.uk:2119/jobmanager data.rl.ac.uk:2119/jobmanager- -lsf lsf- -normal normal - -92160 92160 grid grid- -compute.leeds.ac.uk:2119/jobmanager compute.leeds.ac.uk:2119/jobmanager- -pbs pbs- -router router - -428703 428703 grid grid- -compute.oesc.ox.ac.uk:2119/jobmanager compute.oesc.ox.ac.uk:2119/jobmanager- -pbs pbs- -router router - -4036455 4036455 ****************************************************************** ****************************************************************** 12

  13. gLite-speak glossary gLite speak glossary – WMS-LB ≈ Workload Management System and Logging and Bookkeeping System – CE (Computing Element) ≈ Queue on NGS node – SE (Storage Element) ≈ No equivalent yet on NGS – UI (User Interface) ≈ Machine with client tools installed – Information Service provides up-to-date status of p p resources (BDII) 13

  14. Future work Future work • MPI jobs support • Enable all NGS nodes to be RB-compliant Enable all NGS nodes to be RB compliant • Make NGS Portal RB aware • Upgrade to gLite3.1? WM-Proxy?? (parameter sweeps, U d t Lit 3 1? WM P ?? ( t collective jobs) • Standardize job execution across NGS - role out NGS UEE (Uniform Execution Environment) • Roll out more User Interfaces across UK? 14

  15. Sum m ary Sum m ary • RB – smart & easy way of submitting jobs across RB smart & eas a of s bmitting jobs across resources • User interacts with RB by Portal or UI U i t t ith RB b P t l UI • RB chooses best resource for your job • Great for running existing applications, not ideal for developing your own applications (less interaction for p g y pp ( debugging etc.) 15

  16. Practical Practical • GSISSh to connect to ngsui01.ngs.rl.ac.uk • Follow link in the agenda page (or jump to http://wiki.ngs.ac.uk/index.php?title=Resource Broker http://wiki.ngs.ac.uk/index.php?title Resource_Broker_ Tutorial ) 16

Recommend


More recommend