MORFEO NUBA
http://nuba.morfeo-project.org
Federated Cloud Computing Environment for Malaria Fighting
Vilnius, April 11, 2011
Aurelio Rodriguez, Carlos Fernández, Ruben Díez, Hugo Gutierrez and Álvaro Simón
Project partially funded by the Avanza R+D subprogramme of the Strategic Action on Telecommunications and the Information Society of the Spanish Ministry of Industry, Tourism and Trade. Project number: TSI-020301-2009-30
Outline
Introduction
• Motivation
• About Synergy
• About NUBA
Computer-Aided Drug Design
• Synergy Collaboration Pilots
• Chemical Database
• Database Preparation
Federated Cloud for HPC
• The issue
• Hardware resources
• OpenNebula
• Virtual Clusters
• Network Configuration
• OpenNebula Frontend
• Experiment Results
Conclusions
INTRODUCTION
Motivation
Third world disease. 500 million cases per year. 1.5 – 3 million deaths per year (children below 5!).
Number of cases constantly increasing.
Several therapeutic tools exist, but all of them generate resistance.
Scientists Against Malaria
Virtual Organisation for Drug Discovery
Jeffrey Wiseman
About NUBA
NUBA is an R+D+i project to develop a federated cloud computing platform (Infrastructure as a Service).
The new federated cloud platform will help deploy new Internet business services in an automated way.
New services will be scaled dynamically based on business objectives and performance criteria.
The CESGA team is collaborating to deploy this new cloud infrastructure:
• OpenNebula testbed and infrastructure coordination.
• Cloud infrastructure monitoring and accounting.
• E-IMRT use case (radiotherapy treatment planning on the cloud).
COMPUTER-AIDED DRUG DESIGN
Chemical Database Processing
The chemical database at the University of Cincinnati: Pipeline Pilot.
Generation of all possible isomers. No filtering (looking for pharmacological tools).
The database is provided as an SDFile.
~350K original compounds become ~1.3M molecular entities!
CHALLENGE: Docking 10^6 molecules.
Database Preparation
• SDFile (1.3M UCxxxxxxx entries, 4 GB).
• Open Babel: add hydrogens and convert to a 3D Mol2 file, generating InChI strings and UC codes.
• Scripting (split): 25073 directories, 50 single mol2 files each (see the sketch below).
• ADT (mol2 to pdbqt): each directory ends up with 50 pdbqt files and 50 "vina.conf" files.
Ready for the cloud!
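To illustrate the splitting step, here is a minimal Python sketch. It assumes the combined Mol2 file is a standard multi-molecule mol2 (each entry starting with @<TRIPOS>MOLECULE, with the UC code on the following line) and writes a per-ligand Vina config with placeholder receptor and box parameters; it is not the project's actual script.

```python
#!/usr/bin/env python
# Sketch: split a multi-molecule mol2 file into directories of 50 ligands and
# write one AutoDock Vina config per ligand. The receptor name and box
# parameters below are placeholders, not the values used in the project.
import os

LIGANDS_PER_DIR = 50
VINA_CONF = """receptor = receptor.pdbqt
ligand = {ligand}.pdbqt
center_x = 0.0
center_y = 0.0
center_z = 0.0
size_x = 20.0
size_y = 20.0
size_z = 20.0
"""

def split_mol2(path):
    """Yield (name, text) for every molecule in a multi-mol2 file."""
    block, name = [], None
    with open(path) as fh:
        for line in fh:
            if line.startswith("@<TRIPOS>MOLECULE") and block:
                yield name, "".join(block)
                block = []
            block.append(line)
            if line.startswith("@<TRIPOS>MOLECULE"):
                name = None          # molecule name is on the next line
            elif name is None and line.strip():
                name = line.strip()  # e.g. UC0000001
        if block:
            yield name, "".join(block)

for i, (name, text) in enumerate(split_mol2("database.mol2")):
    dirname = "dir_%05d" % (i // LIGANDS_PER_DIR)
    os.makedirs(dirname, exist_ok=True)
    with open(os.path.join(dirname, name + ".mol2"), "w") as out:
        out.write(text)
    with open(os.path.join(dirname, name + ".conf"), "w") as conf:
        conf.write(VINA_CONF.format(ligand=name))
```

The mol2 files in each directory are then converted to pdbqt with ADT, as in the pipeline above.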
FEDERATED CLOUD FOR HPC
The Issue
Synergy chemical processing needs an HPC/HTC (High Performance / High Throughput Computing) cluster as big as possible to work properly.
These resources are available at the CESGA and FCSCL centers (one center alone is not enough).
Cloud computing solves this issue by joining distributed computing resources so that they work as a single HPC cluster.
Application requirements are not suitable for static computing infrastructures:
• OS requirements.
• Software installation.
• Job management.
This needs a "custom" cluster solution.
Hardware Resources
CESGA (Santiago de Compostela):
• 40 HP ProLiant SL2x170z G6. 2 Intel E5520 (Nehalem), 4 cores per processor, 16 GB RAM.
• 1 HP ProLiant DL160 G6. 2 Intel E5504 (Nehalem), 4 cores per processor, 32 GB RAM.
• 1 HP ProLiant DL165 G6. 2 AMD Opteron 2435, 6 cores per processor, 32 GB RAM.
• 6 HP ProLiant DL180 G6. 2 Intel E5520 (Nehalem), 4 cores per processor. 16 TB of total storage.
FCSCL (León):
• 32 ProLiant BL2x220c. 2 Intel Xeon E5450, 4 cores per processor, 16 GB RAM.
• 800 GB storage (NFS).
OpenNebula
Features:
• VMs can be connected using a pre-defined "Virtual Network".
• VMs can be started using a "golden copy" machine image as reference.
• A different "context" can be defined for each executed VM to modify the original "golden copy" (see the sketch below).
• A scheduling mechanism can be defined to select a specific physical host (based on round robin, host load, etc.).
• It is possible to stop, start, migrate and save VMs.
• An OpenNebula cluster can be used as an HPC cluster (we manage Virtual Clusters, VC, instead of individual Virtual Machines).
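As a minimal sketch of the template mechanism described above (not the NUBA testbed's actual configuration), assuming a hypothetical image named golden-copy, a virtual network named vc-private and a context script /srv/context/init.sh:

```python
#!/usr/bin/env python
# Sketch: an OpenNebula VM template that combines a "golden copy" image, a
# pre-defined virtual network and a per-VM context, submitted with the
# standard "onevm create" CLI command. All names are illustrative only.
import subprocess

# DISK references the golden copy image, NIC the pre-defined virtual network,
# and CONTEXT carries the per-VM data used to customize the copy at boot.
template = """NAME    = "test-vm"
CPU     = 1
MEMORY  = 2048
DISK    = [ IMAGE = "golden-copy" ]
NIC     = [ NETWORK = "vc-private" ]
CONTEXT = [
  HOSTNAME = "test-vm",
  ROLE     = "worker",
  FILES    = "/srv/context/init.sh" ]
"""

with open("/tmp/test-vm.one", "w") as fh:
    fh.write(template)

subprocess.check_call(["onevm", "create", "/tmp/test-vm.one"])
```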
Virtual Clusters
A Virtual Cluster (VC) is used as a group of VMs:
• The VC includes a VM head node.
• Several VM worker nodes are associated with the VC head.
• The VC virtual machines are interconnected using their own network.
VCs are managed using different scripts (see the sketch below):
• make_cluster.sh: creates a new VC (cluster name, network, number of nodes, etc.).
• kill_cluster.sh: deletes a VC (selects a cluster name to destroy).
• make_extra_node.sh: adds cluster nodes.
• delete_n_nodes.sh: deletes a specific number of nodes.
VCs offer:
• Automated network configuration.
• The GE batch system is configured automatically with each VC creation.
• The head node is not affected by the creation or destruction of VC nodes.
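A minimal sketch of what a make_cluster.sh-style driver could look like, building on the template idea of the previous sketch. The template attributes, image and network names are placeholders, and it assumes the context script inside the golden copy registers each node with the head's GE master; the project's real scripts may differ.

```python
#!/usr/bin/env python
# Sketch of a make_cluster.sh-style driver: render one template per VM (head
# plus N worker nodes) and submit each with "onevm create". Image/network
# names are placeholders; GE registration of each node is assumed to be
# handled by the context script baked into the golden copy image.
import subprocess, sys, tempfile

TEMPLATE = """NAME = "{vm}"
CPU = 1
MEMORY = 2048
DISK = [ IMAGE = "golden-copy" ]
NIC  = [ NETWORK = "{network}" ]
CONTEXT = [ HOSTNAME = "{vm}", CLUSTER = "{cluster}", ROLE = "{role}" ]
"""

def launch(cluster, network, role, vm):
    with tempfile.NamedTemporaryFile("w", suffix=".one", delete=False) as fh:
        fh.write(TEMPLATE.format(vm=vm, network=network, cluster=cluster, role=role))
        path = fh.name
    subprocess.check_call(["onevm", "create", path])

def make_cluster(cluster, network, n_nodes):
    launch(cluster, network, "head", cluster + "-head")              # GE qmaster
    for i in range(n_nodes):                                         # GE exec hosts
        launch(cluster, network, "node", "%s-node%02d" % (cluster, i))

if __name__ == "__main__":
    # usage: make_cluster.py <cluster_name> <virtual_network> <n_nodes>
    make_cluster(sys.argv[1], sys.argv[2], int(sys.argv[3]))
```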
Network Configuration
We need a "path" between the resource centers (CESGA and FCSCL).
The OpenNebula server and the physical nodes must have network routing configured.
The VC "head" must have public and private IPs.
VC nodes are connected using a private network.
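As an illustration of the routing requirement, a minimal sketch that adds a static route on a physical host so that local VMs can reach the remote site's VC private subnet; all addresses are hypothetical placeholders, not the real CESGA or FCSCL networks.

```python
#!/usr/bin/env python
# Sketch: configure a static route on a physical host so that the local
# site's VMs can reach the remote site's VC private network. The subnet and
# gateway addresses are hypothetical placeholders.
import subprocess

REMOTE_VC_SUBNET = "10.1.2.0/24"   # private network of the remote site's VC nodes
INTERSITE_GATEWAY = "192.0.2.1"    # router reachable from this host

subprocess.check_call(
    ["ip", "route", "add", REMOTE_VC_SUBNET, "via", INTERSITE_GATEWAY]
)
```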
OpenNebula Frontend
Users can connect to a web page to create or destroy VMs.
Users can also use a private machine repository or store their own OS images.
Experiment Results
Job execution started on August 15 and finished at the end of September.

       Used Cores   Total Execution Time (s)   Total Jobs   Average Job Execution Time (s)   Efficiency (%)
VINA   322          1214530                    25690        3412                             22.4
VSW    64           331016                     191          96390                            86.9

VSW already has an efficient job manager.
Vina: 131 jobs exceeded 12500 s; some jobs took nearly 700000 s.
Vina supports SMP parallelization, but an efficient job-grouping algorithm is needed.
An efficient Vina job manager remains to be developed (see the sketch below).
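As a sketch of the job-grouping idea (an assumption about how such a manager could work, not the tool still to be developed), the script below reuses the hypothetical dir_xxxxx layout from the database preparation sketch and submits one GE job per directory, so each queued job runs its 50 dockings sequentially and the queueing overhead is paid once per group rather than once per docking.

```python
#!/usr/bin/env python
# Sketch of a simple job-grouping strategy for Vina: write one submit script
# per ligand directory that runs all of its dockings sequentially, then queue
# it with qsub. Paths and the vina invocation are illustrative only.
import glob, os, subprocess

SCRIPT = """#!/bin/sh
#$ -N {name}
cd {workdir}
for conf in *.conf; do
    vina --config "$conf" --cpu 1 > "${{conf%.conf}}.log" 2>&1
done
"""

for workdir in sorted(glob.glob("dir_*")):
    name = "vina-" + os.path.basename(workdir)
    script = os.path.join(workdir, "run_batch.sh")
    with open(script, "w") as fh:
        fh.write(SCRIPT.format(name=name, workdir=os.path.abspath(workdir)))
    subprocess.check_call(["qsub", script])
```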
CONCLUSIONS
Conclusions
Cloud computing techniques allow VCs to be tested in a short period of time.
Deploying VCs is faster than installing a physical cluster.
Ad-hoc clusters can be built for different user needs (OS, software, etc.), and their maintenance consumes less manpower and time.
Users can administer their own virtual machines using VCs.
The VC "head" must have public and private IPs.
It is possible to create geographically distributed VCs.
THANK YOU FOR YOUR ATTENTION!
Questions?