g info portal
play

g-INFO portal Doan Trung Tung Aurlien BERNARD, Ana Lucia DA-COSTA, - PowerPoint PPT Presentation

g-INFO portal Doan Trung Tung Aurlien BERNARD, Ana Lucia DA-COSTA, Vincent BLOCH, Thanh-Hoa LE, Yannick LEGRE, Lydia MAIGNE, Jean SALZEMANN, Hong-Quang NGUYEN, Vincent BRETON 1 Outline Introduction Overview of g-INFO


  1. g-INFO portal Doan Trung Tung Aurélien BERNARD, Ana Lucia DA-COSTA, Vincent BLOCH, Thanh-Hoa LE, Yannick LEGRE, Lydia MAIGNE, Jean SALZEMANN, Hong-Quang NGUYEN, Vincent BRETON 1

  2. Outline  Introduction  Overview of g-INFO  Implementation of g-INFO  Conclusions and perspectives 2

  3. Why g-INFO?  H5N1 (avian flu) 262 deaths 436 cases WHO - July 2009 287 deaths 486 cases WHO – March 2010 3

  4. Influenza surveillance  Data collection  Data processing in batch mode  BioHealthBase  General phylogenetic pipelines  NCBI  Specific phylogenetic  LosAlamos pipelines g-INFO: Grid-based  Deployment of International phylogenetic tools on Network for Flu clusters / grids Oservation 4

  5. Global Surveillance Network g-INFO’s overview 5

  6. g-INFO’s goals  Integration of influenza virus data sources into a federation of databases  Automatic phylogenetic pipelines  Specific molecular epidemiology studies 6

  7. Architecture of g-INFO system  Each data provider has its own server(s) to store his data  Data provider export only selected data to a data grid interface server  The data exported is integrated in a common schema on the interface servers  Providers can keep the privilege of granting access rights to their data 7

  8. Architecture of g-INFO system  Epidemiologic pipelines will be deployed on the grid  BLAST  Alignment  Phylogenetic trees  Visualisation  ... and more 8

  9. Phylogenetic workflow g-INFO’s implementation 9

  10. Data collection >ABV25634 MKAILLVLLCAFAATNADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCRLGGIAPLQLG KCNIAGWLLGNPECDLLLTVSSWSYIVETSNSDNGTCYPGDFIDYEELREQLSSVSSFEKFEIFPKTSSW PNHETTRGVTAACPYAGASSFYRNLLWLVKKENSYPKLSKSYVNNKGKEVLVLWGVHHPPTSTDQQSLYQ NADAYVSVGSSKYDRRFTPEIAARPKVRGQAGRMNYYWTLLEPGDTITFEATGNLVAPRYAFALNRGSES GIITSDAPVHDCDTKCQTPHGAINSSLPFQNIHPVTIGECPKYVKSTKLRMVTGLRNIPSIQSRGLFGAI AGFIEGGWTGLIDGWYGYHHQNGQGSGYAADQKSTQNAIDGITNKVNSVIEKMNTQFTVVGKEFNNLERR IKNLNKKVDDGFLDVWTYNAELLVLLENERTLDFHDSNVKNLYEKARSQLRNNAKEIGNGCFEFYHKCDD ACMESVRNGTYDYPKYSEESKLNREEIDGVKLESMMVYQILAIYSTVASSLVLLVSLGAISFWMCSNGSL QCRICI Grid DB FTP NCBI Metadata Sequences Daily Protein, Nucleotide, updates Coding region IDs 10

  11. WISDOM Production Environment 11

  12. Integration of g-INFO into WPE g-INFO Data Manager Job Manager Data collection Job Submitter Amga Data service Task Manager WISDOM Information System MUSCLE Gblocks Amga PhyML BLAST 12

  13. Automatic phylogenetic pipeline Task g-INFO Manager pipeline Job Job g-INFO g-INFO portal database Job Job Wisdom IS Manager 13

  14. Automatic phylogenetic pipeline NCBI > Run daily a phylogenetic workflow on the grid Prepare Data in correct format AMGA Metadata Alignment + Curation Sequences ( Muscle + Gblocks ) Protein, Nucleotide, Coding region IDs Phylogenetic Visualisation tool Analysis ( PhyML ) Grid portal 14

  15. Manual phylogenetic workflow MOTEUR MOTEUR Task desktop web Manager tool services Job Job g-INFO g-INFO portal database Job Job Wisdom IS Manager 15

  16. Workflow execution example 16

  17. Phylogenetic workflow g-INFO portal 17

  18. g-INFO portal  Collaboration among IFI, IOIT and HPC:  IFI: web services to interact between the portal and the system  IOIT: design and develop the portal  HPC: visualization tool  Technologies:  JSF 2.0, Ajax, web services, Java aplet, … 18

  19. g-INFO portal MOTEUR Task web Manager services Job Job g-INFO Intermediate g-INFO portal web services database Job JSF 2.0 Web services Ajax Job JDBC Wisdom IS Manager 19

  20. g-INFO portal – home page 20

  21. g-INFO portal - search 21

  22. g-INFO portal – search results 22

  23. g-INFO portal – search results 23

  24. g-INFO portal – working sessions 24

  25. g-INFO portal – define working session template 25

  26. g-INFO portal – define working session template 26

  27. g-INFO portal – define working session template 27

  28. g-INFO portal – define working session template 28

  29. g-INFO portal – run working session 29

  30. g-INFO portal – run working session Pipeline01 result01 Pipeline02 result02 Input01 Input02 Pipeline03 result03 Input03 WorkingSession Pipeline04 result04 inputN PipelineN resultN 30

  31. g-INFO portal – visualization 31

  32. Conclusions  A success in terms of international collaboration  An example of developping grid application in Vietnam  A complementary service for the public health research community 32

  33. Perspectives  Provide more tools and pipelines  Import other database resources  Improve system’s performance  We are expecting the research community to contribute with more useful tools  Can be applied for other emerging diseases 33

  34. Thank you! 34

Recommend


More recommend