intelligent systems for scientific discovery
play

Intelligent Systems for Scientific Discovery Yolanda Gil - PowerPoint PPT Presentation

Intelligent Systems for Scientific Discovery Yolanda Gil Information Sciences Institute and Department of Computer Science University of Southern California http://www.isi.edu/~gil @yolandagil gil@isi.edu USC Information


  1. 
 
 Intelligent Systems for Scientific Discovery 
 Yolanda Gil Information Sciences Institute and Department of Computer Science University of Southern California http://www.isi.edu/~gil @yolandagil gil@isi.edu USC Information Sciences Institute Yolanda Gil gil@isi.edu 1

  2. Data-Intensive Computing in Science USC Information Sciences Institute Yolanda Gil gil@isi.edu 2

  3. Artificial Intelligence and Scientific Discovery Pittsburg Post Gazette Archives USC Information Sciences Institute Yolanda Gil gil@isi.edu 3

  4. Computational Scientific Discovery ■ [Lenat 1976] ■ [Lindsay, Buchanan, Feigenbaum & Lederberg 1980] ■ [Langley & Simon 1981] ■ [Simon et al 1983] ■ [Falkenhainer 1985] ■ [Langley et al 1987] ■ [Kulkarni and Simon 1988] ■ [Cheeseman et al 1989] ■ [Zytkow et al 1990] ■ [Valdes-Perez 1997] ■ [Todorovski et al 2000] USC Information Sciences Institute Yolanda Gil gil@isi.edu 4

  5. http://commons.wikimedia.org/wiki/File:MRI_brain_sagittal_section.jpg http://commons.wikimedia.org/wiki/File:Earth_Eastern_Hemisphere.jpg http://www.nasa.gov/mission_pages/swift/bursts/uv_andromeda.html USC Information Sciences Institute Yolanda Gil gil@isi.edu 5

  6. AI’s Coming of Age RoboCup Soccer Tesla AutoPilot Netfix Recommenders IBM Watson Google Knowledge Graph Apple Siri https://en.wikipedia.org/wiki/Watson_(computer)#/media/File:IBM_Watson.PNG https://en.wikipedia.org/wiki/Siri#/media/File:SirioniOS9.png https://commons.wikimedia.org/wiki/File:Google_Knowledge_Panel.png https://commons.wikimedia.org/wiki/File:13-06-28-robocup-eindhoven-005.jpg http://www.greencarreports.com/news/1100482_tesla-autopilot-the-10-most-important-things-you-need-to-know USC Information Sciences Institute Yolanda Gil gil@isi.edu 6 https://en.wikipedia.org/wiki/Netflix#/media/File:NetflixDVD.jpg

  7. Before There Was the Knowledge Graph… Google Knowledge Graph Linked Data (2012) (2007) USC Information Sciences Institute Yolanda Gil gil@isi.edu 7

  8. Giving Meaning to Hyperlinks on the Web http://www.w3.org/TR/2014/NOTE-rdf11-primer-20140624/ USC Information Sciences Institute Yolanda Gil gil@isi.edu 8

  9. The Semantic Web USC Information Sciences Institute Yolanda Gil gil@isi.edu 9

  10. Data and Ontologies on the Semantic Web <Bob> <is a> <person>. <Bob> <is a friend of> <Alice>. <Bob> <is born on> <the 4th of July 1990>. <Bob> <is interested in> <the Mona Lisa>. <the Mona Lisa> <was created by> <Leonardo da Vinci>. <the video 'La Joconde à Washington'> <is about> <the Mona Lisa>. <Person> <type> <Class> <is a friend of> <type> <Property> <is a friend of> <domain> <Person> <is a friend of> <range> <Person> <is a good friend of> <subPropertyOf> <is a friend of> USC Information Sciences Institute Yolanda Gil gil@isi.edu 10

  11. Interlinked Data and Ontologies in the Semantic Web "Linking Open Data cloud diagram 2014, by Max Schmachtenberg, Christian Bizer, Anja Jentzsch and Richard Cyganiak. http://lod-cloud.net/" USC Information Sciences Institute Yolanda Gil gil@isi.edu 11

  12. Interlinked Data and Ontologies on the Web 2007 2011 2015 Datasets 294 571 3426 Triples 2B 31B 85B Cross-refs 2M 500M 74% of datasets in a weakly connected component FOAF: from 27% to 59% DC: from 31% to 56% http://lod-cloud.net http://stats.lod2.eu USC Information Sciences Institute Yolanda Gil gil@isi.edu 12

  13. Interlinking Scientific Knowledge Taxonomical Networks Simulations Bayesian Mathematical USC Information Sciences Institute Yolanda Gil gil@isi.edu 13

  14. Complexity of Scientific Endeavors USC Information Sciences Institute Yolanda Gil gil@isi.edu 14

  15. Focus: Intelligent Systems for Data Analysis What is the state of the art? What is a good problem to work on? What is a good experiment to design? What data should be collected? What is the best way to analyze the data? What are the implications of the experiments? What are appropriate revisions of current models? What to focus on next? USC Information Sciences Institute Yolanda Gil gil@isi.edu 15

  16. Capturing Scientific Knowledge Data Software Provenance Meta-Workflows Workflows DISK USC Information Sciences Institute Yolanda Gil gil@isi.edu 16

  17. Knowledge about Data: Linked Earth Wiki 
 Work with Julien-Emile Geay of USC and Nick McKay of NAU From: http://www.ncdc.noaa.gov/paleo/metadata/noaa-coral-1865.html AI opportunities: {{ #ask: [[Is a::dataset]] | ?Domain=geochemistry - collection | ?Archive | ?MeasurementMaterial - normalization | ?MeasurementStandard | ?MeasurementUnits}} - organization USC Information Sciences Institute Yolanda Gil gil@isi.edu 17

  18. Linked Data and Linked Knowledge Isotopes Oxygen -16 Quelccaya Ice Cap Quelccaya 20C Ice Core USC Information Sciences Institute Yolanda Gil gil@isi.edu 18

  19. Capturing Scientific Knowledge Data Software Provenance Meta-Workflows Workflows DISK USC Information Sciences Institute Yolanda Gil gil@isi.edu 19

  20. Knowledge about Software: 
 OntoSoft Work with C. Duffy of PSU, C. Mattmann of JPL, S. Peckham of CU, and E. Robinson of ESIP USC Information Sciences Institute Yolanda Gil gil@isi.edu 20

  21. Knowledge About Software: 
 Physical Variables and Assumptions USC Information Sciences Institute Yolanda Gil gil@isi.edu 21

  22. OntoSoft: 
 Comparing Software Implementations PIHM PIHMgis DrEICH TauDEM WBMsed USC Information Sciences Institute Yolanda Gil gil@isi.edu 22

  23. OntoSoft: 
 Publishing Software Metadata as RDF AI opportunities: - functional desc. - organization - linking to data USC Information Sciences Institute Yolanda Gil gil@isi.edu 23

  24. Linked Data and Linked Knowledge Isotopes Oxygen -16 Quelccaya Ice Cap Neotoma Quelccaya 20C Navier-Stokes Ice Core USC Information Sciences Institute Yolanda Gil gil@isi.edu 24

  25. Capturing Scientific Knowledge Data Software Provenance Meta-Workflows Workflows DISK USC Information Sciences Institute Yolanda Gil gil@isi.edu 25

  26. Knowledge about Data Analysis: 
 WINGS Work with V. Ratnakar (USC) DailySensorData ¡ ¡ ¡isa ¡Hydrolab_Sensor_Data ¡ ¡ ¡ ¡siteLong ¡rdf:datatype= “ long” ¡ ¡ ¡siteLa9tude ¡rdf:datatype= “ lat” ¡ ¡ ¡dateStart ¡rdf:datatype= “ date” ¡ ¡ ¡forSite ¡rdf:datatype=”site” ¡ ¡ ¡numberOfDayNights ¡rdf:datatype= “ int” ¡ ¡ ¡avgDepth ¡rdf:datatype=”depth” ¡ ¡ ¡avgFlow ¡rdf:datatype= “ flow” ¡ ¡ ¡ ¡ low O ’ Connor-Dobbins flow med Churchill flow high Owens-Gibbs flow USC Information Sciences Institute Yolanda Gil gil@isi.edu 26

  27. WINGS Dynamically Customizes the 
 Workflow Based on Daily Sensor Readings O’Connor-Dobbins Churchill model Owens-Gibbs model model AI opportunities: - generation - mining - linking to data USC Information Sciences Institute Yolanda Gil gil@isi.edu 27

  28. Describing Execution (Provenance) vs General Method (Workflow) SensorData- SensorData- August2011 TimePeriod AI opportunities: - abstraction - repurposing 23 800 8 5 - assembly Metabolism- Metabolism- August2011 TimePeriod USC Information Sciences Institute Yolanda Gil gil@isi.edu 28

  29. Linked Data and Linked Knowledge Isotopes Oxygen -16 Quelccaya Ice Cap Vegetation Estimates Quelccaya Neotoma 20C Ice Navier-Stokes Core USC Information Sciences Institute Yolanda Gil gil@isi.edu 29

  30. Capturing Scientific Knowledge Data Software Provenance Meta-Workflows Workflows DISK USC Information Sciences Institute Yolanda Gil gil@isi.edu 30

  31. Knowledge about Meta-Processes: 
 DISK DISK Work with P. Mallick (Stanford U) and S. Pierce (UT Austin) Confidence Value = ?n Evidence = { ……. } Springflow Pumping rate ExpectedResponse at ?L2 ?y% up ?x% at ?L1 Input: Simulation models for ?L1 with pumping rate parameter ?x Workflows generate data for springflow at ?L2 by y% USC Information Sciences Institute Yolanda Gil gil@isi.edu 31

  32. DISK: 
 DISK Hypotheses Springflow at Pumping rate up ExpectedResponse Cayuga 50% lower 10% at Kemp 33 groundwater models for Texas USC Information Sciences Institute Yolanda Gil gil@isi.edu 32

  33. DISK: 
 DISK Hypotheses Confidence Value = 0 Evidence = { } Springflow at Pumping rate up ExpectedResponse Cayuga 50% lower 10% at Kemp USC Information Sciences Institute Yolanda Gil gil@isi.edu 33

  34. DISK: 
 DISK Lines of Inquiry Confidence Value = ?n Evidence = { ……. } Springflow Pumping rate ExpectedResponse at ?L2 ?y% up ?x% at ?L1 Input: Simulation models for ?L1 with pumping rate parameter ?x Workflows generate data for springflow at ?L2 by y% USC Information Sciences Institute Yolanda Gil gil@isi.edu 34

  35. DISK: 
 DISK Lines of Inquiry Meta-workflows Confidence Value = ?n Evidence = { ……. } Confidence Springflow Cross-method Pumping rate ExpectedResponse assessment assessment at ?L2 ?y% up ?x% at ?L1 Novel Input: Simulation models for ?L1 Data growth results with pumping rate parameter ?x assessment Workflows generate data for springflow at ?L2 by y% USC Information Sciences Institute Yolanda Gil gil@isi.edu 35

Recommend


More recommend