Report on the Discovery Informatics Workshop (DIW 2012) Held on - PowerPoint PPT Presentation

http://diw.isi.edu/2012 Report on the Discovery Informatics Workshop (DIW 2012) Held on February 2-3, 2012 in Arlington, VA Yolanda Gil (USC/ISI), co-chair Haym Hirsh (Rutgers U.), co-chair Funded by NSF with grant IIS-1151951

Workshop Participants Cecilia Aragon , U. Washington (interaction Kerstin Kleese van Dam , Pacific Northwest   and visualization) National Laboratory (semantic scientific data management) Phil Bourne , UC San Diego (biology, future Vipin Kumar , U. Minnesota (machine learning and   scientific publications) climate) Elizabeth Bradley , U. Colorado (qualitative Pat Langley , Arizona State U. (computational scientific   reasoning) discovery) Will Bridewell , Stanford U. (machine learning Hod Lipson , Cornell U. (robotics)   and discovery) Huan Liu , Arizona State U. (social computing)  Paolo Ciccarese , Harvard U. (ontologies and  Yan Liu , U. Southern California (data mining and biology)  semantic web) Miriah Meyer , U. Utah (scientific visualization)  Susan Davidson , U. Pennsylvania (databases  Andrey Rzhetsky , U. Chicago (genetics)  and provenance) Steve Sawyer , Syracuse U. (social computing)  Helena Deus , Digital Enterprise Research  Alex Schliep , Rutgers U. (bioinformatics)  Institute Ireland (semantic web) Yolanda Gil , U. Southern California (workflows Christian Schunn , U. Pittsburgh (cognitive science   and semantic web) and discovery) Clark Glymour , Carnegie Mellon U. Nigam Shah , Stanford U. (ontologies and semantic   (philosophy of science, causality) web) Carla Gomes , Cornell U. (constraint reasoning Karsten Steinhaeuser , U. Minnesota (data mining   and sustainability) and climate) Alexander Gray , Georgia Institute of Alex Szalay , The Johns Hopkins U. (astrophysics and   Technology (data mining and astrophysics) citizen science) Haym Hirsh , Rutgers U. (social computing) Loren Terveen , U. Minnesota (interaction and social   Larry Hunter , U. Colorado Denver (natural computing)  Raul E. Valdes-Perez , Vivisimo Inc.  language and biology) David Jensen , U. Massachusetts Amherst (commercialization, knowledge-based discovery)  Evelyne Viegas , Microsoft Research (semantic  (machine learning) computing)

Outline  Motivation for Discovery Informatics  Why now  Possible Grand Challenges in Discovery Informatics  Themes in Discovery Informatics  Research challenges  Vision scenarios for several domain sciences

Science Has a Never-Ending Thirst for Technology  Computing is a substrate for science innovation

Data-Intensive Computing in Science

Hallmarks of 21st Century Science  Discovery processes are increasingly complex  Processes remain largely human-driven  Need new approaches to address this complexity  Data has a central role to the detriment of models  Models that predict/explain data are often not in computational form  Need to increase our ability to connect knowledge/models to data  Discovery is an increasingly social endeavor  Ad-hoc collaborations that draw from diverse expertise and skills  Need technologies that can synthesize human abilities in all forms Human cognitive limitations have become a bottleneck

What is Discovery Informatics  Computing advances aimed to identify scientific discovery processes that require knowledge assimilation and reasoning, and to apply principles of intelligent computing and information systems to understand, automate, improve, and innovate any aspects of those processes. • understanding publications, lab notebooks, and other science products • synthesis of models from first principles, hypotheses, or data analysis • dynamic and adaptive design of data analysis methods • design, execution, and steering of experiments • selective data collection • data and model visualization • theory and model revision • collaborative activities that improve data understanding and synthesis • intelligent interfaces for scientists • design of new processes for scientific discovery • computational mechanisms to represent and communicate scientific knowledge

Discovery Informatics: Why Now  Address the human bottleneck  Cognitive limitations, process efficiency  Big data will exacerbate this  “Multiplicative science”: Investments in this area can be leveraged across science and engineering  Address current redundancy in {bio|geo|eco|…}-informatics  Enable lifelong learning and training of future workforce  Will result in usable tools that encapsulate, automate, and disseminate important aspects of state-of-the-art scientific practice  Empower as well as leverage the public  “Personal data” will give rise to “personal science”  I study my genes, my local schools, my backyard’s ecosystem  Harness the efforts of massive numbers of diverse individuals  Students, expert volunteers, aspiring scientists, …

Possible Grand Challenges for Discovery Informatics 1) A Web for scientists  Search engine goes all over diverse open sites  Across all sciences  Each result is Cyclin E � “hyperlinked” to data, models, processes, scientists, etc.  Highlights contradictions  When drilling down, Carbon rates Lake Mendota � specialized tools come up  Easy to reuse and adapt processes Networks with abnormal Katz centrality �

Possible Grand Challenges for Discovery Informatics 2) The Scientist’s Associate  Watches the scientist at work  What he/she did today, last month, last year  Is aware of what others do  Makes connections  Suggests:  “I brought you an article that contradicts your results”  “I run your experiment with another dataset I found and result supports your theory”  “Would you want to try a method that was published last week and is applicable to your data?”

Possible Grand Challenges for Discovery Informatics 3) “Movie credits” for Science  Social tools that take goals, find Director resources/expertise, shepherd Barbara Jones Executive producer subactivities Sandeep Jain  Dynamically assembled from Producers Matthew Gaines and Li Cheng scratch, as if we were producing a Director’s assistant movie …  All forms of skills Special effects crew …  Reputation comes from the quality Crane engineer of work/tools/capabilities … Casting  Support big/medium/small … Actors science …  “Big studio”/“Indie”/“Home” movies

Discovery Informatics: Emerging Themes 3 1 Social Computational computing support of the for discovery discovery process 2 Data and models

THEME 1: Computational Support of the Discovery Process  Unprecedented complexity of scientific enterprise  Science is stymied by human-managed processes What aspects of the process could be improved

Computational Support of the Discovery Process Many Opportunities for Improvement Make assumptions through Design the experiment (or study)   background knowledge (combination  Identify controls of existing knowledge) via  Inventory materials/  Literature equipment Workflow Knowledge  Data  Protocols Systems Bases  Collaboration  Statistics, comp tools Internalization -> idea(s) Execute the experiment (or study)    Get funding Consider the importance/novelty/   Adaptive /real time feasibility/cost/risk of the idea(s) experimentation Formulate testable hypothesis(s)   Integrative interpretation Make consistent/validate with/  Analyze/explore/validate the data  against existing knowledge Interpreting the results  Visualization  Collaborative analysis Provenance standards Putting the results in context  Communicating and  Prioritizing the next thing 

Computational Support of the Discovery Process State of the Art  Knowledge bases created from publications  Ontological annotations of articles including claims and evidence  Text mining to extract assertions to create knowledge bases  Reasoning with knowledge bases to suggest or check hypotheses  Workflow systems to dynamically configure data analysis  Make process explicit and reproducible  Shared repositories of reusable workflows  Augmenting scientific publications with workflows  Emerging provenance standards (OPM, W3C’s PROV)  Record relations among process steps, sources, data, agents  Visualization  3 separate fields: scientific visualization, information visualization, and visual analytics  “design studies”  Combining visualizations with other data

Discoveries through Automated Synthesis and Assisted Analysis of Scientific Publications with Hanalyzer [Hunter, U. Colorado] Text extraction from publications Semantic integration of biomedical databases

Report on the Discovery Informatics Workshop (DIW 2012) Held on - PowerPoint PPT Presentation

http://diw.isi.edu/2012 Report on the Discovery Informatics Workshop (DIW 2012) Held on February 2-3, 2012 in Arlington, VA Yolanda Gil (USC/ISI), co-chair Haym Hirsh (Rutgers U.), co-chair Funded by NSF with grant IIS-1151951 Workshop

Anne Neumann (DIW Berlin, University Potsdam) Maria Nieswand (DIW Berlin) Torben Schubert

UNESCO Discovery Centre reference image of education space UNESCO Discovery Centre Discovery

Welcome to Summer-School 2018 at SOEP/DIW Berlin InGRID AN INTEGRATING EUROPEAN SOCIAL

Panel data estimation and forecasting Christopher F Baum Boston College and DIW Berlin NCER,

Automation and Programming with Stata Christopher F Baum Boston College and DIW Berlin NCER,

Ado-file programming Christopher F Baum Boston College and DIW Berlin NCER, Queensland

Models for Count Data and Categorical Response Data Christopher F Baum Boston College and DIW

Mata Programming II Christopher F Baum Boston College and DIW Berlin NCER, Queensland University

Decomposing lifetime income inequality Peter Haan (DIW Berlin and Free University Berlin) Daniel

Mata Programming I Christopher F Baum Boston College and DIW Berlin NCER, Queensland University

Factor Variables and Marginal Effects in Stata 11 Christopher F Baum Boston College and DIW

Informatics BioMedical Informatics Imaging Informatics Richard H. Wiggins, III, MD, CIIP,

From Search to Discovery in our Future Library From Search to Discovery W e see a spectrum of

Watson Discovery Spring 2020 Discovery pipeline Using NLU, document conversion, and UI tools

Tunnel End-point Discovery Tunnel End-point Discovery draft-palet-v6ops-tun-auto-disc-03.txt

VPN Discovery VPN Discovery Design Team Discussions and Options Design Team Discussions and

A Scrutiny Committee update on: Street Cleansing Waste and Recycling 11 th October 2016 Street

Overview and Scrutiny Committee: Ambition 2020 review Key Line of Enquiry four: a sustainable

Impact of climate change on plant growth and nutrition -Small Study Group 2018- Lukas

Markham Parks Acquisition, Development and Improvement Plan Development Services Committee

Devon Authorities Strategic Waste Committee An industry view of the Governments Resource and

Contract with Africa Development strategy of CAF Foundation and Pillars of Development

A utotransporters are a large superfamily of virulence factors several other cases deletion of

What we know so far: Evening economy Key Evening Economy Pubs/Bars Night Clubs Resturants

Sambuz

Useful Links

Newsletter

Mail Us