scientific papers in social sciences
play

Scientific Papers in Social Sciences Alexander Garcia / Philipp Mayr - PowerPoint PPT Presentation

Simple Semantic Enrichment of Scientific Papers in Social Sciences Alexander Garcia / Philipp Mayr / Leyla Jael Garcia Florida State University / GESIS / biotea.ws Outline Motivation What data do we have? Why we are doing this?


  1. Simple Semantic Enrichment of Scientific Papers in Social Sciences Alexander Garcia / Philipp Mayr / Leyla Jael Garcia Florida State University / GESIS / biotea.ws

  2. Outline  Motivation  What data do we have?  Why we are doing this?  What are we doing? What do we aim to achieve?  RDF generation  Metadata and Content  Content enrichment  Consuming and delivering the data  A first approach SWIB 2012, Köln 2 12/4/2012

  3. Motivation What data do we have?  GESIS  Leibniz Institute for the Social Sciences  Support for the research cycle  Journals: ISI, MDA  MDA – Methods, Data, Analysis  Journal for Empirical Social Science Research  Focus on  Survey methodologies  Methods in empirical social research  Open-access, full-text SWIB 2012, Köln 3 12/4/2012

  4. Motivation Why we are doing this?  The World Wide Web  Dissemination infrastructure: Scientific and non- scientific contributions  Information:  Still locked up in discrete documents  Not interconnected, not machine-processable  RDF technology:  Connectivity tissue  But how does it impact to the scientific communication? SWIB 2012, Köln 4 12/4/2012

  5. Motivation What are we doing? What do we aim to achieve?  Question: How can scientific publications be delivered into the Semantic Web?  Our approach  RDF for research articles  Entry point to the Web of Data  Part of the Linked Open Data  Semantic enrichment  Interoperable with online data  Richer user interface  A different read experience  Interconnected with external related elements  Collaborative environment SWIB 2012, Köln 5 12/4/2012

  6. RDF Generation Metadata and Content http://pdfx.cs.man.ac.uk/ MDA PDF MDA XML RDF BIBO Generation Reference Enrichment Metadata+ Content + References SWIB 2012, Köln 6 12/4/2012

  7. RDF Generation Content enrichment Metadata+ Content + References Automatic Manual Annotation Annotation Automatically Annotated RDF Manually Annotated RDF SWIB 2012, Köln 7 12/4/2012

  8. Lessons learnt  Biotea, a similar project on the biomedical domain  XML to RDF works well  RDF annotation works well but … annotators are not perfect  Format is not translated  bold, italics  Modeling tables is not easy  Dictionary – based entity recognition tools works better  This project  PDF to XML is not perfect SWIB 2012, Köln 8 12/4/2012

  9. Consuming and delivering the data  What does it make possible ?  How similar are two articles?  based on concepts  semantic similarity  What articles use this reference in a section with title “Results”?  Which annotation co-occurs more with this “X” annotation?  Which articles include term “A” but not term “B”? SWIB 2012, Köln 9 12/4/2012

  10. Consuming and delivering the data A first approach SWIB 2012, Köln 10 12/4/2012

  11. Consuming and delivering the data A first approach SWIB 2012, Köln 11 12/4/2012

  12.  Contact  Alex García, alexgarciac@gmail.com  Philipp Mayr, philipp.mayr@gesis.org SWIB 2012, Köln 12 12/4/2012

Recommend


More recommend