Preserving Linked Data on the Semantic Web by the application of Link Integrity techniques from Hypermedia
Rob Vesse, Wendy Hall and Les Carr
{rav08r,wh,lac}@ecs.soton.ac.uk
27 April 2010

Link Integrity
• Aims to ensure that a Link is valid
  ● Link is dereferenceable and goes to the intended content
• Semantic Web introduces additional issues
  ● Co-reference
  ● Identity & Meaning
• Two main types of Solution
  ● Prevention & Maintenance
  ● Recovery

Link Integrity in Hypermedia
• Open Hypermedia
• Robust Hyperlinks (Phelps & Wilensky 2004)
• Opal (Harrison & Nelson 2006)
• Replication & Versioning
• Community of Agents (Moreau & Gray 1998)
• RepWeb (Veiga & Ferreira 2003)
• Memento (Van de Sompel et al 2009)

Link Integrity for the Semantic Web
• Co-reference/Identity
  ● CRS (Jaffri et al 2007) – Compute co-references and republish
  ● Okkam (Bouquet & Stoermer 2008) – Standardise URIs across applications
• Maintenance
  ● Silk Framework (Volz et al 2009) – Compute links between datasets based on similarity metrics
  ● DSNotify (Haslhofer & Popitsch 2009) – Monitors datasets to spot and repair broken links

Applying Recovery to the Semantic Web
• Useful data sources for recovery already available
  ● Sindice Cache
  ● Data Warehouses e.g. LOD Cloud, Uberblic.org
  ● ‘Authoritative’ linking hubs e.g. DBPedia
  ● Co-reference services e.g. SameAs.org
• Possible to exploit the heavy interlinking of the Semantic Web

Exploiting Interlinking – What if DBPedia disappeared?
• Lots of other datasets refer to its URIs
• Use these linkages to find relevant data to replace the lost data
• owl:sameAs and rdfs:seeAlso are useful links to follow
• DESCRIBE against other datasets' SPARQL endpoints also useful for recovering data
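A minimal sketch of the idea on this slide, assuming the triples held by other datasets are already available as plain Python tuples. The `example.org` URIs and the helper names `alternative_uris` and `describe_query` are hypothetical and are not taken from the presentation; the sketch only builds the DESCRIBE query text and does not contact any endpoint.

```python
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
RDFS_SEE_ALSO = "http://www.w3.org/2000/01/rdf-schema#seeAlso"
FOLLOW = {OWL_SAME_AS, RDFS_SEE_ALSO}


def alternative_uris(lost_uri, triples):
    """Return URIs connected to lost_uri by owl:sameAs or rdfs:seeAlso.

    Links are followed in both directions, because another dataset may
    point at the lost URI rather than the other way round.
    """
    found = set()
    for s, p, o in triples:
        if p not in FOLLOW:
            continue
        if s == lost_uri:
            found.add(o)
        elif o == lost_uri:
            found.add(s)
    return sorted(found)


def describe_query(uri):
    """Build the text of a SPARQL DESCRIBE query for one URI."""
    return f"DESCRIBE <{uri}>"


# Hypothetical triples published by datasets other than DBPedia.
lost = "http://dbpedia.org/resource/Southampton"
other_datasets = [
    ("http://example.org/places/Southampton", OWL_SAME_AS, lost),
    (lost, RDFS_SEE_ALSO, "http://example.org/geo/soton"),
    ("http://example.org/places/Southampton",
     "http://www.w3.org/2000/01/rdf-schema#label", "Southampton"),
]

for uri in [lost] + alternative_uris(lost, other_datasets):
    print(describe_query(uri))
```

Each generated query could then be sent to whichever SPARQL endpoints are still reachable; which endpoints to use is left open here.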

Expansion Algorithm
• In essence a crawler which follows links and uses user-definable data sources to discover linked data about a URI
• Works even if the URI itself is unresolvable
• User can define data sources and services to use using a simple RDF vocabulary
  ● voID with a couple of additions to control the algorithm
  ● Otherwise defaults to Sindice Cache, DBPedia and SameAs.org
• Trivially parallel => easily scalable
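The crawl described above might look roughly like the following breadth-first sketch. It is an illustration under stated assumptions, not the authors' implementation: data sources are modelled as in-memory lookup functions, the `max_depth` limit and the `source|uri` graph naming are invented for the example, and the voID-based configuration is not modelled at all.

```python
from collections import deque

# Link predicates to follow (prefixed names used for brevity).
FOLLOW = {"owl:sameAs", "rdfs:seeAlso"}


def expand(start_uri, sources, max_depth=2):
    """Collect triples about start_uri by querying every source and
    following owl:sameAs / rdfs:seeAlso links up to max_depth hops.

    sources maps a source name to a function taking a URI and returning
    a set of (s, p, o) triples, or None when it knows nothing.
    Returns a dict mapping a graph name to the triples retrieved there.
    """
    dataset = {}
    seen = {start_uri}
    queue = deque([(start_uri, 0)])
    while queue:
        uri, depth = queue.popleft()
        for name, lookup in sources.items():
            triples = lookup(uri)
            if not triples:
                continue
            # Hypothetical graph naming: source name plus looked-up URI.
            dataset[f"{name}|{uri}"] = set(triples)
            if depth >= max_depth:
                continue
            for s, p, o in triples:
                if p in FOLLOW:
                    for linked in (s, o):
                        if linked not in seen:
                            seen.add(linked)
                            queue.append((linked, depth + 1))
    return dataset


# Hypothetical in-memory stand-ins for a cache, a co-reference service
# and a linking hub. Note that nothing here resolves ex:lost itself.
cache = {"ex:lost": {("ex:lost", "rdfs:label", "Lost thing")}}
coref = {"ex:lost": {("ex:lost", "owl:sameAs", "ex:mirror")}}
hub = {"ex:mirror": {("ex:mirror", "rdfs:label", "Lost thing (mirror)")}}
sources = {"cache": cache.get, "coref": coref.get, "hub": hub.get}

dataset = expand("ex:lost", sources)
print(sorted(dataset))
```

Because each lookup is independent of the others, the inner loop over sources is the natural place to parallelise, which is consistent with the "trivially parallel" remark on the slide.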

Expansion Algorithm
• Returns an RDF dataset; each URI we retrieve data from has a corresponding named graph in the dataset
• Means consuming applications can discard data from sources they don’t trust or are unaware of
• Allows consuming applications to determine how many sources assert a particular statement
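To illustrate how a consuming application might use the per-source named graphs, here is a small sketch that treats the dataset as a dictionary from graph name to a set of triples. The graph names, the sample triples and the helper names `trusted_only` and `assertion_counts` are assumptions made for the example.

```python
from collections import Counter


def trusted_only(dataset, trusted):
    """Keep only the named graphs whose name is in the trusted set."""
    return {name: triples for name, triples in dataset.items()
            if name in trusted}


def assertion_counts(dataset):
    """Count how many named graphs assert each triple."""
    counts = Counter()
    for triples in dataset.values():
        counts.update(set(triples))
    return counts


# Hypothetical dataset: three graphs, one per source URI.
label = ("ex:thing", "rdfs:label", "Thing")
kind = ("ex:thing", "rdf:type", "ex:Widget")
odd = ("ex:thing", "rdfs:label", "Something else")
dataset = {
    "http://example.org/a": {label, kind},
    "http://example.org/b": {label},
    "http://example.org/c": {label, odd},
}

counts = assertion_counts(dataset)
filtered = trusted_only(dataset, {"http://example.org/a",
                                  "http://example.org/b"})
print(counts[label], sorted(filtered))
```

A statement asserted by several independent graphs could be weighted more heavily than one asserted only once, although how to weight it is a decision for the consuming application.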

Applying Preservation to the Semantic Web
• Provide end users the means to preserve the Linked Data they are interested in
• Allow them to monitor it over time to preserve changes in the data
• View change history of data over time
• Republish the data so other people can use it

All About That (AAT)
• Uses the expansion algorithm to retrieve an RDF dataset about the URI the user wants to preserve
• ‘Smushes’ the dataset to a single graph while preserving data about the sources which assert each triple
• Preserves graphs by transforming the original graph into an annotated form
  ● Used in preference to named graphs because we want to annotate at the triple rather than graph level
  ● Initial data bloat is a trade-off against decreased storage needs over time

All About That (AAT)
• Reification is the basic unit of preservation
• Store when we first and last asserted each triple
• Store source(s) for each Triple
[Figure: Triple transformed and annotated using the AAT Schema]
• Each triple in the RDF Graph to be preserved is transformed into this form
• Transformations of all Triples in a Graph form a named graph in AAT's Triple Store
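The per-triple annotation can be sketched as follows. The actual AAT Schema is not reproduced in this transcript, so the `AnnotatedTriple` record and its field names are hypothetical stand-ins for the three annotations the slide lists: first asserted, last asserted and source(s).

```python
from dataclasses import dataclass, field
from datetime import datetime


@dataclass
class AnnotatedTriple:
    """Hypothetical stand-in for one reified, annotated triple."""
    triple: tuple
    first_asserted: datetime
    last_asserted: datetime
    sources: set = field(default_factory=set)


def smush(dataset, retrieved_at, store=None):
    """Merge a dataset of named graphs into one annotated store.

    dataset maps a source (graph name) to a set of (s, p, o) triples.
    A triple seen before only has its last_asserted time and source
    set updated, so repeated retrievals add no new records for it.
    """
    store = {} if store is None else store
    for source, triples in dataset.items():
        for triple in triples:
            record = store.get(triple)
            if record is None:
                record = AnnotatedTriple(triple, retrieved_at, retrieved_at)
                store[triple] = record
            record.last_asserted = max(record.last_asserted, retrieved_at)
            record.sources.add(source)
    return store


# Two hypothetical retrievals of the same URI, some weeks apart.
A = ("ex:thing", "rdfs:label", "Thing")
B = ("ex:thing", "ex:status", "draft")
t0 = datetime(2010, 4, 1)
t1 = datetime(2010, 4, 27)

store = smush({"g1": {A, B}}, t0)
store = smush({"g1": {A}, "g2": {A}}, t1, store)
print(store[A].sources, store[B].last_asserted)
```

Triple `B` is absent from the second retrieval, so its `last_asserted` stays at the first retrieval time. This is one way to read the slide's point that the initial bloat of annotating every triple is offset by lower storage needs later.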

All About That (AAT)
• Data is monitored over time, allowing Change Reporting and Versioning
• Regularly retrieve the linked data for a URI, compare it against the local annotated data and update
• Compute the changes and express them using the Talis ChangeSet Ontology
• End users can ask to see the data as the system perceived it to be at a given date and time
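A rough sketch of change reporting and point-in-time views, assuming each stored triple carries a (first asserted, last asserted) pair. The `addition` and `removal` keys are chosen to echo the two kinds of change a change set records; the code does not emit ChangeSet RDF, and it makes the simplifying assumption that a triple was asserted continuously between its first and last dates.

```python
from datetime import datetime


def diff(old, new):
    """Return the triples added and removed between two snapshots."""
    return {"addition": new - old, "removal": old - new}


def as_of(records, moment):
    """Return the triples regarded as asserted at the given moment.

    records maps a triple to a (first_asserted, last_asserted) pair.
    """
    return {triple for triple, (first, last) in records.items()
            if first <= moment <= last}


# Hypothetical annotated data built up over three retrievals.
A = ("ex:thing", "rdfs:label", "Thing")
B = ("ex:thing", "ex:status", "draft")
C = ("ex:thing", "ex:status", "final")
d1 = datetime(2010, 3, 1)
d2 = datetime(2010, 4, 1)
d3 = datetime(2010, 4, 27)
records = {A: (d1, d3), B: (d1, d2), C: (d3, d3)}

before = as_of(records, d2)
after = as_of(records, d3)
changes = diff(before, after)
print(changes)
```

Here the view at the second retrieval contains `A` and `B`, the view at the third contains `A` and `C`, and the difference between them reports `C` as an addition and `B` as a removal.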

Future Work
• Produce larger set of experimental results
• Detailed analysis of the effectiveness of the expansion algorithm i.e. precision and recall
• Improving the expansion algorithm
• Integration with term based search
• Integration with other link maintenance frameworks e.g. Silk, DSNotify
• Investigate distributing the algorithm for improved scalability

Questions?
