ict tools for statistical linked open data the opencube
play

ICT Tools for statistical linked open data: the OpenCube toolkit E. - PowerPoint PPT Presentation

ICT Tools for statistical linked open data: the OpenCube toolkit E. Tambouris, E. Kalampokis, K. Tarabanis University of Macedonia and ITI-CERTH, Greece {tambouris, ekal, kat}@uom.gr NTTS 2015, Brussels, 10-12 March 2015 Problem definition


  1. ICT Tools for statistical linked open data: the OpenCube toolkit E. Tambouris, E. Kalampokis, K. Tarabanis University of Macedonia and ITI-CERTH, Greece {tambouris, ekal, kat}@uom.gr NTTS 2015, Brussels, 10-12 March 2015

  2. Problem definition  Open Statistical Data are very important for the EU  Users frequently want to blend & combine statistical data from multiple sources  But, these data usually resides in files and databases ( data silos ) that are hard to combine  Linked Data (LD) technology has the potential to enable combining and performing analytics on top of disparate and previously isolated statistical data  However, relevant tools are few, scattered and un-tested under real-life conditions Potential of using LD in statistical data analysis unexploited 12 March 2015 NTTS 2015, Brussels, 10-12 March 2015 2

  3. The OpenCube project  OpenCube is a 2-year project funded by the EU within FP7  The project aims to develop and test processes and tools for managing statistical linked open data.  The results will:  Facilitate data publishers to create linked data cubes from legacy formats  Empower data users to browse, visualise, link, expand and analyse data cubes.  Enable analysis not possible before ( merging data cubes at a Web scale) 12 March 2015 NTTS 2015, Brussels, 10-12 March 2015 3

  4. Linked Statistical Data Lifecycle  We propose a lifecycle for statistical LD  The lifecycle is divided into two phases: publish and reuse (or consume )  The lifecycle prescribes the steps that raw data cubes* should go through in order to create value.  OpenCube also develops tools to support the whole lifecycle of linked statistical data. * We assume statistical data is organized as data cubes , where each cell contains a measure described based on a number of dimensions . 12 March 2015 NTTS 2015, Brussels, 10-12 March 2015 4

  5. OpenCube Toolkit  Publishing components  Developed using Information Workbench open source as  TARQL extension underlying linked data  D2RQ /R2RML-QB extension management platform  JSON-stat  License scheme  Grafter  OpenCube components are  Consuming components provided under open source  OpenCube Browser licenses  OpenCube MapView  Check http://opencube-toolkit.eu  R Analysis Chart  But, commercial solutions are also  Linking components offered by consortium members 12 March 2015 NTTS 2015, Brussels, 10-12 March 2015 5

  6. Publishing Components 12 March 2015 NTTS 2015, Brussels, 10-12 March 2015 6

  7. Summarize observations Consume: OpenCube browser across a dimension (dimension reduction)  It enables the Change the exploration of an RDF language data cube by presenting a two- dimensional slice of the cube as a table .  The slice is created by setting a fixed values for each dimension that is not presented Change the fixed values in the table. Change the axes of the table 12 March 2015 NTTS 2015, Brussels, 10-12 March 2015 7

  8. Consume: OpenCube MapView  Visualization of RDF data cubes on a map.  It supports:  Markers  Bubble  Choropleth maps 12 March 2015 NTTS 2015, Brussels, 10-12 March 2015 8

  9. Consume: Integration with R  Visualisation of analysis results (charts & tables)  Reuse of analysis results: preserving R output as linked data 12 March 2015 NTTS 2015, Brussels, 10-12 March 2015 9

  10. Consume: Other Visualizations Visualization and Exploration Analytics and Reporting Stock chart 12 March 2015 NTTS 2015, Brussels, 10-12 March 2015 10

  11. Linking Statistical Data  Enables Performing analytics on top of combined data cubes  Steps: 1. Select a data cube 2. Discover cubes on the Web of Linked Data having compatible structure ; i.e. cubes with dimensions, measures etc. that can expand the initial cube 3. Create expanded views of the initial cube 4. Consume the new cube(s) 12 March 2015 NTTS 2015, Brussels, 10-12 March 2015 11

  12. Example: Start with an initial cube 12 March 2015 NTTS 2015, Brussels, 10-12 March 2015 12

  13. Example: Discover & Select compatible cubes 12 March 2015 NTTS 2015, Brussels, 10-12 March 2015 13

  14. Example: Browse an expanded view of the initial cube 12 March 2015 NTTS 2015, Brussels, 10-12 March 2015 14

  15. Conclusions  Open Statistical data are rapidly increasing due to Open Data policies  Linked Data technologies can provide web-scale linking and analysis of statistical data  OpenCube project develops processes and tools for statistical data management  These can be divided into:  Tools for producing linked open statistical data  Tools for linking ( expanding ) open statistical data  Tools for consuming linked open statistical data  Practical use of the tools is possible in the NTTS 2015 satellite workshop session 21B on Linked Statistics (today 18:00-20:00) 12 March 2015 NTTS 2015, Brussels, 10-12 March 2015 15

  16. Questions?  For more information  OpenCube consortium  http://opencube-project.eu  http://opencube-toolkit.eu 12 March 2015 NTTS 2015, Brussels, 10-12 March 2015 16

Recommend


More recommend