interesting paths shortest paths interesting paths
play

"Interesting" Paths = Shortest Paths? - PowerPoint PPT Presentation

WiSP : Weighted Shortest Paths for RDF graphs Gonzalo Tartari, Aidan Hogan DCC, Universidad de Chile "Interesting" Paths = Shortest Paths? "Interesting" Paths Shortest Paths! (Many of the) Existing Approaches Enumerate


  1. WiSP : Weighted Shortest Paths for RDF graphs Gonzalo Tartari, Aidan Hogan DCC, Universidad de Chile

  2. "Interesting" Paths = Shortest Paths?

  3. "Interesting" Paths ≠ Shortest Paths!

  4. (Many of the) Existing Approaches Enumerate Score Order/Filter Output Paths Paths Paths

  5. (Many of the) Existing Approaches Enumerate Enumerate Score Order/Filter Output Paths Paths Paths Paths

  6. (Many of the) Existing Approaches Enumerate Score Order/Filter Output Paths Paths Paths

  7. Our Approach: Weight Graphs

  8. Wei eigh ghti ting ng gr graphs hs: No Node des

  9. Node Weights: Length (Baseline) ...

  10. Node Weights: Degree ...

  11. Node Weights: PageRank ...

  12. Aside: PageRank / directed graph used ...

  13. Wei eigh ghti ting ng gr graphs hs: Ed Edge ges

  14. Weighting with only nodes

  15. Weighting with only nodes

  16. Edge Weights: Frequency

  17. Wei eigh ghti ting ng gr graphs hs: No Node des + E s + Edg dges es

  18. Node + Edge Weights: Degree + Frequency ...

  19. Node + Edge Weights: PageRank + Frequency ...

  20. Node + Edge Weights: PageRank + Frequency ...

  21. Node + Edge Weights: [0,1] Normalisation ...

  22. Hy Hybrid d No Node de Wei eigh ghts ts

  23. Node Weights: PageRank Visiting one high-centrality node = Visiting thousands of low-centrality nodes ...

  24. Hybrid Node Weights: PageRank + Length ...

  25. Imp mplem emen entat tation on

  26. Weighted Shortest-Path Implementation • Dijsktra's algorithm: – Worst case: Image source: https://github.com/aakash1104/Graph-Algorithms

  27. Ex Exper erimen ments ts

  28. Questions • Performance: – How are the runtimes? – How is the scalability? • Weighting schemes: – How similar are paths for different weightings? – Does weighting help find interesting paths? – Which weighting finds the most interesting paths?

  29. Dataset: Wikidata • Truthy dump: 2017-06-07 – 25 million nodes ( -IRIs only) – 90 million edges

  30. Dataset: Wikidata Slices

  31. Machine • 2 x Intel Xeon Quad Core @1.9GHz • 32 GB of RAM

  32. Weighting Schemes • Node – Degree ( ) – PageRank ( ) – Length ( ) • Node + Edge – Degree + Edge Frequency ( ) – PageRank + Edge Frequency ( ) • Hybrid Node + Edge – Degree + Length + Edge Frequency ( ) – PageRank + Length + Edge Frequency ( )

  33. Pe Perfo forma manc nce

  34. Queries (Node pairs) • Queries: 100 node pairs randomly sampled – From smallest slice ( code < ) – From each slice independently • Task: Return one (best) path

  35. Performance Results (Full Dataset)

  36. Performance Results ( | Various Scales)

  37. Comp mpariso ison n of w f wei eigh ghti ting ng sc sche heme mes

  38. Comparison of path length (full dataset)

  39. How many pairs give the same path? (full dataset)

  40. Us User er Stu tudy dy

  41. Queries: Same type

  42. Queries: Different types

  43. User study • 10 students • 1.6 M dataset • Shown all paths for one query together • Scores: 1 (very poor) - 7 (very good) • 79 complete evaluations – 4 evaluations per query (node pair) – 553 scores

  44. Lowest-rated path mean score 1.25 ( {1,1,1,2} )

  45. Highest-rated path mean score 6.0 ( {5,7} )

  46. Inter-rater agreement • Kendall's τ correlation (ordinal scales) – τ = 0.201 – Slight, positive agreement • Two sets of results – All • τ = 0.201, 20 queries, 79 evaluations – Concordant • Queries with positive τ correlation only • τ = 0.552, 8 queries, 27 evaluations

  47. User study: Comparison of weightings

  48. http://wisp.dcc.uchile.cl Dem emo

  49. WiSP Demo ?

  50. Conc nclus lusion ons

  51. Conclusions • Performance: – How are the runtimes? • A few seconds (1.6 m) to a few minutes (full dataset) – How is the scalability? • Linear (roughly) • Weighting schemes: – How similar are paths for different weightings? | similar; others not so much • – Does weighting help find interesting paths? • Yes! – Which weighting finds the most interesting paths? • No clear winner ( best in most cases)

  52. Future work • Top- k queries • Explore more weightings • Normalisation / combinations • Performance? (Parallelism? Approximation?) • ¡¡¡Evaluation!!!

Recommend


More recommend