on the navigability of social tagging systems
play

On the Navigability of Social Tagging Systems Christoph Trattner - PowerPoint PPT Presentation

Graz University of Technology On the Navigability of Social Tagging Systems Christoph Trattner Knowledge Management Institute and Institute for Information Systems and Computer Media Graz University of Technology, Austria e-mail:


  1. Graz University of Technology On the Navigability of Social Tagging Systems Christoph Trattner Knowledge Management Institute and Institute for Information Systems and Computer Media Graz University of Technology, Austria e-mail: ctrattner@iicm.edu web: http://www.austria-lexikon.at/af/User/Trattner%20Christoph In collaboration with: D.Helic, M.Strohmaier, K. Andrews, Ch. Körner Christoph Trattner 2012 1

  2. Graz University of Technology What is a tagging system and what are tags? What is a tagging system? A system that provides the user the possibility to apply tags to resources What are tags? - lightweight keywords (free form vocabulary) - generated by users - for users Christoph Trattner 2012 2

  3. Graz University of Technology Popular examples of tagging systems are… Christoph Trattner 2012 3

  4. Graz University of Technology Tags Christoph Trattner 2012 4

  5. Graz University of Technology Tags Christoph Trattner 2012 5

  6. Graz University of Technology Tags Christoph Trattner 2012 6

  7. Graz University of Technology Why system designers like tags? - Tags add additional meta data to resources for which typically just sparse meta data information exists (such as pictures, movies, etc.) - Trough tags system designers are able to provide the user with simple navigational tools that improve the systems information retrieval properties - Tags are cheap!!! Christoph Trattner 2012 7

  8. Graz University of Technology Why users like tags? - Trough tags users are able to categorize or describe resources - Can find information faster - through personal tags - Can find related content faster - trough related tags Christoph Trattner 2012 8

  9. Graz University of Technology Navigation with Tags Typically tagging systems provide the user the following forms of information retrieval interfaces to navigate content of a tagging system Tag clouds – widely used 1. 2. Tag hierarchies new – hardly any implementations yet Christoph Trattner 2012 Gupta et al. 2010 9

  10. Graz University of Technology How does tag (cloud) based navigation look like? Christoph Trattner 2012 10

  11. Graz University of Technology Questions??? Are Tag Clouds useful for navigation? Christoph Trattner 2012 11

  12. Graz University of Technology Modelling a tag dataset as a graph (1/2) - A tagging dataset is typically modeled as a tripartite hypergraph - V = R U U U T - An annotation is a hyperedge (r, t, u) - A tripartite hypergraph can be mapped onto three bipartite graphs connecting users and resources, users and tags, and tags and resources. Christoph Trattner 2012 12

  13. Graz University of Technology Defining Navigability A network is navigable iff: There is a short path between all or almost all pairs of nodes in the network. Formally: 1. There exists a giant component 2. The effective diameter is low (bounded by log n) J. Kleinberg. The small-world phenomenon: An algorithmic perspective. Proc. 32nd ACM Symposium on Theory of Computing, 2000. Also appears as Cornell Computer Science Technical Report 99-1776 (October 1999) Christoph Trattner 2012 13

  14. Graz University of Technology Navigability: Examples Example 1: Not navigable : No giant component Example 2: Not navigable : giant component, BUT eff.diam: 7 > log 2 (8) Christoph Trattner 2012 14

  15. Graz University of Technology Navigability: Examples Example 3: Navigable : Giant component AND eff.diam: 2 < log 2 (10) Is this efficiently navigable? There are short paths between all nodes, but can an agent or algorithm find them with local knowledge only ? Christoph Trattner 2012 15

  16. Graz University of Technology Efficiently navigable A network is efficiently navigable iff: If there is an algorithm that can find a short path with only local knowledge, and the delivery time of the algorithm is bounded polynomially by log k (n). Example 4: B A C Efficiently navigable, if the algorithm knows it needs to go through A  B  C J. Kleinberg. The small-world phenomenon: An algorithmic perspective. Proc. 32nd ACM Symposium on Theory of Computing, 2000. Also appears as Cornell Computer Science Technical Report 99-1776 (October 1999) Christoph Trattner 2012 16

  17. Graz University of Technology Navigability of Social Tagging Systems (1/2) In general tags form networks which are navigable from a network-theoretic perspective Christoph Trattner 2012 17

  18. Graz University of Technology Navigability of Social Tagging Systems (2/2) . „Hub“ tags Tagging networks are navigable power-law networks. For power law networks, efficient sub-linear decentralised navigation algorithms exist. Christoph Trattner 2012 18

  19. Graz University of Technology But how about User Interface constraints? Tag Cloud Size n topN resources (topN most common algorithm) Pagination of resources / tag k resources shown / page (reverse chronological ordering) Christoph Trattner 2012 19

  20. Graz University of Technology How UI constraints effect Navigability Tag Cloud Size . Pagination Limiting the tag cloud size n to practically feasible sizes (e.g. 5, 10, or more) does not influence navigability (this is not very surprising). BUT : Limiting the out-degree of high frequency tags k (e.g. through pagination with resources sorted in reverse-chronological order) leaves the network vulnerable to fragmentation. This destroys navigability of prevalent approaches to tag clouds. Christoph Trattner 2012 20

  21. Graz University of Technology Questions??? How can we recover the navigability of social tagging systems? Answer: Through resource specific resource list construction! Christoph Trattner 2012 21

  22. Graz University of Technology What is a resource specific resource list ? • A resource specific resource list is a resource list that is not only specific to a particular tag but also to a particular resource in the tagging system • Typically resource lists are calculated as follows Res(t) = {r i (t),…, r n (t)} • Resource specific resource lists are calculated as Res(t,r) = {r i (t,r ),…, r n (t,r)} Christoph Trattner 2012 22

  23. Graz University of Technology Approach: Random Ordering -Instead of reverse-chronological ordering of resources, we apply a random ordering. - On each click on a particular tag a different resource list is generated - Problem: network is not efficiently navigable Better algorithms can easily be envisioned. Christoph Trattner 2012 23

  24. Graz University of Technology Approach: Hierarchical Ordering • Instead of random ordering, we use hierarchical background knowledge for ranking paginated resources [Kleinberg 2001]. • Kleinberg showed that if the nodes of a network can be organized into a hierarchy, then such a hierarchy provides a probability distribution for connecting the nodes in the network. • For such a network a hierarchical decentralized searcher exists that is able to navigate the network in log(n) => the network is efficiently navigable J. M. Kleinberg, “Small - world phenomena and the dynamics of information,” in Advances in Neural Information Processing Systems ( NIPS), 14. MIT Press, 2001, p. 2001. Christoph Trattner 2012 24

  25. Graz University of Technology Approach: Hierarchical Ordering J. M. Kleinberg, “Small - world phenomena and the dynamics of information,” in Advances in Neural Information Processing Systems ( NIPS), 14. MIT Press, 2001, p. 2001. Christoph Trattner 2012 25

  26. Graz University of Technology Problem: Semantic Penalty • Hierarchy was more or less randomly constructed • Does not take semantic similarity between resources into account • Hence, two new approaches were developed • First idea, constructing efficiently navigable tag clouds from structured web content [Trattner 2011] • Second idea, develop an algorithm that is able to construct semantically sound resource hierarchies from tagging data [Trattner 2011a] C. Trattner , D. Helic, M. Strohmaier , “On the Construction of Efficiently Navigable Tag Clouds Using Knowledge from Structured Web Content,” in JUCS, Volume 17, Issue 4, 565-582, 2011. C. Trattner , “Improving the Navigability of Tagging Systems with Hierarchically Constructed Resource Lists and Tag Trails”, in CIT , 2011. Christoph Trattner 2012 26

  27. Graz University of Technology On the construction of efficiently navigable tag clouds from structured web content • Content on the Web not always flat • There are websites that provide a hierarchical structure • Example: Austria-Forum Christoph Trattner 2012 27

  28. Graz University of Technology Austria-Forum - Wiki-based Online encyclopedia system - provides over 200,000 information items about Austria. - differently to Wikipedia, articles in Austria-Forum are published, edited, checked and certified by people who are accepted as experts in particular field - articles are organized hierarchically into categories - categories are addressable via AEIOU Community Wissenssammlungen structured URLs (cf. Open Directory DMOZ) Christoph Trattner 2012 28

  29. Graz University of Technology Austria-Forum Resource Tags Christoph Trattner 2012 29

  30. Graz University of Technology Approach (1/2) 1. Hierarchical Tag Cloud Construction Christoph Trattner 2012 30

  31. Graz University of Technology Approach (2/2) 2. Hierarchical Resource List Construction Christoph Trattner 2012 31

Recommend


More recommend