Measuring Hyperlink Distances Wikipedia case study Rodrigo R. Paim - PowerPoint PPT Presentation

Oct 25, 2023 •580 likes •811 views

Measuring Hyperlink Distances Wikipedia case study Rodrigo R. Paim Daniel R. Figueiredo Universidade Federal do Rio de Janeiro 2 nd Workshop of Brazilian Institute for Web Science Research Webpages and Hyperlinks Paim & Figueiredo - 2011

Measuring Hyperlink Distances Wikipedia case study Rodrigo R. Paim Daniel R. Figueiredo Universidade Federal do Rio de Janeiro 2 nd Workshop of Brazilian Institute for Web Science Research
Webpages and Hyperlinks Paim & Figueiredo - 2011
The Web Graph Webpages → Vertices Hyperlinks → Directed Edges Campus UFRJ Rio Brazil Paim & Figueiredo - 2011
Hyperlink Distance Webpages have some specific content Can’t get it from structure of the web New concept to analyze “connected webpages” Inversely proportional to contextual similarity d 2 Economic Physics Crisis d 1 < d 2 , d 3 d 1 Greece Europe Maths d 3 Paim & Figueiredo - 2011
Measuring Distances Multiple distance metrics Variation of Jaccard distance IDF-based Keywords play a crucial role Indicate context of a webpage Why to measure distances? Paim & Figueiredo - 2011
Navigating the Web Can one go from any webpage to another using local information only? Image from The Opte Project Website Paim & Figueiredo - 2011
Navigating the Web Car Safety Food Paim & Figueiredo - 2011
Navigating the Web Car Safety Restaurant Food Industry Agriculture Food Paim & Figueiredo - 2011
Navigating the Web Car Food Safety Processing Nestlé Restaurant Food Industry Manufacturing Agriculture Food Paim & Figueiredo - 2011
Navigating the Web Car Food Safety Processing Automotive Nestlé Industry Restaurant Food Automobile Industry Manufacturing Agriculture Food Paim & Figueiredo - 2011
Navigating the Web Car Food Safety Processing Automotive Nestlé Industry Restaurant Food Automobile Industry Manufacturing Agriculture Food Paim & Figueiredo - 2011
Problem Formulation Decentralized greedy algorithm From any u to any v Choose closest hyperlink to destination Local information only Can it reach destination? In few steps Using hyperlink distances Paim & Figueiredo - 2011
Case Study Wikipedia Two major sets D : Documents (articles) C : Categories Keywords of documents Wikipedia web graph Vertices (~ 3.6 M) Edges (~ 100 M) Paim & Figueiredo - 2011
Navigation Algorithms BFS Minimum distance with global knowledge Random Walk Next webpage chosen randomly Greedy Algorithm Only closest neighbor (dead ends) Modified Greedy Closest neighbor (not visited yet) Paim & Figueiredo - 2011
Results
Results 96.45%
Results 16.88%
Results Dead Ends 5.46%
Results 45.27%
Conclusion Greedy performs worse than Random Walk But Modified Greedy performs better Only for big distances However far from optimal (BFS) Categories are not a good “compass” Ongoing work: How to define a better greedy algorithm? Paim & Figueiredo - 2011
Thanks for your attention! Rodrigo R. Paim Daniel R. Figueiredo LAND – PESC/COPPE - UFRJ Measuring Hyperlink Distances: Wikipedia Case Study ACM WebSci'11 (Extended Abstract) Paim & Figueiredo - 2011

Recommend

HART HyperLINK Program APTA Conference May 7, 2018 2 HART HyperLINK Launched the first

HART HyperLINK Program APTA Conference May 7, 2018 2 HART HyperLINK Launched the first transit operated ride-share program in November 2016 Two phased approach Promoting first /last mile solutions to multimodal transit in

765 views • 21 slides

Dr Jeffrey Chow Research Consultant Civic Exchange Distances to public open spaces Distances to

Dr Jeffrey Chow Research Consultant Civic Exchange Distances to public open spaces Distances to public open spaces Distances to public open spaces Builtup areas within 400m Builtup areas within 400m of public open space of public open

127 views • 8 slides

Measuring Distances ASTR/PHYS 4080: Intro to Cosmology Week 6 ASTR/PHYS 4080: Introduction to

Measuring Distances ASTR/PHYS 4080: Intro to Cosmology Week 6 ASTR/PHYS 4080: Introduction to Cosmology Spring 2018: Week 06 1 How do you measure distances when youre too lazy to get off the couch? TV is 6 or 7 years old, new TVs are

544 views • 18 slides

A Sociolinguistic Analysis of Linguistically Sensitive Dialectal Word Pronunciation Distances

Segment distances Dutch dialect distances A Sociolinguistic Analysis of Linguistically Sensitive Dialectal Word Pronunciation Distances Martijn Wieling Martijn Wieling A Sociolinguistic Analysis of Linguistically Sensitive Dialectal Word

629 views • 58 slides

Phylogenetic trees II Estimating distances, estimating trees from distances Gerhard Jger

Phylogenetic trees II Estimating distances, estimating trees from distances Gerhard Jger Words, Bones, Genes, Tools February 28, 2018 Gerhard Jger Distance-based estimation WBGT 1 / 67 Background Background Gerhard Jger

814 views • 68 slides

Metric Distances 28 Great Circle Distances North Pole (90N lat) North Pole C Prime

Metric Distances 28 Great Circle Distances North Pole (90N lat) North Pole C Prime (Meridian) Meridian b a International Dateline (180 lon) Latitude ( y ) Longitude ( x ) (Parallel) ( lon , lat ) = ( x , y ) A ( x 1 , y 1 ) =

666 views • 7 slides

$Geodesic distances and intrinsic distances on some fractal sets Masanori Hino (Kyoto Univ.)$

Geodesic distances and intrinsic distances on some fractal sets Masanori Hino (Kyoto Univ.)

Geodesic distances and intrinsic distances on some fractal sets Masanori Hino (Kyoto Univ.) International Conference on Advances on Fractals and Related Topics Chinese University of Hong Kong, December 11, 2012 1/17 1. Introduction M : a

845 views • 40 slides

How to Switch Slide and make Hyperlink on Button 1. Make some buttons and use for active some

How to Switch Slide and make Hyperlink on Button 1. Make some buttons and use for active some events, switch slide or link to another place is a important function. So, we must be learn this function. First, we created 3 buttons. We can make

410 views • 3 slides

High Efficiency photovoltaic power plants: the III-V compound solar cells G. Gabetta Hyperlink

High Efficiency photovoltaic power plants: the III-V compound solar cells G. Gabetta Hyperlink Contents CESI: who are we? The photovoltaic conversion and the III-V compound semiconductors Solar Cell Manufacturing: MOCVD Solar

671 views • 65 slides

To Randomize or Not To Randomize: Space Optimal Summaries for Hyperlink Analysis Tam as

Fully Personalized PageRank Similarity Search To Randomize or Not To Randomize: Space Optimal Summaries for Hyperlink Analysis Tam as Sarl os, E otv os University and Computer and Automation Institute, Hungarian Academy of

574 views • 22 slides

1998: enter Link Analysis uses hyperlink structure to focus the relevant set combine

1998: enter Link Analysis uses hyperlink structure to focus the relevant set combine traditional IR score with popularity score Page and Brin 1998 Kleinberg Web Information Retrieval IR before the Web = traditional IR IR on the Web =

1.11k views • 78 slides

Making Math Textbooks and Materials with T EX + K ETpic + hyperlink Yoshifumi Maeda Masataka

TUG2013 conference Making Math Textbooks and Materials with T EX + K ETpic + hyperlink Yoshifumi Maeda Masataka Kaneko KAKENHI(24501075) Contents 1. K ETpic framework 2. Features of K ETpic 3. Generation of T EX commands 4.

615 views • 49 slides

Using Transportation Distances for Measuring Melodic Similarity Rainer Typke, Panos Giannopoulos,

Using Transportation Distances for Measuring Melodic Similarity Rainer Typke, Panos Giannopoulos, Remco C. Veltkamp, Frans Wiering, Ren e van Oostrum Utrecht University, Institute of Information and Computing Sciences Center for Geometry,

712 views • 36 slides

Measuring distance 1 What do we know about distances to our nearest neighbours in space? 2

Measuring distance 1 What do we know about distances to our nearest neighbours in space? 2 Our Solar System, image: NASA/JPL The night sky is full of interesting objects. Is it possible to tell which objects are closest to us? What

616 views • 33 slides

Measuring distances between medical entities. Step 1: DrugBank Alberto Olivares-Alarcos Iva

Measuring distances between medical entities. Step 1: DrugBank Alberto Olivares-Alarcos Iva Stankovic Humberto Gonzlez and Horacio Rodrguez Department of Computing Science, Universitat Politcnica de Catalunya, UPC Abstract We

618 views • 3 slides

Measuring Geometrical Distances to Cepheids with II Jeter Hall Fermilab Center for Particle

Measuring Geometrical Distances to Cepheids with II Jeter Hall Fermilab Center for Particle Astrophysics Workshop on Stellar Intensity January 29-30, 2009 Interferometry in Salt Lake City Cosmic Distance Distance Ladder Redshift Variable

252 views • 9 slides

Presentations user guide SmPC training presentations SmPC Advisory Group An agency of the

Presentations user guide SmPC training presentations SmPC Advisory Group An agency of the European Union How to navigate through presentations Follow hyperlinks indicated by hyperlink or (keyboard) for next slide or

230 views • 4 slides

Powerpoint Presentation About Multimedia Tools That Aid Classroom Instruction Usage of Multimedia

Powerpoint Presentation About Multimedia Tools That Aid Classroom Instruction Usage of Multimedia Visual Aids in the English Language Classroom. 1 results it can be resumed that using multimedia visuals as tools in the language. Get free tools

167 views • 3 slides

Jer Thorp Data artist Vancouver Originally trained as a geneticist. An adjunct faculty position

Jer Thorp Data artist Vancouver Originally trained as a geneticist. An adjunct faculty position at New York Universitys Tisch School of the Arts in the Interactive Telecommunica- tions Program. Data Artist in Residence at the New York

366 views • 7 slides

State Renewable Portfolio Standards and Energy Efficiency Resource Standards Laura Furrey, JD,

State Renewable Portfolio Standards and Energy Efficiency Resource Standards Laura Furrey, JD, PE American Council for an Energy-Efficient Economy June 2009 The American Council for an Energy-Efficient Economy (ACEEE ) Nonprofit 501(c)(3)

488 views • 15 slides

DEGREEWORKS Is BSUs official advising tool Lists all requirements that need to be

DEGREEWORKS Is BSUs official advising tool Lists all requirements that need to be completed in order to graduate, which includes: A minimum of 120 credits Completion of degree requirements (core & major requirements) 2.0

262 views • 11 slides

Presentation Guidelines Fonts Use no more than two to three fonts throughout your

Presentation Guidelines Fonts Use no more than two to three fonts throughout your presentation. Do not use ALL CAPS, except for titles. Keep points to six or less per slide. Keep words to six to eight per line. Format Be

432 views • 7 slides

County Funded February 2018 Service Changes January 8, 2018 Regular Board Meeting Mission MAX

County Funded February 2018 Service Changes January 8, 2018 Regular Board Meeting Mission MAX October 8, 2017 Service Change Implemented large scale network service change branded Mission MAX February 25, 2018 Follow-up service

218 views • 9 slides

Materials Testing Materials Testing 3 rd December 2004 FORMAT FORMAT 1. Introduction GH 2.

Materials Testing Materials Testing 3 rd December 2004 FORMAT FORMAT 1. Introduction GH 2. Business Model GH 3. Financial/ KPIs GH 4. Services/ Markets/ Competition a) Materials TDS b) Health Sciences DC c) Engineering &

826 views • 65 slides