Words & Pictures: Clustering and Bag of Words
Many slides adapted from Svetlana Lazebnik, Fei-Fei Li, Rob Fergus, and Antonio Torralba

Document Vectors
• Represent a document as a “bag of words”

Bag-of-features models
Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba

Origin: Bag-of-words models
• Orderless document representation: frequencies of words from a dictionary (Salton & McGill, 1983)
• Example: US Presidential Speeches Tag Cloud, http://chir.ag/phernalia/preztags/
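Example (an illustrative sketch, not from the original slides): the orderless representation above can be computed by counting how often each dictionary word occurs in a document. The function name `bag_of_words`, the tiny dictionary, and the sample sentence are assumptions made for this example.

```python
from collections import Counter
import re

def bag_of_words(text, dictionary):
    """Return word counts over a fixed dictionary, ignoring word order."""
    # Lowercase and split into simple alphabetic tokens (a crude tokenizer).
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    # One count per dictionary entry; words outside the dictionary are dropped.
    return [counts[word] for word in dictionary]

# Hypothetical dictionary and document, chosen only for illustration.
dictionary = ["clustering", "image", "of", "words"]
doc = "Bag of words: words of an image, words without order."
vector = bag_of_words(doc, dictionary)
print(dict(zip(dictionary, vector)))
```

Because only frequencies are kept, any reordering of the same words yields the same vector.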

Bags of features for image classification
1. Extract features
2. Learn “visual vocabulary”
3. Quantize features using visual vocabulary
4. Represent images by frequencies of “visual words”

1. Feature extraction
• Regular grid
  – Vogel & Schiele, 2003
  – Fei-Fei & Perona, 2005
• Interest point detector
  – Csurka et al. 2004
  – Fei-Fei & Perona, 2005
  – Sivic et al. 2005
• Other methods
  – Random sampling (Vidal-Naquet & Ullman, 2002)
  – Segmentation-based patches (Barnard et al. 2003)
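Example (an illustrative sketch, not from the original slides): the regular-grid option can be approximated by cutting the image into fixed-size patches and using each flattened patch as a feature vector. The function name `grid_patches`, the patch size, the stride, and the random test image are assumptions; the cited papers use their own descriptors.

```python
import numpy as np

def grid_patches(image, patch_size=8, stride=8):
    """Cut a 2-D grayscale image into square patches on a regular grid.

    Returns an array of shape (num_patches, patch_size * patch_size),
    one flattened patch per row.
    """
    height, width = image.shape
    patches = []
    for top in range(0, height - patch_size + 1, stride):
        for left in range(0, width - patch_size + 1, stride):
            patch = image[top:top + patch_size, left:left + patch_size]
            patches.append(patch.reshape(-1))
    return np.array(patches, dtype=float)

# Hypothetical 32x48 image filled with random intensities.
rng = np.random.default_rng(0)
image = rng.random((32, 48))
features = grid_patches(image)
print(features.shape)
```

Raw pixel patches are the simplest possible descriptor; in practice a descriptor such as SIFT would typically replace the flattening step.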

Detect patches [Mikolajczyk and Schmid ’02] [Matas, Chum, Urban & Pajdla ’02] [Sivic & Zisserman ’03] → Normalize patch → Compute SIFT descriptor [Lowe ’99]
Slide credit: Josef Sivic

… → Clustering → Visual vocabulary
Slide credit: Josef Sivic

Clustering
– The assignment of objects into groups (called clusters) so that objects from the same cluster are more similar to each other than objects from different clusters.
– Often similarity is assessed according to a distance measure.
– Clustering is a common technique for statistical data analysis, which is used in many fields, including machine learning, data mining, pattern recognition, image analysis and bioinformatics.

Any of the similarity metrics we talked about before (SSD, angle between vectors)
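Example (an illustrative sketch, not from the original slides): the two measures named above, sum of squared differences (SSD) and the angle between vectors, written out with NumPy. The function names `ssd` and `angle_between` are chosen for this example.

```python
import numpy as np

def ssd(a, b):
    """Sum of squared differences: smaller means more similar."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(np.sum((a - b) ** 2))

def angle_between(a, b):
    """Angle in radians between two nonzero vectors: smaller means more similar."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    cosine = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
    # Clip to guard against rounding slightly outside [-1, 1].
    return float(np.arccos(np.clip(cosine, -1.0, 1.0)))

print(ssd([1, 0], [0, 1]))
print(angle_between([1, 0], [0, 1]))
```

SSD is sensitive to vector length, while the angle ignores it: `[1, 2]` and `[2, 4]` have a nonzero SSD but an angle of (numerically) zero.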

Feature Clustering
Clustering is the process of grouping a set of features into clusters of similar features.
Features within a cluster should be similar.
Features from different clusters should be dissimilar.

source: Dan Klein

K-means clustering
• Want to minimize the sum of squared Euclidean distances between points x_i and their nearest cluster centers m_k
source: Svetlana Lazebnik
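Example (an illustrative sketch, not from the original slides): a common way to reduce this objective is Lloyd's algorithm, which alternates between assigning each point x_i to its nearest center m_k and moving each center to the mean of its assigned points. The random initialization from data points, the iteration cap, and the two-blob test data are assumptions; the result is a local minimum that depends on the initialization.

```python
import numpy as np

def kmeans(points, k, num_iters=100, seed=0):
    """Lloyd's algorithm: returns (centers, labels, objective)."""
    rng = np.random.default_rng(seed)
    # Initialize centers with k distinct data points chosen at random.
    centers = points[rng.choice(len(points), size=k, replace=False)].copy()
    for _ in range(num_iters):
        # Assignment step: squared distance from every point to every center.
        sq_dists = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = sq_dists.argmin(axis=1)
        # Update step: move each center to the mean of its members.
        new_centers = centers.copy()
        for j in range(k):
            members = points[labels == j]
            if len(members) > 0:  # leave an empty cluster's center in place
                new_centers[j] = members.mean(axis=0)
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    # Final assignment and objective: sum of squared distances to nearest center.
    sq_dists = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    labels = sq_dists.argmin(axis=1)
    objective = float(sq_dists.min(axis=1).sum())
    return centers, labels, objective

# Hypothetical data: two well-separated 2-D blobs of 20 points each.
rng = np.random.default_rng(1)
points = np.vstack([rng.normal(0.0, 0.1, (20, 2)), rng.normal(5.0, 0.1, (20, 2))])
centers, labels, objective = kmeans(points, k=2)
print(labels, objective)
```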

Source: Hinrich Schutze

Hierarchical clustering strategies
• Agglomerative clustering
  • Start with each point in a separate cluster
  • At each iteration, merge two of the “closest” clusters
• Divisive clustering
  • Start with all points grouped into a single cluster
  • At each iteration, split the “largest” cluster
source: Svetlana Lazebnik
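Example (an illustrative sketch, not from the original slides): agglomerative clustering with one possible reading of “closest”, namely single-link distance (the smallest distance between any two members of two clusters). The single-link choice, the stopping rule on a target number of clusters, and the five test points are assumptions.

```python
import numpy as np

def agglomerative(points, num_clusters):
    """Merge the two closest clusters (single link) until num_clusters remain.

    Returns a list of clusters, each a list of point indices.
    """
    # Start with each point in its own cluster.
    clusters = [[i] for i in range(len(points))]
    # Precompute all pairwise Euclidean distances between points.
    diffs = points[:, None, :] - points[None, :, :]
    dist = np.sqrt((diffs ** 2).sum(axis=2))
    while len(clusters) > num_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Single-link distance: closest pair across the two clusters.
                d = dist[np.ix_(clusters[a], clusters[b])].min()
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return clusters

# Hypothetical points: two tight pairs and one isolated point.
points = np.array([[0, 0], [0, 1], [5, 5], [5, 6], [10, 0]], dtype=float)
clusters = agglomerative(points, num_clusters=3)
print(clusters)
```

This brute-force version rescans all cluster pairs at every merge, so it only suits small inputs; it has no random initialization, so repeated runs give the same result.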

Divisive Clustering
• Top-down (instead of bottom-up as in Agglomerative Clustering)
• Start with all docs in one big cluster
• Then recursively split clusters
• Eventually each node forms a cluster on its own.
Source: Hinrich Schutze

Flat or hierarchical clustering?
• For high efficiency, use flat clustering (e.g. k-means)
• For deterministic results: hierarchical clustering
• When a hierarchical structure is desired: hierarchical algorithm
• Hierarchical clustering can also be applied if K cannot be predetermined (can start without knowing K)
Source: Hinrich Schutze

From clustering to vector quantization
• Clustering is a common method for learning a visual vocabulary or codebook
  – Unsupervised learning process
  – Each cluster center produced by k-means becomes a codevector
  – Codebook can be learned on separate training set
  – Provided the training set is sufficiently representative, the codebook will be “universal”
• The codebook is used for quantizing features
  – A vector quantizer takes a feature vector and maps it to the index of the nearest codevector in a codebook
  – Codebook = visual vocabulary
  – Codevector = visual word
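Example (an illustrative sketch, not from the original slides): a vector quantizer as described above, using squared Euclidean distance to find the nearest codevector. The function name `quantize` and the hand-written three-entry codebook are assumptions; in practice the codebook rows would be cluster centers learned by k-means.

```python
import numpy as np

def quantize(features, codebook):
    """Map each feature vector to the index of its nearest codevector."""
    # Squared Euclidean distance from every feature to every codevector.
    diffs = features[:, None, :] - codebook[None, :, :]
    sq_dists = (diffs ** 2).sum(axis=2)
    return sq_dists.argmin(axis=1)

# Hypothetical codebook with three 2-D codevectors (visual words 0, 1, 2).
codebook = np.array([[0.0, 0.0], [1.0, 1.0], [4.0, 0.0]])
features = np.array([[0.1, -0.2], [0.9, 1.2], [3.5, 0.3], [0.4, 0.4]])
words = quantize(features, codebook)
print(words)
```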

Fei-Fei et al. 2005

Sivic et al. 2005

Visual vocabularies: Issues
• How to choose vocabulary size?
  – Too small: visual words not representative of all patches
  – Too large: quantization artifacts, overfitting
• Generative or discriminative learning?
• Computational efficiency
  – Vocabulary trees (Nister & Stewenius, 2006)

[Figure: histogram of frequency over codewords]
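Example (an illustrative sketch, not from the original slides): once every feature in an image has been mapped to a visual-word index, the image is represented by the histogram of those indices. The function name `bag_of_features`, the optional normalization to frequencies, and the sample indices are assumptions.

```python
import numpy as np

def bag_of_features(word_indices, vocabulary_size, normalize=True):
    """Histogram of visual-word occurrences for one image."""
    counts = np.bincount(word_indices, minlength=vocabulary_size).astype(float)
    if normalize and counts.sum() > 0:
        # Convert raw counts to relative frequencies that sum to 1.
        counts /= counts.sum()
    return counts

# Hypothetical visual-word indices for six features in one image.
word_indices = np.array([0, 2, 2, 1, 2, 0])
histogram = bag_of_features(word_indices, vocabulary_size=4)
print(histogram)
```

Normalizing makes images with different numbers of extracted features comparable; raw counts remain available with `normalize=False`.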
