stanford i2v a news video dataset for query by image
play

Stanford I2V: A News Video Dataset for Query-by-Image Experiments - PowerPoint PPT Presentation

Stanford I2V: A News Video Dataset for Query-by-Image Experiments Andr Araujo, J. Chaves, D. Chen, R. Angst, B. Girod Stanford University Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 1 Motivation Example:


  1. Stanford I2V: A News Video Dataset for Query-by-Image Experiments André Araujo, J. Chaves, D. Chen, R. Angst, B. Girod Stanford University Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 1

  2. Motivation Example: Brand Monitoring Retrieval System Logo or product NBC, 11/18/2014, 7:35:33 PM Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 2

  3. Motivation Example: Content Linking Retrieval System KDTV, 01/18/2013, 6:41:45PM Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 3

  4. Motivation Example: Lecture search Retrieval System Presentation slide CS246, lecture 12 December 2, 2013 Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 4

  5. Online demo http://videosearch.stanford.edu Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 5

  6. Outline - Related Work - Stanford I2V Dataset - Dataset Construction - Baseline Experiments Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 6

  7. Outline - Related Work - Stanford I2V Dataset - Dataset Construction - Baseline Experiments Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 7

  8. Related Work: Visual Search Query V2I: Augmented Reality V2V: Content Tracking Video TCD, Makar et al., 2012 Frame Mat. + ST, Douze et al., 2010 Location Rec., Takacs et al., 2010 TRECVID-CCD, Over et al., 2012 I2I: Traditional Visual Search I2V: Video Search by Image FV, Jégou et al., 2012 TRECVID-INS, Over et al., 2014 Image BoW, Sivic et al., 2006 SVT, Nistér et al., 2006 TAPS, Araujo et al., 2014 SIFT, Lowe, 2004 Database Images Videos Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 8

  9. Related Work: Existing I2V Datasets Dataset Size # Queries Sivic et al., Video-Google, 2006 2h 164 Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 9

  10. Related Work: Existing I2V Datasets Dataset Size # Queries Sivic et al., Video-Google, 2006 2h 164 Over et al., TRECVID-INS, 2014 464h 30 Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 10

  11. Related Work: Existing I2V Datasets Dataset Size # Queries Sivic et al., Video-Google, 2006 2h 164 Over et al., TRECVID-INS, 2014 464h 30 Araujo et al., CNN2h, 2014 2h 139 Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 11

  12. Related Work: Existing I2V Datasets Dataset Size # Queries Sivic et al., Video-Google, 2006 2h 164 Over et al., TRECVID-INS, 2014 464h 30 Araujo et al., CNN2h, 2014 2h 139 Araujo et al., Stanford I2V, 2015 3,801h 229 Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 12

  13. Outline - Related Work - Stanford I2V Dataset - Dataset Construction - Baseline Experiments Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 13

  14. Stanford I2V Dataset Query images Database videos (selected frames) Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 14

  15. Stanford I2V Dataset Full version Light version 3.8k hours 1k hours 84k video clips 23k video clips 229 query images 78 query images 14M keyframes@1fps 3.8M keyframes@1fps 2.7 minutes/clip 2.65 minutes/clip Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 15

  16. Evaluation Procedure 1 st stage: Retrieval of Clips 2 nd stage: Temporal Refinement Query … 1 2 System 3 … … Ranked retrieval measures: Unranked retrieval measure: - Average Precision (AP) - Temporal Jaccard Index - Precision at 1 (p@1) Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 16

  17. Query/Annotation Viewer Query image Clip 1 Clip 2 Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 17

  18. Outline - Related Work - Stanford I2V Dataset - Dataset Construction - Baseline Experiments Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 18

  19. Dataset Construction: Video Collection News Videos Recording Story Segmentation Website Video clips Daneshi et al., 2013 Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 19

  20. Dataset Construction: Query Set Collection - Collected images from news websites - Used the Internet Archive Wayback Machine - Collected 805 candidate images from dates between October 1 st 2012 and September 30 th 2013 - Types of images: - Iconic images (events in the news) - Magazine covers (Time, Economist) Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 20

  21. Dataset Construction: Annotation Feature-based matching + RANSAC Query image Select all videos Match query within 1 week of against each frame Query date Jan. 7 th , 2013 query date individually Annotation Select Global signature Approve of video matches matching to matches sequences manually entire database manually Accept query if there are approved matches Reject query if no approved matches Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 21

  22. Outline - Related Work - Stanford I2V Dataset - Dataset Construction - Baseline Experiments Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 22

  23. Example: Evaluation of Standard Technique - SIFT descriptors + SCFV global signatures [Lowe, 2004] [Duan et al., 2014] - Retrieval of Clips evaluation: - Compare query signature to video frames ’ signatures (@1fps) from entire database - Evaluate performance over top 100 ranked clips - Temporal Refinement evaluation: - Compare query signature to video frames ’ signatures (@1fps) from each correct matching video - Feature matching + RANSAC between query and top 50 frames (consider a match if at least 8 inliers are found) - Evaluate Jaccard index between matches and ground-truth segments Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 23

  24. Example: Evaluation of Standard Technique Retrieval of Clips: results Temporal Refinement results 50 Light version 44 ¡ Light ¡ Full ¡ Full version 45 42 ¡ mJac (%) mAP (%) 40 ¡ 40 mAP (%) 38 ¡ 36 ¡ 35 34 ¡ 30 32 ¡ 30 ¡ 25 128 ¡ 192 ¡ 256 ¡ 512 ¡ 0 10 20 30 40 50 60 mRetLatency (secs) Latency (secs) Number of Gaussians Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 24

  25. Summary - Dataset for video retrieval using query images - 3.8k hours of video and 229 queries – largest dataset yet - First dataset to allow true large-scale experiments in this area - Experiments using standard image retrieval technique were presented, serving as a baseline for future evaluations Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 25

  26. Thank you! Questions? Dataset webpage: http://blackhole1.stanford.edu/vidsearch/dataset/stanfordi2v.html Online demo: http://videosearch.stanford.edu André Araujo http://stanford.edu/~afaraujo afaraujo@stanford.edu Araujo et al., Stanford I2V: A News Video Dataset for Query-by-Image Experiments. 26

Recommend


More recommend