The End-of-End- to-End: A Video Understanding Pentathlon CVPR - PowerPoint PPT Presentation

Jan 15, 2023 •240 likes •350 views

The End-of-End- to-End: A Video Understanding Pentathlon CVPR workshop, Monday 15 th June 2020 Goal: develop systems that can achieve higher-level comprehension of videos (not just objects, but complex actions, events and long-lasting

The End-of-End- to-End: A Video Understanding Pentathlon CVPR workshop, Monday 15 th June 2020
Goal: develop systems that can achieve “higher-level” comprehension of videos (not just objects, but complex actions, events and long-lasting narratives) Computer vision has made tremendous Big Picture progress at core tasks with CNNs trained end- Motivation to-end for classification, detection, segmentation etc. BUT….extending these methods for higher- level tasks is computationally expensive
One way forwards is to use frozen “experts” – representations that have been pretrained on large-scale datasets Enables focus on key questions for higher-level tasks while dramatically reducing compute. Why use - how to make best use of temporal information? pretrained - how to exploit multiple modalities? experts? - how to achieve robustness across different domains? Potential for broader participation outside industry
A video pentathlon challenge – exploring different ways to use pretrained features to solve the Workshop retrieval task objectives A forum for discussing ideas for video understanding
Schedule https://www.robots.ox.ac.uk/~vgg/challenges/video-pentathlon/#schedule
Schedule https://www.robots.ox.ac.uk/~vgg/challenges/video-pentathlon/#schedule
Schedule https://www.robots.ox.ac.uk/~vgg/challenges/video-pentathlon/#schedule
Your mic will be muted (except Workshop when asking questions) Virtual Protocol Use the chat to indicate that you would like to ask a question (we will also try to monitor YouTube)
We are recording this workshop to allow researchers in different timezones to access it later. Recording If you would prefer not to be visible, please turn off your video (and mute) If you want to ask a question, but would prefer not to be in the recording, please let us know afterwards (we will remove this part from the released footage)
Questions for the organisers 1. Use the chat 2. Email: albanie@robots.ox.ac.uk

Recommend

AIM 3 at Team eam RU RUC AI at Vid Video eo Pe Pentathlon Cha Challeng nge 2020 2020

AIM 3 at Team eam RU RUC AI at Vid Video eo Pe Pentathlon Cha Challeng nge 2020 2020 Shizhe Chen , Yida Zhao, Qin Jin Renmin University of China 1 Vi Video Pe Pentathlon Ch Challenge Task Text-to-Video Cross-modal

387 views • 14 slides

CVPR 2020 Video Pentathlon Challenge: Multi-modal Transformer for Video Retrieval Valentin

CVPR 2020 Video Pentathlon Challenge: Multi-modal Transformer for Video Retrieval Valentin Gabeur, Chen Sun, Karteek Alahari, Cordelia Schmid Video signal richness Video encoder Our cross-modal architecture Thank You

347 views • 5 slides

Towards generating stories about video Anna Rohrbach The End-of-End-to-End A Video

Towards generating stories about video Anna Rohrbach The End-of-End-to-End A Video Understanding Pentathlon, CVPR 2020 https://anna-rohrbach.net Lets look at a human generated video description A young singer with moppy dark brown hair

1.24k views • 65 slides

CISM MODERN PENTATHLON COMMITTEE CISM Modern Pentathlon Committee Composition of the CISM Modern

CISM MODERN PENTATHLON COMMITTEE CISM Modern Pentathlon Committee Composition of the CISM Modern Pentathlon Committee - Lt. Colonel Nilton Rolim Brazil President - Capt

517 views • 10 slides

Math Pentathlon Parent Meeting Mr. Paholak & Mr. Maldonado Meeting Agenda 1.) What is

Math Pentathlon Parent Meeting Mr. Paholak & Mr. Maldonado Meeting Agenda 1.) What is Math Pentathlon? 2.) Dates of Program & Location 3.) Arrival/Dismissal Procedure 4.) Tournament Information 5.) Coach Expectations 6.) Volunteer

243 views • 12 slides

Understanding Multimedia Systems Multimedia - Basics Lectures video as a medium video

Design and Evaluation of Understanding Multimedia Systems Multimedia - Basics Lectures video as a medium video technology Design issues Joemon Jose Advanced applications & tools Multimedia with Video Exercise

363 views • 8 slides

Video Understanding 5/ 29 /2020 Outline Background / Motivation / History Video Datasets

CS231N Section Video Understanding 5/ 29 /2020 Outline Background / Motivation / History Video Datasets Models Pre-deep learning CNN + RNN 3D convolution Two-stream What weve seen in class so far...

563 views • 53 slides

Video Understanding 6/ 1 /2018 Outline Background / Motivation / History Video Datasets

CS231N Section Video Understanding 6/ 1 /2018 Outline Background / Motivation / History Video Datasets Models Pre-deep learning CNN + RNN 3D convolution Two-stream What weve seen in class so far...

668 views • 52 slides

Video Skimming and Characterization through the Combination of Image and Language Understanding

IEEE International Workshop on Content-based Access of Image and Video Databases (ICCV98 - Bombay, India) Video Skimming and Characterization through the Combination of Image and Language Understanding Michael A. Smith Takeo Kanade Department

317 views • 10 slides

Overview/Questions Understanding the idea of a motion picture. What is digital

CS101 Lecture 15 Digital Video Concepts Aaron Stevens 20 February 2009 1 Overview/Questions Understanding the idea of a motion picture. What is digital video? How does YouTube send video over the internet? 2 1 Who

465 views • 10 slides

Digital Video Concepts John Magee 22 July 2013 1 Overview/Questions Understanding the idea

CS101 Lecture 20 Digital Video Concepts John Magee 22 July 2013 1 Overview/Questions Understanding the idea of a motion picture. What is digital video? How does YouTube send video over the internet? 2 Who invented moving

426 views • 21 slides

Towards web-scale video understanding Olga Russakovsky Serena Yeung Achal Dave (Stanford)

Towards web-scale video understanding Olga Russakovsky Serena Yeung Achal Dave (Stanford) (CMU) 400 hours of video are uploaded to YouTube every minute 70% of Internet traffic was videos in 2016, will be over 80% by 2020 1 http:// 2 White

821 views • 59 slides

Understanding the Impact of Video Quality on User Engagement Florin Dobrian Vyas Sekar Ion Stoica

Understanding the Impact of Video Quality on User Engagement Florin Dobrian Vyas Sekar Ion Stoica Hui Zhang Asad Awan Dilip Joseph Aditya Ganjam - Conviva Confidential - 2005: Beginning of Internet Video Era 100M streams first year Premium

304 views • 28 slides

Understanding the Impact of Video Quality on User Engagement Florin Dobrian Vyas Sekar Ion Stoica

536 views • 27 slides

Consumer Video Understanding A Benchmark Database + An Evaluation of Human & Machine

Consumer Video Understanding A Benchmark Database + An Evaluation of Human & Machine Performance Yu-Gang Jiang, Guangnan Ye, Shih-Fu Chang, Daniel Ellis, Alexander C. Loui Columbia University Kodak Research ACM ICMR 2011,

478 views • 23 slides

CISM SPORT COMMITTEE NAVAL PENTATHLON NAVAL PENTAHTLON President canditate LT CDR Ney Anderson

CISM SPORT COMMITTEE NAVAL PENTATHLON NAVAL PENTAHTLON President canditate LT CDR Ney Anderson G.dos Santos (BRASIL) CISM Sport Committee NAVAL PENTAHLON 5 events: 1. Obstackle Race (running & climbing) (305 m) 2. Life saving Swimming race

279 views • 4 slides

VIDEO UNDERSTANDING @ TWITTER C O U R T E S Y O F C O R T E X USER PROTECTION T W I T T E R

VIDEO UNDERSTANDING @ TWITTER C O U R T E S Y O F C O R T E X USER PROTECTION T W I T T E R C O R T E X T W I T T E R C O R T E X CONTENT UNDERSTANDING T W I T T E R C O R T E X T W I T T E R C O R T E X T W I T T E R C O R T E X

914 views • 64 slides

Learning Graph Representations for Video Understanding Xiaolong Wang Carnegie Mellon University

Learning Graph Representations for Video Understanding Xiaolong Wang Carnegie Mellon University Computer Vision Dog He et al. Mask R-CNN. ICCV 2017. Gler et al. DensePose: Dense Human Pose Estimation In The Wild. CVPR 2018. Deep Learning

941 views • 64 slides

for Zero-Example Video Search Dennis Koelma and Cees Snoek University of Amsterdam The

Query Understanding is Key for Zero-Example Video Search Dennis Koelma and Cees Snoek University of Amsterdam The Netherlands Pipeline Selected query terms Video Frames 2 / sec window average Closest terms Video Story cosine

400 views • 22 slides

Detecting Faces Marcello Pelillo University of Venice, Italy Image and Video Understanding a.y.

Detecting Faces Marcello Pelillo University of Venice, Italy Image and Video Understanding a.y. 2018/19 Face Detection Identify and locate human faces in images regardless of their: position scale pose (out-of-plane rotation)

936 views • 76 slides

Toward Understanding Natural Language Directions Video Motivating Example Data Corpus Data

Toward Understanding Natural Language Directions Video Motivating Example Data Corpus Data collection 15 visitors wrote 10 sets of directions each (150 total) Each visitor tries to follow someone elses directions to check quality

378 views • 22 slides

Graph-based Methods Marcello Pelillo University of Venice, Italy Image and Video Understanding

Graph-based Methods Marcello Pelillo University of Venice, Italy Image and Video Understanding a.y. 2018/19 Images as graphs j w ij i Node for every pixel Edge between every pair of pixels (or every pair of sufficiently close

1.38k views • 104 slides

Introductions Computer Vision Automatic understanding of images and video Instructor :

CS 376 Computer Vision : Lecture 1 What is computer vision? Computer Vision Jan 18, 2018 Done? Kristen Grauman, University of Texas at Austin Introductions Computer Vision Automatic understanding of images and video Instructor : 1.

253 views • 11 slides

PySpark of Warcraft understanding video games better through data Vincent D. Warmerdam @

PySpark of Warcraft understanding video games better through data Vincent D. Warmerdam @ GoDataDriven 1 Who is this guy Vincent D. Warmerdam data guy @ GoDataDriven from amsterdam avid python, R and js user. give open

921 views • 68 slides

The End-of-End- to-End: A Video Understanding Pentathlon CVPR - PowerPoint PPT Presentation

The End-of-End- to-End: A Video Understanding Pentathlon CVPR workshop, Monday 15 th June 2020 Goal: develop systems that can achieve higher-level comprehension of videos (not just objects, but complex actions, events and long-lasting

AIM 3 at Team eam RU RUC AI at Vid Video eo Pe Pentathlon Cha Challeng nge 2020 2020

CVPR 2020 Video Pentathlon Challenge: Multi-modal Transformer for Video Retrieval Valentin

Towards generating stories about video Anna Rohrbach The End-of-End-to-End A Video

CISM MODERN PENTATHLON COMMITTEE CISM Modern Pentathlon Committee Composition of the CISM Modern

Math Pentathlon Parent Meeting Mr. Paholak &amp; Mr. Maldonado Meeting Agenda 1.) What is

Understanding Multimedia Systems Multimedia - Basics Lectures video as a medium video

Video Understanding 5/ 29 /2020 Outline Background / Motivation / History Video Datasets

Video Understanding 6/ 1 /2018 Outline Background / Motivation / History Video Datasets

Video Skimming and Characterization through the Combination of Image and Language Understanding

Overview/Questions Understanding the idea of a motion picture. What is digital

Digital Video Concepts John Magee 22 July 2013 1 Overview/Questions Understanding the idea

Towards web-scale video understanding Olga Russakovsky Serena Yeung Achal Dave (Stanford)

Understanding the Impact of Video Quality on User Engagement Florin Dobrian Vyas Sekar Ion Stoica

Understanding the Impact of Video Quality on User Engagement Florin Dobrian Vyas Sekar Ion Stoica

Consumer Video Understanding A Benchmark Database + An Evaluation of Human &amp; Machine

CISM SPORT COMMITTEE NAVAL PENTATHLON NAVAL PENTAHTLON President canditate LT CDR Ney Anderson

VIDEO UNDERSTANDING @ TWITTER C O U R T E S Y O F C O R T E X USER PROTECTION T W I T T E R

Learning Graph Representations for Video Understanding Xiaolong Wang Carnegie Mellon University

for Zero-Example Video Search Dennis Koelma and Cees Snoek University of Amsterdam The

Detecting Faces Marcello Pelillo University of Venice, Italy Image and Video Understanding a.y.

Toward Understanding Natural Language Directions Video Motivating Example Data Corpus Data

Graph-based Methods Marcello Pelillo University of Venice, Italy Image and Video Understanding

Introductions Computer Vision Automatic understanding of images and video Instructor :

PySpark of Warcraft understanding video games better through data Vincent D. Warmerdam @

Math Pentathlon Parent Meeting Mr. Paholak & Mr. Maldonado Meeting Agenda 1.) What is

Consumer Video Understanding A Benchmark Database + An Evaluation of Human & Machine