Team RUC AI·M3 at Video Pentathlon Challenge 2020

Shizhe Chen, Yida Zhao, Qin Jin
Renmin University of China

Video Pentathlon Challenge
• Task
  • Text-to-video cross-modal retrieval
  • Using provided multimodal features
• Evaluation
  • A pentathlon of five video-text benchmarks
  • MSRVTT, MSVD, DiDeMo, ActivityNet (ANet), YouCook2 (YC2)
• Metric
  • Geometric mean of Recall@K (K = {1, 5, 10})
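The metric above can be sketched for a single benchmark as follows. This is a minimal illustration, not the official evaluation script: the function names are hypothetical, and it assumes a square similarity matrix in which the ground-truth video for query i is video i.

```python
import numpy as np

def recall_at_k(sim, k):
    """Fraction of queries whose ground-truth video ranks in the top k.

    sim[i, j] is the similarity of text query i to video j. The ground-truth
    video for query i is assumed to be video i (an assumption of this sketch).
    """
    sim = np.asarray(sim, dtype=float)
    # Rank of the ground truth = number of videos scored strictly higher.
    gt_scores = np.diag(sim)[:, None]
    ranks = (sim > gt_scores).sum(axis=1)
    return float((ranks < k).mean())

def geometric_mean_recall(sim, ks=(1, 5, 10)):
    """Geometric mean of Recall@K over the given K values."""
    recalls = np.array([recall_at_k(sim, k) for k in ks])
    return float(np.prod(recalls) ** (1.0 / len(recalls)))
```

Because the geometric mean multiplies the recalls, a model that scores zero at any K gets a zero overall score, which rewards balanced performance across K.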

Our Contributions
• Hierarchical Video-Text Matching
  • Hierarchical graph reasoning model
• Enhanced Inference Methods
  • Query expansion
  • Hubness mitigation
• Knowledge Transfer from Additional Datasets
  • Multi-task training

Hierarchical Video-Text Matching
• Simple embeddings are insufficient to represent complicated video and text details
• Hierarchical Graph Reasoning (HGR) model
  • Multi-level cross-modal matching, from global to local: event, actions, entities
  • Hierarchical textual encoding
  • Hierarchical video encoding

Chen, Shizhe, et al. "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning." CVPR, 2020.
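The idea of matching at several levels can be illustrated with a deliberately simplified sketch. It is not the HGR implementation: the graph reasoning and learned attention are omitted, local alignment is replaced by a max-over-frames cosine similarity, and all function names and the dictionary layout are assumptions made for this example.

```python
import numpy as np

def l2norm(x):
    """Scale vectors along the last axis to unit length."""
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

def global_sim(text_vec, video_vec):
    """Cosine similarity between one text vector and one video vector."""
    return float(l2norm(text_vec) @ l2norm(video_vec))

def local_sim(text_nodes, video_frames):
    """Align each text node to its best-matching frame, then average.

    text_nodes: (n_nodes, d), video_frames: (n_frames, d).
    Max-over-frames is a stand-in for learned attention (assumption).
    """
    sims = l2norm(text_nodes) @ l2norm(video_frames).T  # (n_nodes, n_frames)
    return float(sims.max(axis=1).mean())

def hierarchical_sim(text, video):
    """Average the similarities at the event, action, and entity levels."""
    scores = [
        global_sim(text["event"], video["event"]),
        local_sim(text["actions"], video["actions"]),
        local_sim(text["entities"], video["entities"]),
    ]
    return sum(scores) / len(scores)
```

Here `text["event"]` is a single vector for the whole sentence, while `text["actions"]` and `text["entities"]` hold one vector per verb or noun phrase; the video side mirrors that layout.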

Hierarchical Video-Text Matching
• Experimental results
  • HGR model achieves the best performance on all datasets
  • Especially on DiDeMo and ANet, whose descriptions are long

                          MSRVTT   MSVD    DiDeMo   ANet    YC2
Absolute gains            +1.25    +0.77   +4.18    +2.98   +1.84
Average sentence length   9        7       33       54      9

Enhanced Inference Methods
• Query Expansion
  • Reformulate a given query and ensemble results from all expanded queries
  • Use multiple query texts for a video in MSRVTT and MSVD datasets
• Experimental results
  • Improves retrieval performance with groundtruth expanded queries
• Future work: other techniques such as automatic paraphrasing
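One simple way to ensemble results from expanded queries is to average their per-video scores. The slide does not say which ensembling rule was used, so plain averaging and the function name below are assumptions of this sketch.

```python
import numpy as np

def expanded_query_scores(sims_per_query):
    """Average the video scores produced by each expanded query.

    sims_per_query: (n_expansions, n_videos), one row per query text that
    describes the same target video. Averaging is an assumed ensembling rule.
    """
    return np.asarray(sims_per_query, dtype=float).mean(axis=0)

# Three captions of the same video, scored against three candidate videos.
scores = [
    [0.9, 0.8, 0.1],  # original query, misled towards video 0
    [0.2, 0.7, 0.1],  # expanded query 1
    [0.3, 0.9, 0.2],  # expanded query 2
]
best_video = int(np.argmax(expanded_query_scores(scores)))
```

In this toy case the original query alone would retrieve video 0, while the averaged scores favour video 1.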

Enhanced Inference Methods
• Hubness Mitigation
  • Some points have high probabilities of being nearest neighbors of many other points
  • Inverted Softmax
• Experimental results
  • Improves retrieval performance with groundtruth expanded queries
• Future work: mitigate hubness problem during training

Smith, Samuel L., et al. "Offline bilingual word vectors, orthogonal transformations and the inverted softmax." ICLR, 2017.
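The inverted softmax of Smith et al. normalizes each candidate's similarities over the queries rather than over the candidates, which discounts "hub" candidates that score highly for everyone. The sketch below applies that idea to a query-by-video similarity matrix; the temperature value and the function name are assumptions, not settings reported on the slide.

```python
import numpy as np

def inverted_softmax(sim, beta=20.0):
    """Normalize each video's similarities over all queries.

    sim[i, j] is the similarity of query i to video j. beta is an inverse
    temperature; 20.0 is an arbitrary value chosen for this sketch.
    """
    logits = beta * np.asarray(sim, dtype=float)
    # Subtract the column maximum for numerical stability.
    logits = logits - logits.max(axis=0, keepdims=True)
    weights = np.exp(logits)
    # Softmax over the query axis (axis 0), not the video axis.
    return weights / weights.sum(axis=0, keepdims=True)

# Video 0 is a hub: it scores highly for both queries.
sim = np.array([[0.90, 0.85],
                [0.95, 0.10]])
raw_top1 = sim.argmax(axis=1)
adjusted_top1 = inverted_softmax(sim).argmax(axis=1)
```

With raw similarities both queries retrieve the hub video 0; after the adjustment, query 0 retrieves video 1 and query 1 keeps video 0.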

Our Contributions
• Hierarchical Video-Text Matching
  • Hierarchical graph reasoning model
• Enhanced Inference Methods
  • Query expansion
  • Hubness mitigation
• Knowledge Transfer from Additional Datasets
  • Multi-task balanced training

Knowledge Transfer
• Training with all datasets does not perform well
  • Different dataset scales and cross-domain discrepancies

                     MSRVTT    MSVD     DiDeMo   ANet    YC2
# training pairs     117,220   43,892   7,552    8,007   7,745

• Cross-dataset performance

Knowledge Transfer
• Multi-task balanced training
  • Combine target dataset and MSRVTT in training
  • Balance the training examples from different datasets
• Experimental results
  • Beneficial to employ additional datasets
• Future work: more effective transfer learning approaches
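Balancing examples across datasets can be sketched as a batch sampler that draws the same number of pairs from the target dataset and from MSRVTT, regardless of their sizes. The slide does not give the actual ratio or sampling scheme, so the 50/50 split, sampling with replacement, and the function name are all assumptions.

```python
import random

def balanced_batches(target_pairs, extra_pairs, batch_size, num_batches, seed=0):
    """Yield batches with equal shares from the target and the extra dataset.

    Sampling is with replacement so the smaller dataset is never exhausted.
    The 50/50 split is an assumption of this sketch.
    """
    rng = random.Random(seed)
    half = batch_size // 2
    for _ in range(num_batches):
        batch = rng.choices(target_pairs, k=half)
        batch += rng.choices(extra_pairs, k=batch_size - half)
        rng.shuffle(batch)
        yield batch

# Toy data: a small target dataset and a much larger extra dataset.
target = [("target", i) for i in range(5)]
extra = [("msrvtt", i) for i in range(100)]
batches = list(balanced_batches(target, extra, batch_size=8, num_batches=3))
```

Without balancing, uniform sampling over the pooled data would show the model the extra dataset about twenty times as often as the target in this toy setup.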

Testing Submissions
• Pipeline
  HGR model with multi-task balanced training → Average ensembling (3-5 models) → Query expansion (optional) → Hubness mitigation inference
• Experimental results
  • Second place in the challenge
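The average ensembling step of the pipeline can be sketched as an element-wise mean of the similarity matrices produced by the individual models. The function name is hypothetical, and equal model weights are an assumption; the slide only states that 3-5 models were averaged.

```python
import numpy as np

def average_ensemble(sim_matrices):
    """Element-wise mean of query-by-video similarity matrices.

    sim_matrices: list of arrays with identical shape, one per trained model.
    Equal weighting of models is an assumption of this sketch.
    """
    return np.mean(np.stack(sim_matrices), axis=0)

# Two toy models scoring two queries against two videos.
model_a = np.array([[0.8, 0.2], [0.4, 0.6]])
model_b = np.array([[0.6, 0.4], [0.2, 0.8]])
ensembled = average_ensemble([model_a, model_b])
```

The ensembled matrix has the same shape as each input, so later inference steps can consume it unchanged.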

Take Home Message
• The multi-level matching model (HGR) is more effective than global or local matching models for text-video retrieval
• The hubness problem needs to be addressed in both training and inference
• Knowledge transfer is promising

Contact email: cszhe1@ruc.edu.cn
