1  What Does BERT with Vision Look At?
   Liunian Harold Li (UCLA), Mark Yatskar (AI2), Da Yin (PKU), Cho-Jui Hsieh (UCLA), Kai-Wei Chang (UCLA)
   A long version, "VisualBERT: A Simple and Performant Baseline for Vision and Language", is on arXiv (Aug 2019).
2  BERT with Vision: Pre-trained Vision-and-Language (V&L) Models
   Caption: "Several people walking on a sidewalk in the rain with umbrellas."
   Masked caption: "Several people [MASK] on a [MASK] in the [MASK] with [MASK]."
   Transfer example, answer choices: a) Yes, it is snowing. b) Yes, [person8] and [person10] are outside. c) No, it looks to be fall. d) Yes, it is raining heavily.
   Pre-train on image captions and transfer to visual question answering.
3  BERT with Vision: Pre-trained Vision-and-Language (V&L) Models
   Mask and predict on image captions; a Transformer runs over image regions and text.
   Significant improvement over baselines.
   Examples of such models: ViLBERT, B2T2, LXMERT, VisualBERT, Unicoder-VL, VL-BERT, UNITER, …
   [Figure: performance of VisualBERT compared to strong baselines]
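To make the mask-and-predict setup concrete, below is a minimal sketch (not the official VisualBERT code): caption tokens and detector region features are embedded into one joint sequence, run through a Transformer encoder, and the masked caption tokens are predicted. The class name, dimensions, and the simplified embedding scheme (no position or segment embeddings) are illustrative assumptions.

```python
# Minimal sketch of mask-and-predict over text tokens + image regions.
# Names and dimensions are illustrative; positional/segment embeddings omitted.
import torch
import torch.nn as nn

class ToyVisionLanguageEncoder(nn.Module):
    def __init__(self, vocab_size=30522, hidden=256, region_feat_dim=2048,
                 layers=4, heads=4):
        super().__init__()
        self.tok_emb = nn.Embedding(vocab_size, hidden)
        self.region_proj = nn.Linear(region_feat_dim, hidden)  # project detector features
        enc_layer = nn.TransformerEncoderLayer(hidden, heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(enc_layer, layers)
        self.mlm_head = nn.Linear(hidden, vocab_size)           # predict masked caption tokens

    def forward(self, token_ids, region_feats):
        # token_ids:    (batch, n_tokens)  caption with some positions set to [MASK]
        # region_feats: (batch, n_regions, region_feat_dim) features of detected regions
        text = self.tok_emb(token_ids)
        regions = self.region_proj(region_feats)
        joint = torch.cat([text, regions], dim=1)   # one sequence: text then regions
        hidden = self.encoder(joint)
        # only the text positions are scored for masked-token prediction
        return self.mlm_head(hidden[:, : token_ids.size(1)])

# usage: predict the [MASK]ed caption words conditioned on the image regions
model = ToyVisionLanguageEncoder()
logits = model(torch.randint(0, 30522, (2, 12)), torch.randn(2, 36, 2048))
print(logits.shape)  # (2, 12, vocab_size)
```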
4  What does BERT with Vision learn during pre-training?
   Entity grounding: map entities in the text to image regions.
5  Probing attention maps of VisualBERT: Entity Grounding
   Certain heads can perform entity grounding; accuracy peaks in higher layers.
   [Plot: entity grounding accuracy by layer; peak value 50.77]
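The entity-grounding probe over a single head's attention map can be sketched as follows: for each entity word with a gold-aligned region, check whether the region receiving that head's highest attention from the word is the gold one. The function name, data layout, and the exact matching criterion (argmax over regions rather than IoU against a gold box) are simplifying assumptions for illustration.

```python
# Sketch of scoring one attention head for entity grounding, assuming the head's
# attention map and gold word-region alignments are already available.
import numpy as np

def entity_grounding_accuracy(attn, gold_alignment, n_text):
    """
    attn:           (seq_len, seq_len) attention map of one head; positions
                    [0, n_text) are text tokens, [n_text, seq_len) are regions.
    gold_alignment: dict mapping an entity token index -> gold region index
                    (region indices count from 0 within the region block).
    """
    correct = 0
    for tok_idx, gold_region in gold_alignment.items():
        region_attn = attn[tok_idx, n_text:]            # attention from the word to regions
        if int(np.argmax(region_attn)) == gold_region:  # most-attended region is the gold one
            correct += 1
    return correct / max(len(gold_alignment), 1)

# toy usage with random numbers, only to show the shapes involved
attn = np.random.rand(12 + 36, 12 + 36)
attn /= attn.sum(axis=-1, keepdims=True)
print(entity_grounding_accuracy(attn, {3: 7, 5: 0}, n_text=12))
```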
6  What does BERT with Vision learn during pre-training?
   Syntactic grounding: map a word w1 to the regions of w2, if w1 and w2 are connected by a dependency relation (w1 → w2).
7  Probing attention maps of VisualBERT: Syntactic Grounding
   For each dependency relation, there exists at least one head that performs syntactic grounding accurately.
8  Probing attention maps of VisualBERT: Syntactic Grounding
   Syntactic grounding accuracy peaks in higher layers.
   [Plots: grounding accuracy by layer for the pobj and nsubj relations]
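The syntactic-grounding probe is analogous: for a dependency edge between w1 and w2 where w2 is aligned to a gold region, check whether w1's attention peaks on that region, and aggregate accuracy per relation. Below is a sketch under the same simplifying assumptions as above; the edge list, data layout, and names are illustrative, not the paper's exact evaluation code.

```python
# Sketch of per-relation syntactic grounding accuracy for one attention head.
import numpy as np
from collections import defaultdict

def syntactic_grounding_accuracy(attn, dep_edges, gold_alignment, n_text):
    """
    attn:           (seq_len, seq_len) attention map of one head (text then regions).
    dep_edges:      list of (w1_tok_idx, w2_tok_idx, relation) dependency edges;
                    w1 is the word whose attention is inspected, w2 the grounded word.
    gold_alignment: dict mapping token index -> gold region index.
    Returns accuracy per dependency relation, over edges whose w2 is grounded.
    """
    hits, totals = defaultdict(int), defaultdict(int)
    for w1, w2, rel in dep_edges:
        if w2 not in gold_alignment:
            continue                                    # only score edges with a grounded w2
        totals[rel] += 1
        region_attn = attn[w1, n_text:]
        if int(np.argmax(region_attn)) == gold_alignment[w2]:
            hits[rel] += 1
    return {rel: hits[rel] / totals[rel] for rel in totals}

# toy usage: e.g. an nsubj edge "walking" -> "people" with "people" grounded to region 4
attn = np.random.rand(12 + 36, 12 + 36)
print(syntactic_grounding_accuracy(attn, [(2, 1, "nsubj"), (5, 7, "pobj")],
                                   {1: 4, 7: 9}, n_text=12))
```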
9  Probing attention maps of VisualBERT: Qualitative Example
   Accurate entity and syntactic grounding; the grounding is refined over the layers.
   [Figure: attention from "woman", "sweater", and "husband" to image regions across layers 3, 4, 5, 6, 10, and 11]
10 Discussion
   Previous work:
   - Pre-trained language models learn the classical NLP pipeline (Peters et al., 2018; Liu et al., 2019; Tenney et al., 2019).
   - Qualitatively, V&L models learn some entity grounding (Yang et al., 2016; Anderson et al., 2018; Kim et al., 2018).
   - Grounding can be learned using dedicated methods (Xiao et al., 2017; Datta et al., 2019).
   Our paper:
   - BERT with Vision learns grounding through pre-training.
   - We quantitatively verify both entity and syntactic grounding.
   Code: https://github.com/uclanlp/visualbert