Learning Deep Features for Scene Recognition using Places Database - PowerPoint PPT Presentation

Jan 01, 2024 •488 likes •697 views

Learning Deep Features for Scene Recognition using Places Database Bolei Zhou, Agata Lapedriza, Jianxiong Xiao, Antonio Torralba, Aude Oliva NIPS2014 Bora elikkale INTRODUCTION Human Visual Recognition Samples world several times / sec

Learning Deep Features for Scene Recognition using Places Database Bolei Zhou, Agata Lapedriza, Jianxiong Xiao, Antonio Torralba, Aude Oliva NIPS2014 Bora Çelikkale
INTRODUCTION Human Visual Recognition Samples world several times / sec ~millions images within a year
INTRODUCTION Primate Brain Hierarchical organization in layers of increasing processing complexity Inspired CNNs
PROBLEM & MOTIVATION Obj Classification have obtained astonishing performanace with large databases (ImageNet) Iconic images do not contain the richness and diversity of visual info in scenes
CONTRIBUTIONS Scene-centric database 60x larger than SUN Comparison metrics for scene datasets: Density, Diversity
SCENE DATASETS Scene15 MIT Indoor67 (Lazebnik et al. 2006) (Quatham & Torralba 2009) 67 categories of indoor places 15 categories 15.620 imgs ~3000 imgs SUN (Xiao et al. 2010) Places (Zhou et al. 2014) 397 (well-sampled) categories 476 categories 130.519 imgs 7.076.580 imgs
PLACES DATASET Google Images Same categories from SUN 1 Bing Images 696 popular adjectives in Eng Flickr >40M imgs are downloaded
PLACES DATASET PCA-based duplicate removal across SUN 2 Places & SUN have different images Allows to combine Places & SUN
PLACES DATASET Annotations (with AMT) 3 Questions (eg: is this a living room?) Two round setup: 1. Default answer is NO 2. Default answer is YES Imgs shown / round : 750 + 60 from SUN for control Take >90% accuracy
COMPARISON METRICS Relative Density
COMPARISON METRICS Relative Density Images have more similar neighbors NN of a 1 NN of b 1
COMPARISON METRICS Relative Diversity Simpson Index: two random individual belong to same specie NN of a 1 NN of b 1
EXPERIMENTS Density & Diversity Comparison (AMT) 1 Relative diversity vs. relative density per each category and dataset Show 12 pairs of images Workers select the most similar pair Diversity: pairs are chosen random for each db Density: 5th NN (avoid near duplicates) is chosen as pair with GIST
EXPERIMENTS Cross Dataset Generalization 2 Training and testing across different datasets ImageNet-CNN and linear SVM
EXPERIMENTS Comparison with Hand-designed Features 3
EXPERIMENTS Training CNN for Scene Recognition 4 2,5M imgs from 205 categories, on AlexNet
PLACES-CNNs Hybrid-AlexNet Places + ImageNet 3.5M imgs, 1183 categories Accuracy = 0.5230 on validation set Places205-GoogLeNet (on 205 categories) Accuracy: top1 = 0.5567 , top5 = 0.8541 on validation set Places205-VGG16 (on 205 categories) Accuracy: top1 = 0.5890 , top5 = 0.8770 on validation set
PLACES2 DATASET 400+ unique scene categories >10M images AlexNet top1 accuracy: 43.0% VGG16 top1 accuracy: 47.6%
DEMO http://places.csail.mit.edu/demo.html http://places2.csail.mit.edu/demo.html
THANK YOU

Recommend

Scene Graphs Scene Representation How does one describe the objects in a 3D scene? Scene

Scene Graphs Scene Representation How does one describe the objects in a 3D scene? Scene Graphs 1 Scene Modeling Languages / API From the low level to the high level State Machine Model -- OpenGL Graph Based Scene Languages

483 views • 17 slides

Scene Representation How does one describe the objects in a Scene Graphs 3D scene? Scene

Scene Representation How does one describe the objects in a Scene Graphs 3D scene? Scene Graphs State Machine Model API Scene Modeling Languages / API From the low level to the high level State Machine Model State Machine

436 views • 6 slides

Episode 42: I Made Slides 10 February 2019 The Three-Act, Seven Scene Structure Act I:

Episode 42: I Made Slides 10 February 2019 The Three-Act, Seven Scene Structure Act I: Act III: Scene 1: The Inciting Incident Scene 5: Dark Moment Scene 2: Choice to Engage Scene 6: Climax Act II: Scene 7: Resolution Scene 3: The

217 views • 10 slides

CMSC427 Scene graphs Credit: slides from Dr. Zwicker Today Scene graphs & hierarchies

CMSC427 Scene graphs Credit: slides from Dr. Zwicker Today Scene graphs & hierarchies Introduction Scene graph data structures Rendering scene graphs Level-of-detail Culling 2 So far: rendering pipeline Scene data

939 views • 66 slides

Scene Recognition Scene Recognition Adriana Kovashka Adriana Kovashka UTCS, PhD student UTCS,

Scene Recognition Scene Recognition Adriana Kovashka Adriana Kovashka UTCS, PhD student UTCS, PhD student Problem Problem Problem Problem Statement Statement Distinguish between different types of scenes Applications

617 views • 45 slides

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition:

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition: Detection Alignment Recognition Face detection & alignment Face recognition Face detection & alignment Detection

1.2k views • 50 slides

COMPANY PROFILE WATER FEATURES 1 WATER FEATURES 2 WATER FEATURES 3 WATER FEATURES 4 WATER

COMPANY PROFILE WATER FEATURES 1 WATER FEATURES 2 WATER FEATURES 3 WATER FEATURES 4 WATER FEATURES 5 WATER FEATURES 6 WATER FEATURES 7 EXCLUSIVE POOLS 8 EXCLUSIVE POOLS 9 EXCLUSIVE POOLS 10 EXCLUSIVE POOLS 11 OVERFLOW 12

962 views • 40 slides

Hao Su July 6, 2017 Outline Overview of 3D deep learning 3D deep learning algorithms

Deep 3D Representation Learning for Visual Computing Hao Su July 6, 2017 Outline Overview of 3D deep learning 3D deep learning algorithms Conclusion 2 Outline Overview of 3D deep learning Background 3D deep learning tasks 3D deep

1.66k views • 122 slides

All You Want To Know About CNNs Yukun Zhu Deep Learning Deep Learning Image from

All You Want To Know About CNNs Yukun Zhu Deep Learning Deep Learning Image from http://imgur.com/ Deep Learning Image from http://imgur.com/ Deep Learning Image from http://imgur.com/ Deep Learning Image from http://imgur.com/ Deep

1.15k views • 79 slides

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

Deep Neural Networks and Deep Reinforcement Learning Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and Courville [chapt. 6,7,8]; AIMA [sect. 21.1-21.3]; Sutton and Barto, Reinforcement Learning: an

528 views • 35 slides

a better and faster way Shu Kong CS, ICS, UCI Image Understanding --> Scene Parsing Scene

Scene Parsing through Per-Pixel Labeling: a better and faster way Shu Kong CS, ICS, UCI Image Understanding --> Scene Parsing Scene Parsing semantic segmentation classifying each pixel into one of defined categories Scene Parsing semantic

1.29k views • 103 slides

Volumetric Scene Reconstruction Volumetric Scene Reconstruction Goal Goal from Multiple

Image Image- -Based Scene Reconstruction Based Scene Reconstruction Volumetric Scene Reconstruction Volumetric Scene Reconstruction Goal Goal from Multiple Views from Multiple Views Automatic construction of photo Automatic

378 views • 21 slides

Deep Incremental Scene Understanding Federico Tombari & Christian Rupprecht Technical

Deep Incremental Scene Understanding Federico Tombari & Christian Rupprecht Technical University of Munich, Germany Scene Understanding and SLAM Scene understanding with deep SLAM from RGB-D data allowing learning (typically frame-wise)

365 views • 34 slides

Changing Places/Changing Faces 1 Running Head: CHANGING PLACES/CHANGES FACES Changing

Changing Places/Changing Faces 1 Running Head: CHANGING PLACES/CHANGES FACES Changing Places/Changing Faces: Immigrant Youth and Identity Development Elham Bagheri & Jay M. Greenfeld, M.A. University of Iowa Changing Places/Changing

401 views • 11 slides

RE Places of Worship RE | Year 2 | Places of Worship | Special Places | Lesson 1 Aim Aim To

RE Places of Worship RE | Year 2 | Places of Worship | Special Places | Lesson 1 Aim Aim To consider what makes a place so special to people. Success Criteria Success Criteria Statement 1 Lorem ipsum dolor sit amet, consectetur

866 views • 31 slides

Welcome jI jI Awi wieAW eAW nM nMUUUU 42 places Up to 10 Nursery places for for Sikh

Welcome jI jI Awi wieAW eAW nM nMUUUU 42 places Up to 10 Nursery places for for Sikh applicants Other Faith Total 52 Pupils Up to 12 48 places Reception places for for Sikh Other Faith applicants Total 60 Pupils Children in public

459 views • 21 slides

The Promise and Perils of Big Data Some Slides from A. Efros and A. Torralba Why do we need data?

The Promise and Perils of Big Data Some Slides from A. Efros and A. Torralba Why do we need data? Most problems in vision are ambiguous and hard. 2D -> 3D Segmentation/Edges So, how do we solve these problems? Magic of data !

1.72k views • 144 slides

Illustration of the Capability and Limits of Visual Perception Aude Oliva Brain and Cognitive

Illustration of the Capability and Limits of Visual Perception Aude Oliva Brain and Cognitive Sciences MIT Email: oliva@mit.edu Web site: cvcl.mit.edu Demo 1 What do you see at a glance? Fast visual perception & Temporal constraints

431 views • 21 slides

CS54701: Information Retrieval CS-54701 Information Retrieval Luo Si Department of Computer

CS54701: Information Retrieval CS-54701 Information Retrieval Luo Si Department of Computer Science Purdue University Overview of Information Retrieval Why Information Retrieval: Information Overload: The world produces between 1 and 2

366 views • 33 slides

gOlogy: impact of -O* on -g Alexandre Oliva aoliva@redhat.com http://people.redhat.com/~aoliva/

1 gOlogy: impact of -O* on -g Alexandre Oliva aoliva@redhat.com http://people.redhat.com/~aoliva/ GNU Tools Cauldron, 2018 gOlogy: impact of -O* on -g Alexandre Oliva 2 Summary Project description Ground assumptions Main

945 views • 10 slides

Mo Movin ving F From G m Goo ood In Intentio tions ns to o Co Concrete A Actio tion:

Mo Movin ving F From G m Goo ood In Intentio tions ns to o Co Concrete A Actio tion: Creat reating T g Tran rans-Affir firmi ming Se Servic vices a and Gr Grant t In Initia itiativ tives Shannon Wyss , AIDS United Morey

399 views • 21 slides

Network-based and Client-based DMM solutions using Mobile IP mechanisms

Network-based and Client-based DMM solutions using Mobile IP mechanisms draft-bernardos-dmm-cmip-07 draft-bernardos-dmm-pmip-08 draft-bernardos-dmm-distributed-anchoring-09 Carlos J. Bernardos Universidad Carlos III de Madrid Antonio de la

636 views • 19 slides

Video Analytics Xavier Gir-i-Nieto Motivation 2 Motivation 3 Motivation 4 Outline 1.

Day 4 Lecture 4 Video Analytics Xavier Gir-i-Nieto Motivation 2 Motivation 3 Motivation 4 Outline 1. Scene Classification 2. Object Detection & Tracking 5 Scene Classification (Slides by Victor Campos) Karpathy, A., Toderici, G.,

966 views • 53 slides

Image Processing II Computer Vision Fall 2018 Columbia University Convolution Review Cross

Image Processing II Computer Vision Fall 2018 Columbia University Convolution Review Cross Correlation 0 0 0 1 G [ x , y ] -1 0 1 F [ x , y ] 9 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 90 90 90 90

1.1k views • 97 slides