RECURRENT CONVOLUTIONAL NEURAL NETWORK FOR OBJECT RECOGNITION
Ming Liang and Xiaolin Hu
May 3, 2016
Presenter: Ceren Guzel Turhan
CONTENT
• Overview
• Problem statement
• Motivation
• Overview of approach
• Related studies
• RCNN model
• Implementations
• Experimental setups
• Experimental results
• Conclusion
OVERVIEW
• Inspired by the fact that recurrent synapses outnumber feed-forward and top-down synapses in the brain
• Idea: recurrent connections within the convolutional layers
• The activity of each unit can be modulated by the activities of its neighboring units, enhancing the ability to capture context information
• Recurrent connections provide multiple paths from input to output, facilitating learning
PROBLEM STATEMENT
• Task: object recognition
• Illustration from "Fast R-CNN: Object detection with Caffe" by Ross Girshick
MOTIVATION
State-of-the-art results using CNNs in object recognition:
• ImageNet [26]
• ILSVRC-2012, Pascal VOC-2007, Pascal VOC-2012, Caltech-101, Caltech-256 [5]
• Pascal VOC-2007 [43]
• ILSVRC-2014 [50]
• CIFAR-10, CIFAR-100, MNIST [33]
MOTIVATION
Brain-CNN and brain-RNN relationship
• CNN
  • originates from neuroscience (the first artificial neuron)
  • is related to cells in the primary visual cortex
Figure from Daniel L. K. Yamins and James J. DiCarlo
MOTIVATION
Brain-CNN and brain-RNN relationship
• RNN
  • Recurrent synapses in the neocortex outnumber feed-forward and top-down synapses
  • They play a role in context modulation
MOTIVATION
Object recognition - RNN relationship:
• Object recognition is a dynamic process, thanks to recurrent and top-down synapses
• The processing of visual signals is influenced by context information
• The response properties of neurons are modulated by the context around their receptive fields (RFs)
MOTIVATION
Context information:
• is important for object recognition
• can be captured in the higher layers of feed-forward models, where units have larger RFs
• cannot modulate the activities of units in lower layers, which have smaller RFs
Strategies for using context information:
• top-down connections
• recurrent connections
• in this study: recurrent connections within the same layer
OVERVIEW OF APPROACH
• Similar to the recurrent multilayer perceptron (RMLP), but with shared local connections instead of the full connections used in RMLP
• RCNN: a feed-forward CNN with recurrent connections inside the convolutional layers
RELATED STUDIES
Similarly named studies:
• Recurrent Convolutional Neural Networks for Scene Labeling (2014)
• Convolutional Neural Networks with Intra-Layer Recurrent Connections for Scene Labeling (2015)
• Long-term Recurrent Convolutional Networks for Visual Recognition and Description (2015)
• Recurrent Convolutional Neural Networks for Object-Class Segmentation of RGB-D Video (2015)
RELATED STUDIES
MDRNN [20]:
• treats images as 2D sequential data
• has only one hidden layer and cannot generate hierarchical features the way a CNN does
Hierarchical RNN (Neural Abstraction Pyramid, NAP) [2]:
• recurrent and feedback connections
• vertical and lateral recurrent connections
• abstract image representations
• a network with excitatory and inhibitory units
• only the feed-forward version is used in the test phase; the recurrent version is used for image reconstruction
RELATED STUDIES
CDBN [31]:
• top-down connections
• unsupervised feature learning by propagating information from the top layer to the bottom layer
rCNN for scene labeling [36]:
• recurrent connections between different layers (composed network instances)
• rCNN_n: n network instances of the same CNN applied recurrently
• each network instance takes the RGB image and the output of the previous instance as input
Figure from Pedro O. Pinheiro and Ronan Collobert [36]
RELATED STUDIES
• Sparse coding models [15]: their iterative optimization procedures implicitly define recurrent neural networks
• Recursive CNN [9]: closely related to the time-unfolded version of an RCNN
RCNN MODEL: RCL LAYER
For a unit at location (i, j) on the k-th feature map of a recurrent convolutional layer (RCL), the net input at time step t is

z_{ijk}(t) = (w_k^f)^T u^{(i,j)}(t) + (w_k^r)^T x^{(i,j)}(t-1) + b_k

where
• u^{(i,j)}(t): feed-forward input (the patch centered at (i, j) in the previous layer)
• x^{(i,j)}(t-1): recurrent input (the patch centered at (i, j) in the same layer at the previous time step)
• (i, j): location of the unit, k: index of the feature map
• w_k^f: feed-forward weights, w_k^r: recurrent weights, b_k: bias
The activity of the unit is x_{ijk}(t) = g(f(z_{ijk}(t))), where
• f: rectified linear function, f(z) = max(z, 0)
• g: local response normalization (LRN) across adjacent feature maps
(A minimal code sketch of this update follows below.)
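To make the update rule concrete, here is a minimal NumPy sketch of a single-feature-map RCL unfolded for T iterations. It is an illustrative simplification (one feature map, LRN omitted, and all function and variable names such as rcl_forward, w_f, w_r are hypothetical), not the authors' implementation.

```python
import numpy as np
from scipy.signal import correlate2d

def relu(z):
    return np.maximum(z, 0.0)

def rcl_forward(u, w_f, w_r, b, T=3):
    """Unfold one single-map recurrent convolutional layer (RCL) for T steps.

    u   : feed-forward input map (H x W), held fixed over the iterations
    w_f : feed-forward filter (e.g. 3 x 3), playing the role of w_k^f
    w_r : recurrent filter (e.g. 3 x 3), playing the role of w_k^r
    b   : scalar bias b_k
    """
    ff = correlate2d(u, w_f, mode="same")       # (w_f)^T u, computed once
    x = relu(ff + b)                            # t = 0: no recurrent input yet
    for _ in range(T):
        rec = correlate2d(x, w_r, mode="same")  # (w_r)^T x(t-1)
        x = relu(ff + rec + b)                  # z(t) -> x(t); LRN omitted here
    return x

# Example: a 32x32 input map with random 3x3 filters and T = 3 iterations.
u = np.random.rand(32, 32)
x = rcl_forward(u, np.random.randn(3, 3), np.random.randn(3, 3), b=0.0, T=3)
```

Note how the feed-forward term stays fixed while the recurrent term is recomputed at every step, which is what lets neighboring units modulate each other over the iterations.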
RCNN MODEL
(figure)
RCNN MODEL ARCHITECTURE
• A standard convolutional layer, 2 RCLs, pooling, 2 more RCLs, pooling, and an FC layer
• Dropout after each pooling layer except layer 5
• Cross-entropy loss, trained with BPTT
• T + 1: the depth of each unfolded RCL
• 4(T + 1) + 2: the length of the longest path through the unfolded network (e.g., for T = 3 this is 4 × 4 + 2 = 18)
(A single-channel code sketch of this layer stack follows below.)
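For orientation, the sketch below strings the slide's layer sequence together (conv, two RCLs, pooling, two RCLs, pooling, FC) in the same simplified single-map NumPy style as the previous block. The parameter dictionary keys, the filter shapes, the plain max pooling, and the softmax output are illustrative assumptions, not the paper's exact configuration.

```python
import numpy as np
from scipy.signal import correlate2d

def relu(z):
    return np.maximum(z, 0.0)

def rcl(u, w_f, w_r, b, T=3):
    # Single-map RCL unfolded for T iterations (LRN omitted for brevity).
    ff = correlate2d(u, w_f, mode="same")
    x = relu(ff + b)
    for _ in range(T):
        x = relu(ff + correlate2d(x, w_r, mode="same") + b)
    return x

def max_pool(x, size=3, stride=2):
    rows = range(0, x.shape[0] - size + 1, stride)
    cols = range(0, x.shape[1] - size + 1, stride)
    return np.array([[x[i:i + size, j:j + size].max() for j in cols] for i in rows])

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def rcnn_forward(img, p, T=3):
    """conv1 -> RCL2 -> RCL3 -> pool -> RCL4 -> RCL5 -> pool -> FC, as on the slide."""
    x = relu(correlate2d(img, p["w1"], mode="same") + p["b1"])  # layer 1: standard conv
    x = rcl(x, p["w2f"], p["w2r"], p["b2"], T)                  # layer 2: RCL
    x = rcl(x, p["w3f"], p["w3r"], p["b3"], T)                  # layer 3: RCL
    x = max_pool(x)                                             # pooling
    x = rcl(x, p["w4f"], p["w4r"], p["b4"], T)                  # layer 4: RCL
    x = rcl(x, p["w5f"], p["w5r"], p["b5"], T)                  # layer 5: RCL
    x = max_pool(x)                                             # pooling
    return softmax(p["w_fc"] @ x.flatten() + p["b_fc"])         # FC + softmax output
```

Because each RCL is itself unfolded for T steps, stacking four of them plus the first convolution and the output layer gives the 4(T + 1) + 2 longest path quoted on the slide.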
IMPLEMENTATIONS
• Cuda-convnet2, on 2 Titan GPUs
• Hyper-parameters:
  • K (number of feature maps): 96
  • Feed-forward filter size in layer 1: 5 × 5
  • Feed-forward and recurrent filter sizes in layers 2 to 5: 3 × 3
  • For LRN: α = 0.001, β = 0.75, N = K/8 + 1 (a sketch of this normalization follows below)
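The normalization can be sketched as below: a NumPy local response normalization across roughly N neighbouring feature maps using the constants on this slide. The exact functional form (in particular the constant 1 and the 1/N factor inside the power) is an assumption in the spirit of Krizhevsky-style LRN, and the function name is hypothetical.

```python
import numpy as np

def lrn_across_maps(f, alpha=0.001, beta=0.75):
    """Normalize each unit by the squared activities of ~N adjacent feature maps.

    f: activations of shape (K, H, W); N = K/8 + 1 as stated on the slide.
    """
    K = f.shape[0]
    N = K // 8 + 1
    out = np.empty_like(f)
    for k in range(K):
        lo = max(0, k - N // 2)   # window of neighbouring feature maps
        hi = min(K, k + N // 2 + 1)
        denom = (1.0 + (alpha / N) * np.sum(f[lo:hi] ** 2, axis=0)) ** beta
        out[k] = f[k] / denom
    return out

# Example: normalize a random 96-map activation volume (K = 96 gives N = 13).
y = lrn_across_maps(np.random.rand(96, 16, 16))
```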
EXPERIMENTAL SETUPS
Datasets:
• CIFAR-10
• CIFAR-100
• MNIST
• SVHN
Training (a minimal sketch of the update and schedule follows below):
• BPTT combined with stochastic gradient descent
• Initial learning rate: 0.01
• When accuracy stopped improving, the learning rate was divided by 10
• Final learning rate: 0.0001
• Momentum: 0.9
• Number of recurrent iterations (T): 3
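As a concrete reading of these settings, here is a minimal sketch of SGD with momentum 0.9 and the 0.01 → 0.001 → 0.0001 learning-rate schedule. The plateau test and all names are illustrative assumptions rather than the authors' code.

```python
import numpy as np

def sgd_momentum_step(params, grads, velocity, lr, momentum=0.9):
    """One SGD-with-momentum update applied in place to every parameter array."""
    for name, w in params.items():
        velocity[name] = momentum * velocity[name] - lr * grads[name]
        w += velocity[name]

def anneal_lr(lr, accuracy_plateaued, final_lr=0.0001):
    """Divide the learning rate by 10 when validation accuracy stops improving."""
    if accuracy_plateaued and lr > final_lr:
        lr = max(lr / 10.0, final_lr)
    return lr

# Toy usage: one update on a dummy parameter, then two annealing steps.
params = {"w": np.zeros(4)}
grads = {"w": np.ones(4)}
velocity = {"w": np.zeros(4)}
lr = 0.01
sgd_momentum_step(params, grads, velocity, lr)
lr = anneal_lr(lr, accuracy_plateaued=True)   # 0.01 -> 0.001
lr = anneal_lr(lr, accuracy_plateaued=True)   # 0.001 -> 0.0001
```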
EXPERIMENTAL RESULTS: CIFAR-10
Dataset:
• 60,000 images: 50,000 for training and 10,000 for testing (10,000 of the training images used for validation)
• 32 × 32 pixel resolution
• 10 classes
Baseline models:
• WCNN-128: RCNN with the recurrent connections removed and the number of feature maps widened to 128 (3 × 3 filters)
• rCNN-96: recurrent connections of the RCLs removed, and each RCL replaced by a cascade of duplicated convolutional layers
EXPERIMENTAL RESULTS: CIFAR-10
Comparison with baseline models:

Model               # of parameters   Training error (%)   Testing error (%)
rCNN-96 (1 iter)    0.67 M            4.61                 12.65
rCNN-96 (2 iters)   0.67 M            2.26                 12.99
rCNN-96 (3 iters)   0.67 M            1.24                 14.92
WCNN-128            0.60 M            3.45                 9.98
RCNN-96 (1 iter)    0.67 M            4.99                 9.95
RCNN-96 (2 iters)   0.67 M            3.58                 9.63
RCNN-96 (3 iters)   0.67 M            3.06                 9.31
EXPERIMENTAL RESULTS: CIFAR-10
Comparison with state-of-the-art models without data augmentation:

Model                  # of parameters   Testing error (%)
Maxout [17]            > 5 M             11.68
Prob maxout [47]       > 5 M             11.35
NIN [33]               0.97 M            10.41
DSN [30]               0.97 M            9.69
RCNN-96                0.67 M            9.31
RCNN-128               1.19 M            8.98
RCNN-160               1.86 M            8.69
RCNN-96 (no dropout)   0.67 M            13.56
EXPERIMENTAL RESULTS: CIFAR-10
Comparison with state-of-the-art models with data augmentation:

Model                        # of parameters   Testing error (%)
Prob maxout [47]             > 5 M             9.39
Maxout [17]                  > 5 M             9.38
DropConnect (12 nets) [51]   -                 9.32
NIN [33]                     0.97 M            8.81
DSN [30]                     0.97 M            7.97
RCNN-96                      0.67 M            7.37
RCNN-128                     1.19 M            7.24
RCNN-160                     1.86 M            7.09
EXPERIMENTAL RESULTS: CIFAR-100
Dataset:
• 60,000 images: 50,000 for training and 10,000 for testing (10,000 of the training images used for validation)
• 32 × 32 pixel resolution
• 100 classes
• Same settings as CIFAR-10, without further hyper-parameter tuning
EXPERIMENTAL RESULTS: CIFAR-100
Comparison with state-of-the-art models:

Model                    # of parameters   Testing error (%)
Maxout [17]              > 5 M             38.57
Prob maxout [47]         > 5 M             38.14
Tree based priors [49]   -                 36.85
NIN [33]                 0.98 M            35.68
DSN [30]                 0.98 M            34.57
RCNN-96                  0.68 M            34.18
RCNN-128                 1.20 M            32.59
RCNN-160                 1.87 M            31.75
EXPERIMENTAL RESULTS: MNIST
Dataset:
• 10 classes
• 70,000 images (60,000 training | 10,000 testing)
• 28 × 28 pixels

Model         # of parameters   Testing error (%)
NIN [33]      0.35 M            0.47
Maxout [17]   0.42 M            0.45
DSN [30]      0.35 M            0.39
RCNN-32       0.08 M            0.42
RCNN-64       0.30 M            0.32
RCNN-96       0.67 M            0.32