Neural Networks
Module 2: Learning with Gradient Descent (numerical optimization)
[Course-map figure: the DATA / REPRESENTATION / LEARNING / PERFORMANCE pipeline, highlighting the numerical-optimization methods of this module: Logistic Regression, Perceptron, Neural Network]
• formulate the problem as a model with parameters
• formulate the error as a mathematical objective
• numerically optimize the parameters for the given objective
• usually an algebraic setup, involving matrices and calculus
• the probabilistic setup (likelihoods) comes next module
Module 2 Objectives / Neural Networks
• perceptron rules
• the neural network idea: philosophy and construction
• NN weights
• backpropagation: training a NN using gradient descent
• NN models, autoencoders
• run a NN autoencoder on a simple problem
The perceptron
The perceptron
• (as with regression) we are looking for a linear classifier: predict $\hat{y} = \mathrm{sign}(\mathbf{w}^T\mathbf{x})$
• the error is different from regression: a weighted sum over the set M of misclassified points, $E_P(\mathbf{w}) = -\sum_{n \in M} y_n\, \mathbf{w}^T\mathbf{x}_n$
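A minimal sketch of this error in NumPy, assuming rows of X are the data points and labels y are in {-1, +1}; the names and data layout are illustrative, not from the slides:

```python
import numpy as np

def perceptron_error(w, X, y):
    """E_P(w) = - sum over misclassified points of y_n * w^T x_n."""
    scores = X @ w
    misclassified = y * scores <= 0          # the set M
    return -np.sum(y[misclassified] * scores[misclassified])
```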
Perceptron - geometry
• the perceptron is a linear (hyperplane) separator
• for simplicity, we transform data points with y = -1 (left panel) into y = +1 (right panel) by reversing their sign, so that every correctly classified point satisfies $\mathbf{w}^T\mathbf{x} > 0$
The perceptron
• to optimize the perceptron error, use gradient descent; the gradient is $\nabla E_P(\mathbf{w}) = -\sum_{n \in M} y_n \mathbf{x}_n$
• single-point (stochastic) update rule: for a misclassified point $(\mathbf{x}_n, y_n)$, $\mathbf{w} \leftarrow \mathbf{w} + \eta\, y_n \mathbf{x}_n$
• batch update: $\mathbf{w} \leftarrow \mathbf{w} + \eta \sum_{n \in M} y_n \mathbf{x}_n$
perceptron update - intuition • perceptron update: the plane (dotted red) normal w (red arrow) moves in the direction of misclassified p1 until p1 is on the correct side.
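A minimal sketch of the stochastic update rule above in NumPy; the learning rate eta, the epoch cap, and the toy data are illustrative assumptions, not from the slides:

```python
import numpy as np

def train_perceptron(X, y, eta=1.0, epochs=100):
    """Stochastic perceptron updates: w <- w + eta * y_n * x_n on mistakes.

    X : (N, d) data matrix (a constant 1 column can be appended as a bias).
    y : (N,) labels in {-1, +1}.
    """
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        mistakes = 0
        for x_n, y_n in zip(X, y):
            if y_n * np.dot(w, x_n) <= 0:      # misclassified (or on the boundary)
                w += eta * y_n * x_n           # move w toward the misclassified point
                mistakes += 1
        if mistakes == 0:                      # no mistakes left: data is separated
            break
    return w

# toy linearly separable example (illustrative)
X = np.array([[1.0, 2.0, 1.0], [2.0, 1.0, 1.0], [-1.0, -2.0, 1.0], [-2.0, -1.0, 1.0]])
y = np.array([1, 1, -1, -1])
w = train_perceptron(X, y)
```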
Perceptron proof of convergence
• if the data is indeed linearly separable, the perceptron will find a separating line in a finite number of updates
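For reference, a hedged statement of the standard convergence bound (Novikoff); the margin $\gamma$ and radius $R$ are symbols introduced here for illustration, not from the slides:

```latex
% Perceptron convergence (Novikoff): if some unit-norm w* separates the data with
% margin gamma > 0 (y_n w*^T x_n >= gamma for all n) and ||x_n|| <= R for all n,
% then the perceptron makes at most (R / gamma)^2 mistakes before converging.
\[
  \#\{\text{updates}\} \;\le\; \left(\frac{R}{\gamma}\right)^{2}
\]
```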
Multilayer perceptrons
Checkpoint: XOR perceptron
• build/explain a 3-layer perceptron that gives the same classification as the logical XOR function
    x1 x2 | XOR(x1, x2)
     0  0 |  0
     0  1 |  1
     1  0 |  1
     1  1 |  0
• your answer is required! Submit via dropbox.
Neural Networks
• a NN is a stack of connected perceptrons
• bottom up: input layer - hidden layer - output layer
• multilayer NNs are very powerful in that they can approximate almost any function, given enough training data
Neural Networks
• each unit first computes a linear combination of its inputs, $a_j = \sum_i w_{ji} x_i$
• it then applies a nonlinear function $f$ (e.g. the logistic $f(a) = 1/(1+e^{-a})$) before outputting a value, $z_j = f(a_j)$
• the three-layer NN output can be expressed mathematically as
  $y_k = f\!\Big(\sum_j w^{(2)}_{kj}\, f\Big(\sum_i w^{(1)}_{ji} x_i\Big)\Big)$
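A minimal forward-pass sketch of that three-layer computation; the layer sizes, the weight names W1/W2, and the choice of the logistic at every layer are illustrative assumptions:

```python
import numpy as np

def logistic(a):
    return 1.0 / (1.0 + np.exp(-a))

def forward(x, W1, W2):
    """Three-layer NN: input -> hidden -> output.

    x  : (d,) input vector
    W1 : (h, d) input-to-hidden weights
    W2 : (k, h) hidden-to-output weights
    """
    hidden = logistic(W1 @ x)       # z_j = f(sum_i w_ji x_i)
    output = logistic(W2 @ hidden)  # y_k = f(sum_j w_kj z_j)
    return hidden, output

# illustrative shapes: 3 inputs, 4 hidden units, 2 outputs
rng = np.random.default_rng(0)
W1 = rng.normal(size=(4, 3))
W2 = rng.normal(size=(2, 4))
_, y = forward(np.array([0.5, -1.0, 2.0]), W1, W2)
```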
Training the NN weights (w)
• for one datapoint, the error is $E = \tfrac{1}{2}\sum_k (y_k - t_k)^2$, where $t_k$ are the targets
• for the set of weights close to the output (hidden-to-output weights $w^{(2)}_{kj}$), the chain rule gives $\frac{\partial E}{\partial w^{(2)}_{kj}} = \delta_k z_j$, with $\delta_k = (y_k - t_k)\, f'(a_k)$
• we obtain the hidden-output weight update rule $w^{(2)}_{kj} \leftarrow w^{(2)}_{kj} - \eta\, \delta_k z_j$
Training the NN weights (w)
• for the first set of weights (input-to-hidden weights $w^{(1)}_{ji}$), the errors are backpropagated through the hidden units: $\delta_j = f'(a_j) \sum_k w^{(2)}_{kj}\, \delta_k$
• giving the input-hidden weight update rule $w^{(1)}_{ji} \leftarrow w^{(1)}_{ji} - \eta\, \delta_j x_i$
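A minimal backpropagation sketch for one datapoint, following the two update rules above and assuming the logistic activation (so $f'(a) = f(a)(1-f(a))$), squared error, and illustrative names W1, W2, eta:

```python
import numpy as np

def logistic(a):
    return 1.0 / (1.0 + np.exp(-a))

def backprop_step(x, t, W1, W2, eta=0.1):
    """One gradient-descent step on a single datapoint (x, t)."""
    # forward pass
    z = logistic(W1 @ x)           # hidden activations
    y = logistic(W2 @ z)           # outputs

    # output deltas: delta_k = (y_k - t_k) * f'(a_k), with f' = y(1-y) for the logistic
    delta_out = (y - t) * y * (1.0 - y)
    # hidden deltas: delta_j = f'(a_j) * sum_k w_kj * delta_k
    delta_hidden = z * (1.0 - z) * (W2.T @ delta_out)

    # weight updates (outer products give dE/dW)
    W2 -= eta * np.outer(delta_out, z)
    W1 -= eta * np.outer(delta_hidden, x)
    return W1, W2
```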
NN training
Autoencoders
• the network is “rotated” - drawn from left to right: input - hidden - output
• input and output are the same values: the hidden layer encodes the input, and the output layer decodes it back to itself
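A minimal autoencoder sketch on a toy problem, in the spirit of the module objective "run a NN autoencoder on a simple problem"; the data, the hidden size of 2, the learning rate, and the epoch count are all illustrative assumptions:

```python
import numpy as np

def logistic(a):
    return 1.0 / (1.0 + np.exp(-a))

def step(x, t, W1, W2, eta):
    # forward pass, then the two backprop updates from the previous slides
    z = logistic(W1 @ x)
    y = logistic(W2 @ z)
    d_out = (y - t) * y * (1 - y)
    d_hid = z * (1 - z) * (W2.T @ d_out)
    W2 -= eta * np.outer(d_out, z)
    W1 -= eta * np.outer(d_hid, x)
    return W1, W2

rng = np.random.default_rng(0)
X = rng.random((100, 4))                        # toy data: 100 points in 4 dimensions
W1 = rng.normal(scale=0.1, size=(2, 4))         # encoder: 4 inputs -> 2 hidden units
W2 = rng.normal(scale=0.1, size=(4, 2))         # decoder: 2 hidden units -> 4 outputs

for _ in range(500):
    for x in X:
        W1, W2 = step(x, x, W1, W2, eta=0.5)    # autoencoder: the target is the input

print(logistic(W1 @ X[0]))                      # learned 2-dimensional encoding of a point
```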
Backpropagation (Tom Mitchell book)