Artificial Neural Networks
Genome 559: Introduction to Statistical and Computational Genomics
Elhanan Borenstein
Some slides adapted from Geoffrey Hinton and Igor Aizenberg
A quick review
• Ab initio gene prediction
  • Parameters:
    • Splice donor sequence model
    • Splice acceptor sequence model
    • Intron and exon length distribution
    • Open reading frame
    • More …
• Markov chain
  • States
  • Transition probabilities
• Hidden Markov Model (HMM)
Machine learning
“A field of study that gives computers the ability to learn without being explicitly programmed.”
Arthur Samuel (1959)
Tasks best solved by learning algorithms
• Recognizing patterns:
  • Facial identities or facial expressions
  • Handwritten or spoken words
• Recognizing anomalies:
  • Unusual sequences of credit card transactions
• Prediction:
  • Future stock prices
  • Predicting phenotype based on markers
  • Genetic association, diagnosis, etc.
Why machine learning?
• It is very hard to write programs that solve problems like recognizing a face.
  • We don’t know what program to write.
  • Even if we had a good idea of how to do it, the program might be horrendously complicated.
• Instead of writing a program by hand, we collect lots of examples for which we know the correct output.
• A machine learning algorithm then takes these examples, trains, and “produces a program” that does the job.
• If we do it right, the program works for new cases as well as the ones we trained it on.
Why neural networks?
• One of those things you always hear about but never know exactly what they actually mean…
• A good example of a machine learning framework
• In and out of fashion…
• An important part of machine learning history
• A powerful framework
The goals of neural computation
1. To understand how the brain actually works
  • Neuroscience is hard!
2. To develop a new style of computation
  • Inspired by neurons and their adaptive connections
  • Very different style from sequential computation
3. To solve practical problems by developing novel learning algorithms
  • Learning algorithms can be very useful even if they have nothing to do with how the brain works
How the brain works (sort of)
• Each neuron receives inputs from many other neurons
• Cortical neurons use spikes to communicate
• Neurons spike once they “aggregate enough stimuli” through input spikes
• The effect of each input spike on the neuron is controlled by a synaptic weight. Weights can be positive or negative
• Synaptic weights adapt so that the whole network learns to perform useful computations
• A huge number of weights can affect the computation in a very short time. Much better bandwidth than a computer.
A typical cortical neuron
• Physical structure:
  • There is one axon that branches
  • There is a dendritic tree that collects input from other neurons
  • Axons typically contact dendritic trees at synapses
• A spike of activity in the axon causes a charge to be injected into the post-synaptic neuron
[Figure: neuron schematic with the axon, cell body, and dendritic tree labeled]
Idealized Neuron
• Basically, a weighted sum!
[Figure: inputs X1, X2, X3 with weights w1, w2, w3 feeding a summation unit Σ that outputs Y]
y = Σ_i x_i·w_i
Adding bias
• The function does not have to pass through the origin
[Figure: the same neuron with a bias term b added to the summation unit]
y = Σ_i x_i·w_i − b
Adding an “activation” function
• The “field” of the neuron goes through an activation function
[Figure: the neuron with summation, bias b, and activation function φ producing output Y]
z = Σ_i x_i·w_i − b  (the field of the neuron)
y = φ(z) = φ(Σ_i x_i·w_i − b)
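A minimal sketch of this idealized neuron in Python (not from the slides; the function name and example values are illustrative):

```python
def neuron_output(x, w, b, activation=lambda z: z):
    """Weighted sum of the inputs, minus the bias, passed through an activation function."""
    z = sum(xi * wi for xi, wi in zip(x, w)) - b   # the "field" of the neuron
    return activation(z)

# Example: three inputs with the default linear (identity) activation
print(neuron_output([1.0, 0.5, -2.0], [0.2, 0.4, 0.1], b=0.3))   # approximately -0.1
```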
Common activation functions
• Linear activation: φ(z) = z
• Logistic activation: φ(z) = 1 / (1 + e^(−αz))
• Hyperbolic tangent activation: φ(u) = tanh(γu) = (1 − e^(−2γu)) / (1 + e^(−2γu))
• Threshold activation: φ(z) = sign(z) = 1 if z ≥ 0, −1 if z < 0
[Figure: plots of the four activation functions]
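These four activations can be coded directly from the formulas above (setting α and γ to 1 by default is an assumption; the slides leave them as free parameters):

```python
import math

def linear(z):
    return z

def logistic(z, alpha=1.0):
    return 1.0 / (1.0 + math.exp(-alpha * z))

def tanh_activation(u, gamma=1.0):
    # equivalent to (1 - exp(-2*gamma*u)) / (1 + exp(-2*gamma*u))
    return math.tanh(gamma * u)

def threshold(z):
    return 1 if z >= 0 else -1

for f in (linear, logistic, tanh_activation, threshold):
    print(f.__name__, f(0.5))
```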
McCulloch-Pitts neurons
• Introduced in 1943 (and influenced von Neumann!)
• Threshold activation function
• Restricted to binary inputs and outputs
z = Σ_i x_i·w_i − b;  y = 1 if z > 0, 0 otherwise

X1 AND X2 (w1 = 1, w2 = 1, b = 1.5):
X1 X2 | y
 0  0 | 0
 0  1 | 0
 1  0 | 0
 1  1 | 1

X1 OR X2 (w1 = 1, w2 = 1, b = 0.5):
X1 X2 | y
 0  0 | 0
 0  1 | 1
 1  0 | 1
 1  1 | 1
Beyond binary neurons
• The same AND and OR neurons can be viewed geometrically: each one draws a straight line in the (X1, X2) plane that separates the inputs classified as 1 from those classified as 0
[Figure: the four points (0,0), (0,1), (1,0), (1,1) plotted twice, with the separating line for AND and the separating line for OR]
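A quick check of the two threshold neurons, using the weights and biases given on the slide (the helper name is illustrative):

```python
def mcculloch_pitts(x, w, b):
    """Binary threshold neuron: fires (1) if the weighted sum exceeds the bias."""
    z = sum(xi * wi for xi, wi in zip(x, w)) - b
    return 1 if z > 0 else 0

AND = dict(w=[1, 1], b=1.5)
OR  = dict(w=[1, 1], b=0.5)

for x1 in (0, 1):
    for x2 in (0, 1):
        print(x1, x2,
              "AND:", mcculloch_pitts([x1, x2], **AND),
              "OR:",  mcculloch_pitts([x1, x2], **OR))
```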
Beyond binary neurons
• A general classifier
  • The weights determine the slope
  • The bias determines the distance from the origin
• But … how would we know how to set the weights and the bias?
  (note: the bias can be represented as an additional input)
[Figure: a separating line in the (X1, X2) plane]
Perceptron learning
• Use a “training set” and let the perceptron learn from its mistakes
• Training set: a set of input data for which we know the correct answer/classification!
• Learning principle: whenever the perceptron is wrong, make a small correction to the weights in the right direction.
• Note: supervised learning
  • Training set vs. testing set
Perceptron learning
1. Initialize the weights and threshold (e.g., use small random values).
2. Take an input X and its desired output d from the training set.
3. Calculate the actual output, y.
4. Adapt the weights: w_i(t+1) = w_i(t) + α(d − y)·x_i for all weights, where α is the learning rate (don’t overshoot).
Repeat steps 3 and 4 until the error d − y is smaller than a user-specified error threshold, or a predetermined number of iterations has been completed.
If a solution exists, the algorithm is guaranteed to converge!
Linear separability
• What about the XOR function?
• Or other non-linearly separable classification problems, such as:
[Figure: example datasets that no single straight line can separate]
Multi-layer feed-forward networks
• We can connect several neurons, where the output of some is the input of others.
Solving the XOR problem
• Only 3 neurons are required!!!
[Figure: a two-layer network. Hidden neuron 1: weights +1, +1, b = 1.5 (X1 AND X2); hidden neuron 2: weights +1, +1, b = 0.5 (X1 OR X2); output neuron Y: weight −1 from the AND neuron, +1 from the OR neuron, b = 0.5]
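A sketch of that 3-neuron network, using the weights and biases read off the diagram (the step function and variable names are illustrative):

```python
def step(z):
    return 1 if z > 0 else 0

def xor_net(x1, x2):
    # Hidden layer: one AND-like neuron and one OR-like neuron
    h_and = step(1 * x1 + 1 * x2 - 1.5)
    h_or  = step(1 * x1 + 1 * x2 - 0.5)
    # Output neuron: fires when OR is on but AND is off
    return step(-1 * h_and + 1 * h_or - 0.5)

for x1 in (0, 1):
    for x2 in (0, 1):
        print(x1, x2, "->", xor_net(x1, x2))   # prints the XOR truth table
```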
In fact…
• With one hidden layer you can solve ANY classification task!
• But… how do you find the right set of weights?
  (note: we only have an error delta for the output neuron)
• This problem caused this framework to fall out of favor… until…
Back-propagation
Main idea:
• First propagate a training input data point forward to get the calculated output
• Compare the calculated output with the desired output to get the error (delta)
• Now, propagate the error back in the network to get an error estimate for each neuron
• Update weights accordingly
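A compact sketch of these four steps for a small 2-2-1 network with logistic activations, trained on XOR. The network size, learning rate, epoch count, and squared-error deltas are assumptions, not part of the slides, and with an unlucky initialization the training can still get stuck in a local minimum:

```python
import math, random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

random.seed(0)
W1 = [[random.uniform(-1, 1) for _ in range(2)] for _ in range(2)]  # input -> hidden weights
b1 = [0.0, 0.0]                                                     # hidden biases (added here, not subtracted)
W2 = [random.uniform(-1, 1) for _ in range(2)]                      # hidden -> output weights
b2 = 0.0
data = [([0, 0], 0), ([0, 1], 1), ([1, 0], 1), ([1, 1], 0)]         # XOR
alpha = 0.5

for epoch in range(10000):
    for x, d in data:
        # Forward pass: propagate the input to get the calculated output
        h = [sigmoid(sum(W1[j][i] * x[i] for i in range(2)) + b1[j]) for j in range(2)]
        y = sigmoid(sum(W2[j] * h[j] for j in range(2)) + b2)
        # Error (delta) at the output, then propagated back to the hidden layer
        delta_out = (d - y) * y * (1 - y)
        delta_hid = [delta_out * W2[j] * h[j] * (1 - h[j]) for j in range(2)]
        # Update the weights accordingly
        for j in range(2):
            W2[j] += alpha * delta_out * h[j]
            b1[j] += alpha * delta_hid[j]
            for i in range(2):
                W1[j][i] += alpha * delta_hid[j] * x[i]
        b2 += alpha * delta_out

for x, d in data:
    h = [sigmoid(sum(W1[j][i] * x[i] for i in range(2)) + b1[j]) for j in range(2)]
    y = sigmoid(sum(W2[j] * h[j] for j in range(2)) + b2)
    print(x, "->", round(y, 2), "(desired:", d, ")")
```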
Types of connectivity
• Feed-forward networks
  • Compute a series of transformations
  • Typically, the first layer is the input and the last layer is the output
• Recurrent networks
  • Include directed cycles in their connection graph
  • Complicated dynamics
  • Memory
  • More biologically realistic?
[Figure: a layered feed-forward network with input units, hidden units, and output units]
Computational representation of networks
Example network: nodes A, B, C, D with edges A→C, C→B, D→B, D→C

• List of edges: (ordered) pairs of nodes
  [(A,C), (C,B), (D,B), (D,C)]

• Connectivity matrix:
     A B C D
  A  0 0 1 0
  B  0 0 0 0
  C  0 1 0 0
  D  0 1 1 0

• Object oriented: each node object stores its name and pointers to its neighbor nodes
  (e.g., A points to C; C points to B; D points to B and C)

• Which is the most useful representation?
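The three representations side by side in Python (a sketch; plain lists and dictionaries stand in for the slide's node objects):

```python
# Example network from the slide: A->C, C->B, D->B, D->C

# 1. Edge list: ordered pairs of nodes
edges = [("A", "C"), ("C", "B"), ("D", "B"), ("D", "C")]

# 2. Connectivity (adjacency) matrix: rows = source node, columns = target node
nodes = ["A", "B", "C", "D"]
index = {n: i for i, n in enumerate(nodes)}
matrix = [[0] * len(nodes) for _ in nodes]
for src, dst in edges:
    matrix[index[src]][index[dst]] = 1

# 3. "Object oriented": each node keeps pointers to its neighbors
neighbors = {n: [] for n in nodes}
for src, dst in edges:
    neighbors[src].append(dst)

print(matrix)      # [[0, 0, 1, 0], [0, 0, 0, 0], [0, 1, 0, 0], [0, 1, 1, 0]]
print(neighbors)   # {'A': ['C'], 'B': [], 'C': ['B'], 'D': ['B', 'C']}
```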