Implementing a Multilayer Perceptron from Scratch Implementing a - PowerPoint PPT Presentation

Mar 30, 2024 •304 likes •390 views

Implementing a Multilayer Perceptron from Scratch Implementing a Multilayer Perceptron from Scratch In [1]: % matplotlib inline import d2l from mxnet import nd from mxnet.gluon import loss as gloss Load the Fashion-MNIST data set Load the

Implementing a Multilayer Perceptron from Scratch Implementing a Multilayer Perceptron from Scratch In [1]: % matplotlib inline import d2l from mxnet import nd from mxnet.gluon import loss as gloss
Load the Fashion-MNIST data set Load the Fashion-MNIST data set In [2]: batch_size = 256 train_iter, test_iter = d2l.load_data_fashion_mnist(batch_size)
Initialize Model Parameters Initialize Model Parameters In [3]: num_inputs, num_outputs, num_hiddens = 784, 10, 256 W1 = nd.random.normal(scale=0.01, shape=(num_inputs, num_hiddens)) b1 = nd.zeros(num_hiddens) W2 = nd.random.normal(scale=0.01, shape=(num_hiddens, num_outputs)) b2 = nd.zeros(num_outputs) params = [W1, b1, W2, b2] for param in params: param.attach_grad()
Activation Function Activation Function In [4]: def relu(X): return nd.maximum(X, 0)
The model The model In [5]: def net(X): X = X.reshape((-1, num_inputs)) H = relu(nd.dot(X, W1) + b1) return nd.dot(H, W2) + b2
The Loss Function The Loss Function In [6]: loss = gloss.SoftmaxCrossEntropyLoss()
Training Training In [7]: num_epochs, lr = 10, 0.5 d2l.train_ch3(net, train_iter, test_iter, loss, num_epochs, batch_size, params, lr) epoch 1, loss 0.7868, train acc 0.708, test acc 0.822 epoch 2, loss 0.4831, train acc 0.820, test acc 0.853 epoch 3, loss 0.4295, train acc 0.842, test acc 0.859 epoch 4, loss 0.3930, train acc 0.856, test acc 0.865 epoch 5, loss 0.3663, train acc 0.866, test acc 0.869 epoch 6, loss 0.3520, train acc 0.870, test acc 0.871 epoch 7, loss 0.3368, train acc 0.876, test acc 0.870 epoch 8, loss 0.3236, train acc 0.880, test acc 0.878 epoch 9, loss 0.3129, train acc 0.886, test acc 0.883 epoch 10, loss 0.3067, train acc 0.886, test acc 0.882
Evaluation Evaluation In [8]: for X, y in test_iter: break true_labels = d2l.get_fashion_mnist_labels(y.asnumpy()) pred_labels = d2l.get_fashion_mnist_labels(net(X).argmax(axis=1).asnumpy()) titles = [truelabel + ' \n ' + predlabel for truelabel, predlabel in zip(true_labels , pred_labels)] d2l.show_fashion_mnist(X[0:9], titles[0:9])

Recommend

Introduction to Machine Learning Multilayer Perceptron Barnabs Pczos The Multilayer

Introduction to Machine Learning Multilayer Perceptron Barnabs Pczos The Multilayer Perceptron 2 Multilayer Perceptron 3 ALVINN: AN AUTONOMOUS LAND VEHICLE IN A NEURAL NETWORK Dean A. Pomerleau, Carnegie Mellon University, 1989

655 views • 44 slides

Scratch Brainstorming CLIMATE CHANGE CODING LESSON GRADE 10 Meet Scratch Scratch is a coding

Scratch Brainstorming CLIMATE CHANGE CODING LESSON GRADE 10 Meet Scratch Scratch is a coding platform for all ages and subjects. Students can use Scratch to learn 21 st century skills while coding their own interactive stories,

557 views • 11 slides

Scratch Brainstorming WATER SYSTEMS CODING LESSON GRADE 8 Meet Scratch Scratch is a coding

Scratch Brainstorming WATER SYSTEMS CODING LESSON GRADE 8 Meet Scratch Scratch is a coding platform for all ages and subjects. Students can use Scratch to learn 21 st century skills while coding their own interactive stories, animations,

395 views • 11 slides

CS 472 - Perceptron 1 Basic Neuron CS 472 - Perceptron 2 Expanded Neuron CS 472 - Perceptron

CS 472 - Perceptron 1 Basic Neuron CS 472 - Perceptron 2 Expanded Neuron CS 472 - Perceptron 3 Perceptron Learning Algorithm l First neural network learning model in the 1960s l Simple and limited (single layer models) l Basic concepts

571 views • 53 slides

Introduction to Scratch Programming Tiffany Snell Palm Beach County Library System What is

Introduction to Scratch Programming Tiffany Snell Palm Beach County Library System What is Scratch? Website: scratch.mit.edu Why Scratch and Not Python, JavaScript, or C? Scratch - Python - print("Hello World!")

457 views • 23 slides

Machine Learning A Geometric Approach Linear Classification: Perceptron Professor Liang Huang

Machine Learning A Geometric Approach Linear Classification: Perceptron Professor Liang Huang some slides from Alex Smola (CMU) Perceptron Frank Rosenblatt deep learning multilayer perceptron perceptron linear regression SVM CRF

698 views • 47 slides

The Perceptron Algorithm Machine Learning 1 Some slides based on lectures from Dan Roth, Avrim

The Perceptron Algorithm Machine Learning 1 Some slides based on lectures from Dan Roth, Avrim Blum and others Outline The Perceptron Algorithm Variants of Perceptron Perceptron Mistake Bound 2 Where are we? The Perceptron

559 views • 44 slides

Structured Perceptron CMSC 470 Marine Carpuat POS tagging Sequence labeling with the perceptron

Sequence Labeling with the Structured Perceptron CMSC 470 Marine Carpuat POS tagging Sequence labeling with the perceptron Sequence labeling problem Structured Perceptron Input: Perceptron algorithm can be used for sequence labeling

654 views • 13 slides

Applied Machine Learning Applied Machine Learning Multilayer Perceptron Siamak Ravanbakhsh

Applied Machine Learning Applied Machine Learning Multilayer Perceptron Siamak Ravanbakhsh Siamak Ravanbakhsh COMP 551 COMP 551 (winter 2020) (winter 2020) 1 Learning objectives Learning objectives multilayer percepron: model different

1.54k views • 97 slides

Perceptrons Introduction: Neural Networks 1 The Perceptron 2 Using Perceptrons Perceptrons

Introduction: Neural Networks The Perceptron Multilayer Perceptrons Training MLPs Applying MLPs Introduction: Neural Networks The Perceptron Multilayer Perceptrons Training MLPs Applying MLPs Perceptrons Introduction: Neural Networks 1

761 views • 11 slides

Position in Scratch Position on the Stage! In Scratch, the sprites perform the commands you give

Position in Scratch Position on the Stage! In Scratch, the sprites perform the commands you give them on the stage . You can control the position of the sprites on the stage. Position on the Stage! In Scratch, the stage is actually a big X/Y

310 views • 11 slides

Introduction to Machine Learning Perceptron Barnabs Pczos Contents History of Artificial

Introduction to Machine Learning Perceptron Barnabs Pczos Contents History of Artificial Neural Networks Definitions: Perceptron, Multi-Layer Perceptron Perceptron algorithm 2 Short History of Artificial Neural Networks 3

836 views • 42 slides

How to Train Your Perceptron 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University

PERCEPTRON How to Train Your Perceptron 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University Lets start easy worlds smallest perceptron! w f y x y = wx (a.k.a. line equation, linear regression) Learning a Perceptron

656 views • 20 slides

The Perceptron Mistake Bound Machine Learning 1 Some slides based on lectures from Dan Roth,

The Perceptron Mistake Bound Machine Learning 1 Some slides based on lectures from Dan Roth, Avrim Blum and others Where are we? The Perceptron Algorithm Variants of Perceptron Perceptron Mistake Bound 2 Convergence Convergence

692 views • 33 slides

New CDE Type RA 125 C Radial, Multilayer Film Capacitors For high-frequency RFI/EMI

New CDE Type RA 125 C Radial, Multilayer Film Capacitors For high-frequency RFI/EMI suppression, ignition pulse forming and motor controllers. RA Multilayer Film Capacitors High-performance step-up from multilayer ceramic capacitors.

463 views • 9 slides

LCA OF BIODEGRADABLE LCA OF BIODEGRADABLE MULTILAYER FILM FROM MULTILAYER FILM FROM BIOPOLYMERS

3rd International Conference on Life Cycle Management University of Zurich at Irchel, August 27-29, 2007 LCA OF BIODEGRADABLE LCA OF BIODEGRADABLE MULTILAYER FILM FROM MULTILAYER FILM FROM BIOPOLYMERS BIOPOLYMERS D. Garran 1 , R. Vidal 1 ,

581 views • 22 slides

Nick Gnedin The Brief History of Time End of inflation: Today: z=10 27 z=0 t=10 -36 s t=13.7

Faintest Galaxies in the JWST Era Nick Gnedin The Brief History of Time End of inflation: Today: z=10 27 z=0 t=10 -36 s t=13.7 Gyr The Brief History of Time ionized neutral ionized RE-IONIZATION What We Know Now : Galaxy Luminosity

474 views • 20 slides

Design and Architectures for Embedded Systems Prof. Dr. J. Henkel Henkel Prof. Dr. J. CES CES

Design and Architectures for Embedded Systems Prof. Dr. J. Henkel Henkel Prof. Dr. J. CES CES - - Chair for Embedded Systems Chair for Embedded Systems University of University of Karlsruhe Karlsruhe, Germany , Germany Today:

342 views • 22 slides

INTRODUCTION TO CALD (Computer Aided Logic Design) Introduction Late 1960s-early 80s

Robert Betz: 97 Department of Electrical and Computer Engineering INTRODUCTION TO CALD (Computer Aided Logic Design) Introduction Late 1960s-early 80s most logic design carried out using the TTL (transistor transistor logic) or

726 views • 25 slides

Boundedness and absoluteness of some dynamical invariants Krzysztof Krupi nski (joint work

Boundedness and absoluteness of some dynamical invariants Krzysztof Krupi nski (joint work with Ludomir Newelski and Pierre Simon) Instytut Matematyczny Uniwersytet Wroc lawski Paris March 26, 2018 Krzysztof Krupi nski Boundedness

476 views • 43 slides

Machine Learning Fall 2017 Structured Prediction (structured perceptron, HMM, structured SVM)

Machine Learning Fall 2017 Structured Prediction (structured perceptron, HMM, structured SVM) Professor Liang Huang (Chap. 17 of CIML) Structured Prediction x x the man bit the dog the man bit the dog x x DT NN

672 views • 27 slides

Coordinating distributed systems part II Marko Vukoli Distributed Systems and Cloud Computing

Coordinating distributed systems part II Marko Vukoli Distributed Systems and Cloud Computing Last Time Coordinating distributed systems part I Zookeeper At the heart of Zookeeper is the ZAB atomic broadcast protocol Today

682 views • 29 slides

Genuinely entangled subspaces M. Demianowicz ( joint work with R. Augusiak ) partial support:

Genuinely entangled subspaces M. Demianowicz ( joint work with R. Augusiak ) partial support: National Science Centre (NCN, Poland) Department of Atomic, Molecular, and Optical Physics Faculty of Applied Physics and Mathematics Gdask

279 views • 25 slides

VHDL VHDL - Flaxer Eli Ch 2 - 1 Programmable Logic Review (last chapter) VHDL and

Chapter 2 Programmable Logic VHDL VHDL - Flaxer Eli Ch 2 - 1 Programmable Logic Review (last chapter) VHDL and programmable logic = best current solution for rapid design, implementation, testing, and documenting of complex digital

463 views • 11 slides