

  1. Announcements • Class size is 170. • Matlab Grader homeworks 1 and 2 (of fewer than 9 homeworks) are due tonight, 22 April; binary graded. 167, 165, and 164 have done the homework. (If you have not done the HW, talk to me/TA!) • Homework 3 due 5 May. • Homework 4 (SVM + DL) due ~24 May. • Jupyter "GPU" homework released Wednesday, due 10 May. • Projects: 39 groups formed. Look at Piazza for help; guidelines are on Piazza. Proposal due May 5; TAs and Peter can approve. • Today: Stanford CNN Lecture 10, CNN and seismics. • Wednesday: Stanford CNN Lecture 11, SVM (Bishop Ch. 7). • Play with the TensorFlow playground before class (http://playground.tensorflow.org); solve the spiral problem.

  2. Recurrent Neural Networks: Process Sequences • One to one: "Vanilla" Neural Network. • One to many: e.g. Image Captioning (image -> sequence of words). • Many to one: e.g. Sentiment Classification (sequence of words -> sentiment). • Many to many: e.g. Machine Translation (sequence of words -> sequence of words). • Many to many (per time step): e.g. Video classification on frame level. [Fei-Fei Li & Justin Johnson & Serena Yeung, Lecture 10, May 4, 2017]

  3. (Vanilla) Recurrent Neural Network • We can process a sequence of vectors x by applying a recurrence formula at every time step: h_t = f_W(h_{t-1}, x_t), where h_t is the new state, h_{t-1} is the old state, x_t is the input vector at time step t, and f_W is some function with parameters W. • The state consists of a single "hidden" vector h. [Fei-Fei Li & Justin Johnson & Serena Yeung, Lecture 10, May 4, 2017]
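To make the recurrence concrete, here is a minimal NumPy sketch (my illustration, not the lecture's code) of one vanilla RNN step, using the standard parameterization h_t = tanh(W_hh h_{t-1} + W_xh x_t) with a linear readout y_t = W_hy h_t; the weight names and the sizes used below are illustrative assumptions.

```python
import numpy as np

def rnn_step(h_prev, x, W_hh, W_xh, W_hy):
    """One vanilla RNN step: update the hidden state and emit an output."""
    h = np.tanh(W_hh @ h_prev + W_xh @ x)    # new hidden state
    y = W_hy @ h                             # output (e.g. class scores)
    return h, y

# Illustrative sizes: 4-dim input, 8-dim hidden state, 3-dim output
rng = np.random.default_rng(0)
W_hh = rng.standard_normal((8, 8)) * 0.01
W_xh = rng.standard_normal((8, 4)) * 0.01
W_hy = rng.standard_normal((3, 8)) * 0.01

h = np.zeros(8)
for x in rng.standard_normal((5, 4)):        # a sequence of 5 input vectors
    h, y = rnn_step(h, x, W_hh, W_xh, W_hy)  # the same W is reused at every step
```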

  4. RNN: Computational Graph • Unrolling the recurrence: the same function f_W (and the same weights W) is re-used at every time step, mapping the initial state h_0 and inputs x_1, x_2, x_3, ... to states h_1, h_2, h_3, ..., h_T. [Figure: unrolled RNN computational graph; Fei-Fei Li & Justin Johnson & Serena Yeung, Lecture 10, May 4, 2017]

  5. RNN: Computational Graph: Many to Many • In the many-to-many case each state h_t produces an output y_t with its own loss L_t (y_1/L_1 through y_T/L_T); all time steps share the same weights W, and the total loss L is accumulated over the sequence. [Figure: unrolled many-to-many graph; Fei-Fei Li & Justin Johnson & Serena Yeung, Lecture 10, May 4, 2017]

  6. Example: Character-level Language Model • The state consists of a single "hidden" vector h. • Vocabulary: [h, e, l, o]. • Example training sequence: "hello". [Fei-Fei Li & Justin Johnson & Serena Yeung, Lecture 10, May 4, 2017]

  7. Example: Character-level Language Model Sampling • Vocabulary: [h, e, l, o]. • At test time, sample characters one at a time and feed each sampled character back into the model as the next input. [Figure: softmax probabilities over the vocabulary at each step and the sampled characters; Fei-Fei Li & Justin Johnson & Serena Yeung, Lecture 10, May 4, 2017]
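A minimal sketch of this sampling loop (again my illustration, not the lecture's code); the vocabulary comes from the slide, while the softmax and the random draw implement the "sample one character, feed it back" idea. The vanilla rnn_step is repeated here so the snippet stands alone, and all sizes and weights are placeholder assumptions.

```python
import numpy as np

vocab = ['h', 'e', 'l', 'o']
char_to_ix = {c: i for i, c in enumerate(vocab)}

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def rnn_step(h_prev, x, W_hh, W_xh, W_hy):
    h = np.tanh(W_hh @ h_prev + W_xh @ x)
    return h, W_hy @ h                        # new state and vocabulary scores

def sample_chars(seed_char, length, W_hh, W_xh, W_hy, rng):
    """Sample `length` characters, feeding each sampled character back as input."""
    h = np.zeros(W_hh.shape[0])
    ix = char_to_ix[seed_char]
    out = []
    for _ in range(length):
        x = np.zeros(len(vocab))
        x[ix] = 1.0                           # one-hot encode the current character
        h, scores = rnn_step(h, x, W_hh, W_xh, W_hy)
        p = softmax(scores)                   # softmax over the vocabulary
        ix = rng.choice(len(vocab), p=p)      # sample, then feed back next step
        out.append(vocab[ix])
    return ''.join(out)

# Untrained, randomly initialized weights just to exercise the loop
rng = np.random.default_rng(0)
W_hh = rng.standard_normal((8, 8)) * 0.01
W_xh = rng.standard_normal((8, 4)) * 0.01
W_hy = rng.standard_normal((4, 8)) * 0.01
print(sample_chars('h', 10, W_hh, W_xh, W_hy, rng))
```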

  8. Truncated Backpropagation Through Time • Run forward and backward passes through chunks of the sequence instead of the whole sequence: carry the hidden state forward in time, but only backpropagate the loss through a limited number of steps. [Fei-Fei Li & Justin Johnson & Serena Yeung, Lecture 10, May 4, 2017]
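A hedged sketch of truncated BPTT using PyTorch (not the lecture's code): the long sequence is processed in fixed-size chunks, the hidden state is carried forward across chunks but detached so gradients only flow within each chunk. The model, chunk size, and data below are placeholders.

```python
import torch
import torch.nn as nn

rnn = nn.RNN(input_size=4, hidden_size=8, batch_first=True)
head = nn.Linear(8, 3)
opt = torch.optim.Adam(list(rnn.parameters()) + list(head.parameters()), lr=1e-3)
loss_fn = nn.MSELoss()

x = torch.randn(1, 1000, 4)        # one long input sequence (stand-in data)
target = torch.randn(1, 1000, 3)   # per-step regression targets (stand-in data)
chunk = 25                         # backpropagate only within 25-step windows

h = None
for t in range(0, x.size(1), chunk):
    xs, ys = x[:, t:t + chunk], target[:, t:t + chunk]
    out, h = rnn(xs, h)            # hidden state carried in from the last chunk
    loss = loss_fn(head(out), ys)
    opt.zero_grad()
    loss.backward()
    opt.step()
    h = h.detach()                 # keep the state, drop the gradient history
```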

  9. Long Short Term Memory (LSTM) • Compared with the vanilla RNN, the LSTM keeps two pieces of state per time step: a hidden state h(t) and a cell state c(t). [Hochreiter and Schmidhuber, "Long Short-Term Memory", Neural Computation 1997; Fei-Fei Li & Justin Johnson & Serena Yeung, Lecture 10, May 4, 2017]

  10. Long Short Term Memory (LSTM) [Hochreiter et al., 1997] • Four gates are computed from the vector from below (x) and the vector from before (h), stacked into a 2h vector and multiplied by W (a 4h x 2h matrix) to give a 4h vector: • i: Input gate (sigmoid), whether to write to the cell. • f: Forget gate (sigmoid), whether to erase the cell. • o: Output gate (sigmoid), how much to reveal the cell. • g: Gate gate (?) (tanh), how much to write to the cell. [Fei-Fei Li & Justin Johnson & Serena Yeung, Lecture 10, May 4, 2017]

  11. Long Short Term Memory (LSTM) [Hochreiter et al., 1997] • The cell and hidden state are updated as c_t = f ⊙ c_{t-1} + i ⊙ g and h_t = o ⊙ tanh(c_t), where ⊙ is elementwise multiplication and the gates are computed from the stacked (h_{t-1}, x_t). [Figure: LSTM cell diagram; Fei-Fei Li & Justin Johnson & Serena Yeung, Lecture 10, May 4, 2017]
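Putting the two LSTM slides together, a minimal NumPy sketch of one LSTM step (my illustration): a single weight matrix W of size 4h x 2h acts on the stacked [h_{t-1}; x_t] to produce the four gates, and the cell/hidden updates follow the equations above. Assuming x has the same size h as the hidden state, as in the slide's dimensions.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h_prev, c_prev, W):
    """One LSTM step: W is (4h x 2h), acting on the stacked [h_prev; x]."""
    hsize = h_prev.size
    gates = W @ np.concatenate([h_prev, x])      # 4h vector
    i = sigmoid(gates[0 * hsize:1 * hsize])      # input gate: whether to write to cell
    f = sigmoid(gates[1 * hsize:2 * hsize])      # forget gate: whether to erase cell
    o = sigmoid(gates[2 * hsize:3 * hsize])      # output gate: how much to reveal cell
    g = np.tanh(gates[3 * hsize:4 * hsize])      # gate gate: how much to write to cell
    c = f * c_prev + i * g                       # new cell state
    h = o * np.tanh(c)                           # new hidden state
    return h, c

# Illustrative sizes: hidden size h = 8, input size also 8 (as on the slide)
rng = np.random.default_rng(1)
W = rng.standard_normal((32, 16)) * 0.01
h = c = np.zeros(8)
for x in rng.standard_normal((5, 8)):
    h, c = lstm_step(x, h, c, W)
```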

  12. Classifying emergent and impulsive seismic noise in continuous seismic waveforms Christopher W Johnson NSF Postdoctoral Fellow UCSD / Scripps Institution of Oceanography

  13. The problem • Identify material failures in the upper 1 km of the crust. • Separate microseismicity (M < 1). • 59-74% of the daily record is not random noise: earthquakes < 1%, air-traffic ~7%, wind ~6%. • Develop new waveform classes: air-traffic, vehicle-traffic, wind, human, instrument, etc. [Figure: daily record vs. local time; Ben-Zion et al., GJI 2015]

  14. The data • 2014 deployment for ~30 days. • 1100 vertical 10 Hz geophones, 10-30 m spacing, 500 samples per second. • 1.6 TB of waveform data. • Experiment design optimized to explore properties and deformation within ~600 m in the shallow crust (upper 1 km): high-resolution velocity structure, imaging the damage zone, microseismic detection. [Ben-Zion et al., GJI 2015]

  15. Earthquake detection • Distributed regional sensor network. • Source locations are random, but expected along major fault lines. • P-wave (compression) & S-wave (shear) travel times. • Grid search / regression to obtain the location. • Requires robust detections for small events. [Map from the IRIS website]

  16. Recent advances in seismic detection • 3-component seismic data (east, north, vertical). • CNN, with each component as an input channel. • Softmax probability over the output classes. [Ross et al., BSSA 2018]
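As a toy illustration only (this is not the Ross et al. architecture), a 1D CNN in PyTorch that takes fixed-length, 3-channel waveform windows (east, north, vertical) and outputs softmax probabilities over three classes such as noise, P-wave, and S-wave; the layer sizes and the 400-sample window length are assumptions.

```python
import torch
import torch.nn as nn

class WavePicker(nn.Module):
    """Toy 3-channel 1D CNN classifier over short waveform windows."""
    def __init__(self, n_classes=3):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv1d(3, 16, kernel_size=7, padding=3), nn.ReLU(), nn.MaxPool1d(2),
            nn.Conv1d(16, 32, kernel_size=5, padding=2), nn.ReLU(), nn.MaxPool1d(2),
            nn.Conv1d(32, 64, kernel_size=3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),
        )
        self.classifier = nn.Linear(64, n_classes)

    def forward(self, x):                      # x: (batch, 3, window_length)
        z = self.features(x).squeeze(-1)       # (batch, 64)
        return self.classifier(z)              # class logits

model = WavePicker()
window = torch.randn(8, 3, 400)               # 8 stand-in windows, 400 samples each
probs = torch.softmax(model(window), dim=1)   # per-class probabilities
```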

  17. Recent advances in seismic detection • Example of a continuous waveform. • Every sample is classified as noise, P-wave, or S-wave. • Outperforms traditional methods based on STA/LTA. [Ross et al., BSSA 2018]

  18. Future directions in seismology • Utilize the accelerometer in everyone's smartphone. [Kong et al., SRL, 2018]

  19. Research Approach and Objectives • Need labeled data. This is >80% of the work! • Earthquakes: arrival times obtained from a borehole seismometer within the array. • Define noise: develop a new algorithm to produce 2 noise labels; signal processing / spectral analysis. • Calculate earthquake SNR; discard events with SNR ~1. • Waveforms to spectrograms: a matrix of complex values; retain amplitude and phase, so each input has 2 channels (this is not a rule, just a choice; see the sketch below).
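A hedged sketch of the "retain amplitude and phase" choice: compute an STFT of a 4 s waveform sampled at 500 samples/s (the rate described for the array) and stack amplitude and phase as two input channels. The STFT window length and overlap below are illustrative assumptions.

```python
import numpy as np
from scipy.signal import stft

fs = 500                                   # samples per second
waveform = np.random.randn(4 * fs)         # stand-in for a 4 s geophone record

# Complex-valued STFT; nperseg/noverlap are illustrative choices
f, t, Z = stft(waveform, fs=fs, nperseg=128, noverlap=96)

amplitude = np.abs(Z)                      # channel 1: spectral amplitude
phase = np.angle(Z)                        # channel 2: spectral phase
x = np.stack([amplitude, phase], axis=0)   # 2-channel input, shape (2, n_freq, n_time)
```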

  20. Deep learning model – Noise Labeling • Labeling is expensive: for 1 day with 1100 geophones, ~1800 CPU hrs on a 3.4 GHz Xeon Gold (1.7 hr per daily record), or ~9000 CPU hrs on a 2.6 GHz Xeon E5 on COMET (a 5x decrease in speed). • Noise training data: 1 s labels, 1100 stations for 3 days; use consecutive 4 s intervals and calculate a spectrogram for each. [Image from Meng, Ben-Zion, and Johnson, in GJI revisions]

  21. Deep learning model – Assemble data • Obtain earthquake arrival times; extract 4 s waveforms starting 1 s before the P-wave arrival. • Vary the start time within ±0.75 s before the P-wave; use each event 5x to retain equal weight with the noise class. • Filter 5-30 Hz; require SNR > 1.5. • Obtain ~480,000 P-wave examples, incorporating spatial variability across the array. • Precalculate 2 noise labels; use 4 s of continuous noise. • The data set contains ~1.2 million labeled wavelets. • Each API has its own input format. • Shuffle the data: every subset must contain the full variability (see the sketches below).
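A hedged sketch of the window assembly described above, assuming one has a waveform trace, its sampling rate, and a P-arrival sample index: cut 4 s windows starting about 1 s before the arrival, jitter the start time within ±0.75 s, reuse each event 5x, bandpass 5-30 Hz, and keep only windows passing an SNR cut. The filter design and the SNR definition are my assumptions.

```python
import numpy as np
from scipy.signal import butter, sosfiltfilt

fs = 500                                                    # samples per second
sos = butter(4, [5, 30], btype="bandpass", fs=fs, output="sos")

def event_windows(trace, p_arrival_idx, n_copies=5, rng=np.random.default_rng(0)):
    """Yield 4 s filtered windows around one P arrival, with jittered start times."""
    filtered = sosfiltfilt(sos, trace)
    for _ in range(n_copies):
        jitter = rng.uniform(-0.75, 0.75)                   # seconds
        start = int(p_arrival_idx - (1.0 + jitter) * fs)
        window = filtered[start:start + 4 * fs]
        noise, signal = window[:fs], window[fs:2 * fs]      # crude pre/post-arrival split
        snr = signal.std() / (noise.std() + 1e-12)
        if snr > 1.5:                                       # keep only clear arrivals
            yield window

trace = np.random.randn(60 * fs)                            # stand-in: one minute of data
windows = list(event_windows(trace, p_arrival_idx=30 * fs))
```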

  22. Deep learning model - Labels • Start with 3 labels: earthquake, random noise, not random noise. • Equal number of examples in each class. • It is possible that the non-random noise contains earthquakes. • STFT: normalize the waveform, retain amplitude & phase, 2-layer input matrix.
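A small sketch of the class-balancing and shuffling steps (my illustration, assuming the three classes are held as separate lists of 2-channel spectrogram arrays): take an equal number of examples from each class, then shuffle so that any subset carries the full variability.

```python
import numpy as np

def balance_and_shuffle(classes, rng=np.random.default_rng(0)):
    """classes: dict label -> list of examples; returns a shuffled, balanced X, y."""
    n = min(len(v) for v in classes.values())        # equal number per class
    X, y = [], []
    for label, examples in classes.items():
        idx = rng.choice(len(examples), size=n, replace=False)
        X.extend(examples[i] for i in idx)
        y.extend([label] * n)
    order = rng.permutation(len(y))                  # shuffle the whole set
    return [X[i] for i in order], [y[i] for i in order]

# Stand-in data: small random "spectrograms" per class
classes = {
    "earthquake":       [np.random.randn(2, 65, 30) for _ in range(120)],
    "random_noise":     [np.random.randn(2, 65, 30) for _ in range(200)],
    "non_random_noise": [np.random.randn(2, 65, 30) for _ in range(150)],
}
X, y = balance_and_shuffle(classes)
```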
