Transactions of the Korean Nuclear Society Virtual Spring Meeting
July 9-10, 2020

Identification of Initial Events in Nuclear Power Plants Using Machine Learning Methods

Young Do Koo a, Hye Seon Jo a, Kwae Hwan Yoo a, Man Gyun Na a*
a Dept. of Nuclear Engineering, Chosun Univ., 309 Pilmun-daero, Dong-gu, Gwangju, 61452
*Corresponding author: magyna@chosun.ac.kr

1. Introduction

When an event that goes beyond normal operating conditions, such as a transient, happens in nuclear power plants (NPPs), accurately recognizing and identifying it is essential for establishing the actions needed to mitigate the undesired state early. In particular, initial identification of events such as a design basis accident (DBA) in the NPPs can be one of the critical requisites for preventing progression to a severe accident. However, correctly identifying the location or type of an accident is not easy because of the large number of accident-related instrumentation signals that must be monitored. Therefore, this study was performed to develop models that accurately identify 9 events in the initial period after an accident occurs, and artificial intelligence (AI) techniques were used to build the models. Among the various machine learning methods based on artificial neural network (ANN) structures, long short-term memory (LSTM) [1] and gated recurrent unit (GRU) [2], both of which have a recurrent neural network structure, were utilized in this study. The main reason these methods were applied is that the recurrent neural network structure transfers information from previous steps to the current and next steps better than methods with a typical feedforward network structure (e.g., deep neural network (DNN) or convolutional neural network (CNN) [3]). In addition, since an event identification model using the DNN was developed and compared with a support vector machine (SVM) model in the previous study [4], the LSTM and GRU models were developed here to check the event identification performance of these newly applied methods.

Thus, identification results for the 9 initial events obtained with the LSTM and GRU models are shown in this paper. Furthermore, ongoing work on event identification through clustering with an unsupervised learning method, as another AI technique, is briefly indicated in the paper.

2. Machine Learning Methods Based on Recurrent Neural Network Structure

The recurrent neural network (RNN) [3], the basic framework of the LSTM and GRU, can be simply explained as a dynamic neural network with input and output layers and a hidden layer with a loop, as shown in Fig. 1. That is, the hidden state at the current step is affected by the hidden state from the previous step, as in Eq. (1), and the value produced at the output layer is computed from the hidden state, as in Eq. (2). Hence, the RNN and RNN-based methods are known to be effective particularly for time-series analysis.

[Fig. 1. Simple recurrent neural networks: the RNN cell maps input x_t and previous hidden state h_{t-1} to output \hat{y}_t]

h_t = \phi(W_h x_t + U_h h_{t-1} + b_h)                      (1)

where W and U are weights, b is a bias, and \phi is an activation function.

\hat{y}_t = \phi(W_y h_t + b_y)                              (2)
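To make Eqs. (1) and (2) concrete, the following is a minimal NumPy sketch of a single RNN step; the dimensions, the choice of tanh for \phi, and all variable names are illustrative assumptions rather than details taken from the paper.

import numpy as np

def rnn_step(x_t, h_prev, W_h, U_h, b_h, W_y, b_y):
    h_t = np.tanh(W_h @ x_t + U_h @ h_prev + b_h)   # Eq. (1), phi = tanh (assumed)
    y_t = np.tanh(W_y @ h_t + b_y)                  # Eq. (2), same phi (assumed)
    return h_t, y_t

# Illustrative dimensions: 4 input signals, 8 hidden units, 3 outputs.
rng = np.random.default_rng(0)
W_h = rng.normal(size=(8, 4)); U_h = rng.normal(size=(8, 8)); b_h = np.zeros(8)
W_y = rng.normal(size=(3, 8)); b_y = np.zeros(3)

h = np.zeros(8)                        # initial hidden state
for x_t in rng.normal(size=(5, 4)):    # a 5-step input sequence
    h, y = rnn_step(x_t, h, W_h, U_h, b_h, W_y, b_y)

The loop is where the recurrence appears: each step reuses the hidden state produced by the previous step, which is exactly the property the paper exploits for time-series accident signals.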
Notwithstanding this characteristic of the RNN, the basic RNN can encounter the long-term dependency problem, which denotes that the network cannot be effectively trained using hidden states from steps far before the current step as its hidden layer gets denser. Accordingly, the LSTM and GRU were presented to overcome this problem. These two variations have differently revised hidden cells in the hidden layers of the basic RNN framework, and these specific hidden cells are the main factor in solving the aforementioned problem. Fig. 2 indicates a hidden cell of the basic RNN.

[Fig. 2. A hidden cell of RNN: x_t and h_{t-1} pass through the activation \phi to produce h_t]

2.1 Long Short-Term Memory

A hidden cell of the LSTM is called a "memory cell" and is indicated in Fig. 3. The key to the LSTM is the cell state in the memory cells, which acts like a conveyor belt: it continuously transfers information from the current step to the next steps and prevents the gradient from vanishing. The LSTM cell at the current step is updated and controlled by three types of gates: forget, input, and output. Namely, information from the previous step is selectively propagated to the next cells by removing or adding information through these gates.

[Fig. 3. A memory cell in LSTM: the forget, input, and output gates (\sigma) control the cell state C_t and hidden state h_t]

Eqs. (3)-(5) indicate the forget, input, and output gates that act on the cell state in the LSTM cells, respectively. Each gate maps the hidden state of the previous step to values between 0 and 1 using the sigmoid function, which determines how much information is applied to the cell. If the value output by the sigmoid function is 1, the information is used in full; on the contrary, the information fades as the value gets closer to 0. The cell state and hidden state in the LSTM are calculated as Eqs. (6) and (7), respectively.

f_t = \sigma(W_f x_t + U_f h_{t-1} + b_f)                    (3)

i_t = \sigma(W_i x_t + U_i h_{t-1} + b_i)                    (4)

o_t = \sigma(W_o x_t + U_o h_{t-1} + b_o)                    (5)

C_t = f_t \ast C_{t-1} + i_t \ast \tilde{C}_t                (6)

h_t = o_t \ast \phi(C_t)                                     (7)

where \ast denotes the element-wise product (Hadamard product) and \tilde{C}_t is the candidate cell state at the current step.
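The gate equations translate directly into code. The following single-step sketch assumes, as is conventional, sigmoid gates and tanh for \phi, and writes out the candidate cell state \tilde{C}_t in its standard form since the text does not give it explicitly; the parameter dict p and all shapes are hypothetical.

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, p):
    f_t = sigmoid(p["W_f"] @ x_t + p["U_f"] @ h_prev + p["b_f"])  # Eq. (3), forget gate
    i_t = sigmoid(p["W_i"] @ x_t + p["U_i"] @ h_prev + p["b_i"])  # Eq. (4), input gate
    o_t = sigmoid(p["W_o"] @ x_t + p["U_o"] @ h_prev + p["b_o"])  # Eq. (5), output gate
    # Candidate cell state (standard LSTM form; assumed, not written out in the text).
    c_tilde = np.tanh(p["W_c"] @ x_t + p["U_c"] @ h_prev + p["b_c"])
    c_t = f_t * c_prev + i_t * c_tilde   # Eq. (6); * is the Hadamard product
    h_t = o_t * np.tanh(c_t)             # Eq. (7) with phi = tanh (assumed)
    return h_t, c_t

# Illustrative parameters: 4 inputs, 8 hidden units.
rng = np.random.default_rng(1)
p = {}
for g in ("f", "i", "o", "c"):
    p[f"W_{g}"] = rng.normal(size=(8, 4))
    p[f"U_{g}"] = rng.normal(size=(8, 8))
    p[f"b_{g}"] = np.zeros(8)

h, c = np.zeros(8), np.zeros(8)
for x_t in rng.normal(size=(5, 4)):
    h, c = lstm_step(x_t, h, c, p)

Note how the conveyor-belt role of the cell state shows up in Eq. (6): c_t is a gated sum rather than a repeated nonlinear transformation, which is what lets the gradient pass through many steps without vanishing.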

2.2 Gated Recurrent Unit

The GRU, one of the well-known variations of the LSTM, is a method that simplifies the LSTM; its hidden layers contain hidden cells of the form shown in Fig. 4. The main characteristic of the GRU is that it has fewer parameters than the LSTM even though it is otherwise similar. For instance, only two types of internal gates are in the hidden cells of the GRU: the update gate, which can be considered an integration of the forget and input gates of the LSTM, and the reset gate. Moreover, the cell state and the hidden state are combined into one hidden state.

[Fig. 4. A hidden cell of GRU: the reset and update gates (\sigma) jointly produce the hidden state h_t]

Eqs. (8) and (9) denote the update and reset gates of the GRU. The former determines the update ratio using information from the previous and current steps, and the latter resets the information from the previous step. As in the LSTM cells, the sigmoid function is generally utilized for the two gates in the GRU cells. The hidden state in the GRU is calculated from the output of the update gate and a candidate at the current step, as in Eq. (10).

u_t = \sigma(W_u x_t + U_u h_{t-1} + b_u)                    (8)

r_t = \sigma(W_r x_t + U_r h_{t-1} + b_r)                    (9)

h_t = (1 - u_t) \ast h_{t-1} + u_t \ast \tilde{h}_t          (10)

where the candidate result, \tilde{h}_t, is calculated using Eq. (11) as follows:

\tilde{h}_t = \phi(W_h x_t + U_h (r_t \ast h_{t-1}))         (11)
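A corresponding single-step sketch of Eqs. (8)-(11) follows, under the same illustrative assumptions as above (sigmoid gates, tanh for \phi, hypothetical parameter dict p and shapes).

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gru_step(x_t, h_prev, p):
    u_t = sigmoid(p["W_u"] @ x_t + p["U_u"] @ h_prev + p["b_u"])  # Eq. (8), update gate
    r_t = sigmoid(p["W_r"] @ x_t + p["U_r"] @ h_prev + p["b_r"])  # Eq. (9), reset gate
    # Eq. (11): the reset gate masks the previous hidden state in the candidate.
    h_tilde = np.tanh(p["W_h"] @ x_t + p["U_h"] @ (r_t * h_prev))
    return (1.0 - u_t) * h_prev + u_t * h_tilde                   # Eq. (10)

# Illustrative parameters: 4 inputs, 8 hidden units.
rng = np.random.default_rng(2)
p = {f"W_{g}": rng.normal(size=(8, 4)) for g in ("u", "r", "h")}
p.update({f"U_{g}": rng.normal(size=(8, 8)) for g in ("u", "r", "h")})
p.update({f"b_{g}": np.zeros(8) for g in ("u", "r")})

h = np.zeros(8)
for x_t in rng.normal(size=(5, 4)):
    h = gru_step(x_t, h, p)

Counting the matrices makes the parameter saving concrete: this step uses six weight matrices (W_u, U_u, W_r, U_r, W_h, U_h) against eight in the LSTM sketch above, and no separate cell state is carried.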

3. Data of Postulated Accident Situation in NPPs

3.1 Data Acquisition by Simulating Accidents

The identification models of initial events in the NPPs developed using the LSTM and GRU are based on accident symptoms, the same basis as that of the model established in the previous study [4]. In other words, the