Recurrent Neural Network Attention Mechanisms for Interpretable System Log Anomaly Detection Presented By: Akash Kulkarni
System Log Analysis is complicated
1. Log sources generate TBs of data per day
2. Labelled data is scarce or unbalanced
3. Actionable information may be obscured by complex relationships across logging sources
Hence the need for aided human monitoring and assessment.
Unsupervised RNN language models
• Model the distribution of normal events in system logs (learn complex relationships buried in logs)
• No need for labelled data
• No feature engineering required (deep learning learns significant features automatically)
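The anomaly-detection idea above can be sketched as scoring a log line by its negative log-likelihood under a language model of normal behaviour. The stub model, vocabulary, and probabilities below are hypothetical stand-ins; in the paper this role is played by a trained LSTM.

```python
import math

# Toy stand-in for a trained language model over a tiny vocabulary of
# log tokens (all names and probabilities here are hypothetical).
VOCAB = ["login", "read", "write", "logout", "error"]

def next_token_probs(context):
    # A real RNN would condition on the full context; this stub just
    # treats "error" as rare and everything else as common.
    probs = {tok: 0.24 for tok in VOCAB if tok != "error"}
    probs["error"] = 0.04
    return probs

def anomaly_score(tokens):
    """Negative log-likelihood of a tokenized log line.
    Higher score = less probable under normal behaviour = more anomalous."""
    nll = 0.0
    for t in range(1, len(tokens)):
        nll -= math.log(next_token_probs(tokens[:t])[tokens[t]])
    return nll

normal_line = ["login", "read", "write", "logout"]
odd_line = ["login", "error", "error", "error"]
```

Because no labels are needed, the score threshold for flagging a line can be set from the empirical score distribution alone.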
Language Modelling
• Each log line consists of a sequence of T tokens: x^(1:T) = x^(1), x^(2), ..., x^(T), with each token x^(t) ∈ V
• A language model (such as an RNN) assigns probabilities to sequences: P(x^(1:T)) = ∏_{t=1}^{T} P(x^(t) | x^(1:t-1))
• Tokenization can be word-based or character-based.
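The chain-rule factorization above can be sketched directly in code. The bigram-style probability table below is a hypothetical stand-in for a trained RNN's conditional distributions.

```python
# Chain rule: P(x^(1:T)) = prod over t of P(x^(t) | x^(1:t-1)),
# using a hypothetical bigram-style table in place of a trained RNN.
def p_next(history, token):
    table = {
        ("<s>", "user"): 0.9,
        ("user", "login"): 0.7,
        ("login", "failed"): 0.1,
    }
    return table.get((history[-1], token), 0.01)  # small default mass

def sequence_prob(tokens):
    prob = 1.0
    for t in range(1, len(tokens)):
        prob *= p_next(tokens[:t], tokens[t])
    return prob

# 0.9 * 0.7 * 0.1 = 0.063
print(sequence_prob(["<s>", "user", "login", "failed"]))
```

An RNN differs from this table only in how it computes the conditionals: it summarizes the full history x^(1:t-1) in its hidden state rather than the last token alone.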
Cyber Anomaly Language Models
1. Event Model (EM) – applies a standard LSTM to log lines
2. Bidirectional Event Model (BEM)
Cyber Anomaly Language Models 3. Tiered Language Model (T-EM or T-BEM)
Attention
• Key matrix: K = tanh(W H), where H is the matrix of hidden states
• Weights: α = softmax(qᵀK), with learned query vector q
• Attention: a = H αᵀ (weighted sum of hidden states)
• Prediction: the output distribution is computed from the attention vector a
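The three equations above can be sketched with numpy. The shapes and random values are illustrative assumptions: columns of H are the hidden states h^(1)..h^(T), and W and q would be learned parameters in practice.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

# Illustrative shapes: hidden size d = 3, sequence length T = 4.
rng = np.random.default_rng(0)
d, T = 3, 4
H = rng.normal(size=(d, T))   # hidden states h^(1)..h^(T) as columns
W = rng.normal(size=(d, d))   # key projection (assumed square)
q = rng.normal(size=d)        # learned query vector

K = np.tanh(W @ H)            # key matrix   K = tanh(W H), shape (d, T)
alpha = softmax(q @ K)        # weights      alpha = softmax(q^T K), shape (T,)
a = H @ alpha                 # attention    a = H alpha^T, weighted sum of states
```

The weights alpha sum to 1 and give a per-position importance score, which is what makes the mechanism interpretable.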
EM attention variants
1. Fixed Attention:
• α^(t) = α, a single learned weight vector shared across all predictions
• Assumes some positions in the sequence are more important than others
2. Syntax Attention:
• α^(t) not shared across t
• Importance depends on the position of the current token in the sequence
3. Semantic Attention 1:
• Attention weights computed from the hidden-state content (keys and query as in the general mechanism)
4. Semantic Attention 2:
• h̃^(t) = concatenation of h^(t) and a^(t)
EM attention variants
5. Tiered Attention
• Replaces the mean over lower-tier hidden states with an attention-weighted average
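The tiered variant above can be sketched as swapping mean pooling for a learned weighted average over the lower-tier summaries. The matrix H and scoring vector q below are random, assumed stand-ins for the lower-tier hidden states and a learned parameter.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

# Lower-tier summaries for the log lines of one user-day
# (random stand-ins; q is an assumed learned scoring vector).
rng = np.random.default_rng(1)
L, d = 5, 8
H = rng.normal(size=(L, d))       # one row per log line
q = rng.normal(size=d)

mean_summary = H.mean(axis=0)     # plain mean over log lines (original tiered model)
alpha = softmax(H @ q)            # one attention weight per log line
attn_summary = alpha @ H          # attention-weighted average (tiered attention)
```

Both summaries have the same shape, so the upper-tier model is unchanged; only the pooling step differs.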
Results
Table 1. AUC statistics for word tokenization models
Table 2. AUC statistics for character tokenization models
Analysis
Comparison of attention weights when predicting the success/failure token
Analysis 1. Average Fixed attention weights 2. Average Syntax attention weights 3. Average Semantic 1 attention weights 4. Average Semantic 2 attention weights
Analysis
• Tiered attention models
• For the lower forward-directional LSTM, attention weights were nearly 1.0 for the 2nd-to-last hidden state
• For the lower bidirectional LSTM, attention weights were nearly 1.0 for the 1st and last hidden states
• Hence, attention is not needed for this model task.
Case Studies
1. Word case study with Semantic attention
2. Low-anomaly word case study with Semantic attention
3. Character case study with Semantic attention
Conclusions
• Fixed and syntax attention are effective for fixed-structure sequences.
• Attention mechanisms improve performance and provide feature importance and a relational mapping between features.
Future directions
• Explore BEM with attention
• Equip the lower-tier model to attend over upper-tier hidden states