Analytics on Sensor Networks Joint work with D. D. Ha Hallac , S. - PowerPoint PPT Presentation

Analytics on Sensor Networks Joint work with D. D. Ha Hallac , S. Vare, S. Bhooshan, R. Sosic, S. Boyd, and VW Jure Leskovec

Jure Leskovec 2

Sensors are Everywhere § Sequences of time stamped observations Jure Leskovec, Stanford 3

Sensor Data: Time Series § Sensors generate lots of time-series data Jure Leskovec, Stanford 4

Challenges § This data is § High-dimensional § Unlabeled § High-velocity § Dynamic § Heterogeneous Jure Leskovec, Stanford University 5

…But it Can be Very Valuable! § Caterpillar shipping § Discovered correlation between fuel usage and refrigerated containers § Realized that in certain regimes they needed to re-optimize their engine configuration parameters § Saved $650,000+/year Jure Leskovec, Stanford University 6

Success Stories § Pella Corporation § Large window and door manufacturing § Owns 10 manufacturing plants § Large % of costs comes from energy bill § Deployed sensor network across their plants § To monitor usage and provide real-time feedback to operators § 16% decrease in energy costs! Jure Leskovec, Stanford University 7

Discovering Structure in the Data § Without proper methods, it is not possible to capitalize on the promise of “big data” § Unsupervised learning methods are needed to allow humans to interpret and act on these large datasets Jure Leskovec, Stanford University 8

How do we describe the structure of the time series so we can obtain insights and make predictions? 9

Key Questions How to break down time series datasets into simple, interpretable components? § …without pre-defining the structure, which leaves us open to biases! How can we identify breakpoints, outliers, and labels for this time series data in a scalable way? eaming settings increasingly common § St Stream Jure Leskovec, Stanford University 10

Today’s Talk § Toeplitz inverse covariance-based clustering (TICC) § Drive2Vec § Overview of future research directions in time series analysis § Deep learning § Open-source tools § Applications Jure Leskovec, Stanford University 11

Toeplitz Inverse Covariance- based Clustering (TICC) 12

Interpreting a Time Series Value in “breaking down” the data into a sequence of states Jure Leskovec, Stanford University 13

Simultaneous Segmentation and Clustering In general, these “states” are not predefined § § We do not know what they are, nor what they refer to… § Instead, we need to discover these states in an uns unsup upervised way! Jure Leskovec, Stanford University 14

What is a Time Series? § T sequential observations § x 1 , x 2 , …, x T § Each observation x i is n -dimensional § i.e., coming from n different sensors § Observations can be synchronous or asynchronous § There may be missing data § For example, if certain sensors are sampled at a higher rate than others Jure Leskovec, Stanford University 15

Goal Given : Multivariate time series § Gi Goal: Assign each point into one of § Go K different states (or clusters ), each defined by a simple “pattern” Jure Leskovec, Stanford University 16

Definition of a Cluster Convert a sequence of timestamped observations into a time-varying network Jure Leskovec, Stanford University 17

Definition of a Cluster Each cluster is defined by a multilayer correlation § network, or a Markov Random Field (MRF) § Contains both intra-layer and inter-layer edges MRFs encode st structural relationsh ships between § the sensors Jure Leskovec, Stanford University 18

Example Jure Leskovec, Stanford University 19

Automobile – “Turning” State Jure Leskovec, Stanford University 20

Automobile – “Stopping” State Jure Leskovec, Stanford University 21

TICC Problem Setup § Formal definition: where, Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data. D. Hallac, S. Vare, S. Boyd, J. Leskovec. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) , 2017 22

Block Toeplitz Matrices § Sparsity in the Toeplitz matrix defines the MRF edge structure § Toeplitz constraint enforces time invariance

Running Example

Approach: EM § TICC is highly non-convex § But we can use an EM-like approach to solve it! § Alternate between… § Assigning points to clusters in a temporally consistent way § Updating the cluster parameters 2 5

Assigning Points to Clusters We can solve this with dynamic programming! 2 6

Updating Cluster Parameters § Toeplitz Gr Graphical Lasso: § We derive an ADMM solution (with closed-form proximal operators) to solve this problem efficiently

TICC: Scalability CVXPY SnapVX § Can scale to problems with tens of millions of observations! SnapVX: A Network-Based Convex Optimization Solver. D. Hallac, C. Wong, S. Diamond, A. Sharang, R. Sosi č , S. Boyd, J. Leskovec. Journal of Machine Learning Research (JMLR), 18(4):1 − 5, 2017. Jure Leskovec, Stanford 28

How to Use TICC k box solver that returns § Black § Segmentation of the time series § Structural network defining each state § Key parameter: Number of states § Statistical methods of choosing the optimal parameter value § How to understand the results? Jure Leskovec, Stanford University 29

Case Study: Automobiles § We analyzed 1 hour of driving data § 36,000 samples @ 10Hz § We observed seven sensors § Brake pedal position § Forward (X-)acceleration § Lateral (Y-)acceleration § Steering wheel angle § Vehicle velocity § Engine RPM § Gas Pedal Position Jure Leskovec, Stanford University 30

Interpreting the Clusters § We run TICC with K = 5 clusters and plot the betweenness centrality score of each node in each cluster Jure Leskovec, Stanford University 31

Plotting the Resulting Clusters § Green = straight, white = slowing down, red = turning, blue = speeding up § Results are very consistent across the data! Jure Leskovec, Stanford University 36

Implications § Auto-labeling of data in an unsupervised way § Big cost for autonomous vehicles engine for discovering motifs in § Sear Search ch en the time series § Discover unique characteristics of individual drivers § Can be used to identify more granular behaviors § Lane changes, near-accidents, etc. Jure Leskovec, Stanford University 37

Predicting the Future (but without feature engineering) Jure Leskovec, Stanford University 38

[Hallac et al., 2018 ] Key Question Can you aggregate all of car’s sensors and embed them into a single, low-dimensional st stat ate ? Jure Leskovec, Stanford University 39

Our Approach This state should be pr predi dictive of of bot both term future the the sho hort t and nd long ng-te § First order effects – what the car is about to do § Second order effects – the environment that the car is currently in (location, driver style, etc…) Jure Leskovec, Stanford University 40

Key Insight Key insight: Attempt to predict the Key future at at m multiple g e gran anular arities es simultaneously: § Combine multiple RNNs so they can learn at different levels of abstraction § Learn to encode future at various time-scales Jure Leskovec, Stanford University 41

Drive2Vec Architecture § Recurrent Neural network based on stacked Gated Recurrent Units (GRUs) Jure Leskovec, Stanford University 42

Problem Setup § Dataset: Automobile data containing 1,400 sensors recording at 10 Hz. § Goal: Predict driver actions 1 sec before they occur § Left/Right blinker § Accelerate (gas pedal > threshold) § Hard braking (brake pedal < threshold) Driver Identification Using Automobile Sensor Data from a Single Turn. D. Hallac, A. Sharang, R. Stahlmann, A. Lamprecht, M. Huber, M. Roehder, R. Sosic, J. Leskovec IEEE International Conference on Intelligent Transportation Systems (ITSC), 2016. 4 Jure Leskovec, Stanford University 3

Drive2Vec Goal Given: a 1 second window (10 § Gi samples) of 665-dimensional data Goal: Embed this data into a single § Go 64-dimensional state that can be used to predict the short and long- term future of the car Jure Leskovec, Stanford University 44

Drive2Vec Experiments § This single 64-dimensional embedding can: § A) Predict ex exact act sensor values in short- term § B) Predict long-term av age sensor aver erag values § C) Correctly identify driver (out of 29 potential drivers) § D) Be used as a kn knowledge base to identify potentially risky scenarios Jure Leskovec, Stanford University 45

Analytics on Sensor Networks Joint work with D. D. Ha Hallac , S. - PowerPoint PPT Presentation

Analytics on Sensor Networks Joint work with D. D. Ha Hallac , S. Vare, S. Bhooshan, R. Sosic, S. Boyd, and VW Jure Leskovec Jure Leskovec 2 Sensors are Everywhere Sequences of time stamped observations Jure Leskovec, Stanford 3 Sensor

Sensor Relocation Mesh-based Sensor Relocation Mesh-based Sensor Relocation Objective for

Sensor Networks & TinyDB Author: Roman Kolcun Supervisor: Julie A. McCann Index Sensor

Analytics and Data Summit 2020 Analytics and Data Summit 2020 Analytics and Data Summit 2020

Temporal Privacy in Wireless Sensor Networks Temporal Privacy in Wireless Sensor Networks

Mobile Communications Ad-Hoc Networks & Wireless Sensor Networks Ad-hoc networks

Localization in Sensor Networks Localization in Sensor Networks Jie Gao Computer Science

Localization in Sensor Networks I Localization in Sensor Networks I Jie Gao Jie Gao Computer

INTRODUCTION TO WIRELESS SENSOR NETWORKS Marco Zennaro, ICTP Trieste-Italy Wireless sensor

FLOOD SENSOR WATER LEAK & TEMPERATURE SENSOR Strictly Confidential Water leak &

Querying Sensor Networks Sam Madden 1 Sensor Networks Small computers with: Radios

Undergraduate Business Analytics Minor Spreadsheet Analytics BANA-2081 Business Analytics

Wireless Ad Hoc & Sensor Networks Wireless Ad Hoc & Sensor Networks Introduction -

DOOR / WINDOW SENSOR CONTACT & TEMPERATURE SENSOR Strictly Confidential Universal contact

Security in Wireless Sensor Networks Introduction A Wireless Sensor Network is a network made of

Clock Synchronization Chapter 9 Ad Hoc and Sensor Networks Ad Hoc and Sensor Networks

Wireless Sensor Wireless Sensor Networks (WSNs) Networks (WSNs) Technological Revolution

Following in the Footsteps of The Choshu Five Developing the next Generation Leaders

A Future for Software Engineering? Leon J. Osterweil University of Massachusetts Amherst, MA,

Adherence Engineering to Reduce Central Line Associated Bloodstream Infections Frank A. Drews

Traceability, education and promotion: getting the message out Graham H Beastall

Forensic Science in Criminal Courts: Ensuring Scientific Validity of Feature-Comparison Methods

The Institute of Computer Science at the University of W urzburg The Institute in Numbers 8

I felt utterly stripped of safety and love. And so, what tormented me most as I shook through

2006 Capping Ceremony CLDH-EI Dept. of Nursing The Capping Ceremony Symbols of the profes-

Analytics on Sensor Networks Joint work with D. D. Ha Hallac , S. - PowerPoint PPT Presentation

Analytics on Sensor Networks Joint work with D. D. Ha Hallac , S. Vare, S. Bhooshan, R. Sosic, S. Boyd, and VW Jure Leskovec Jure Leskovec 2 Sensors are Everywhere Sequences of time stamped observations Jure Leskovec, Stanford 3 Sensor

Sensor Relocation Mesh-based Sensor Relocation Mesh-based Sensor Relocation Objective for

Sensor Networks &amp; TinyDB Author: Roman Kolcun Supervisor: Julie A. McCann Index Sensor

Analytics and Data Summit 2020 Analytics and Data Summit 2020 Analytics and Data Summit 2020

Temporal Privacy in Wireless Sensor Networks Temporal Privacy in Wireless Sensor Networks

Mobile Communications Ad-Hoc Networks &amp; Wireless Sensor Networks Ad-hoc networks

Localization in Sensor Networks Localization in Sensor Networks Jie Gao Computer Science

Localization in Sensor Networks I Localization in Sensor Networks I Jie Gao Jie Gao Computer

INTRODUCTION TO WIRELESS SENSOR NETWORKS Marco Zennaro, ICTP Trieste-Italy Wireless sensor

FLOOD SENSOR WATER LEAK &amp; TEMPERATURE SENSOR Strictly Confidential Water leak &amp;

Querying Sensor Networks Sam Madden 1 Sensor Networks Small computers with: Radios

Undergraduate Business Analytics Minor Spreadsheet Analytics BANA-2081 Business Analytics

Wireless Ad Hoc &amp; Sensor Networks Wireless Ad Hoc &amp; Sensor Networks Introduction -

DOOR / WINDOW SENSOR CONTACT &amp; TEMPERATURE SENSOR Strictly Confidential Universal contact

Security in Wireless Sensor Networks Introduction A Wireless Sensor Network is a network made of

Clock Synchronization Chapter 9 Ad Hoc and Sensor Networks Ad Hoc and Sensor Networks

Wireless Sensor Wireless Sensor Networks (WSNs) Networks (WSNs) Technological Revolution

Following in the Footsteps of The Choshu Five Developing the next Generation Leaders

A Future for Software Engineering? Leon J. Osterweil University of Massachusetts Amherst, MA,

Adherence Engineering to Reduce Central Line Associated Bloodstream Infections Frank A. Drews

Traceability, education and promotion: getting the message out Graham H Beastall

Forensic Science in Criminal Courts: Ensuring Scientific Validity of Feature-Comparison Methods

The Institute of Computer Science at the University of W urzburg The Institute in Numbers 8

I felt utterly stripped of safety and love. And so, what tormented me most as I shook through

2006 Capping Ceremony CLDH-EI Dept. of Nursing The Capping Ceremony Symbols of the profes-

Sensor Networks & TinyDB Author: Roman Kolcun Supervisor: Julie A. McCann Index Sensor

Mobile Communications Ad-Hoc Networks & Wireless Sensor Networks Ad-hoc networks

FLOOD SENSOR WATER LEAK & TEMPERATURE SENSOR Strictly Confidential Water leak &

Wireless Ad Hoc & Sensor Networks Wireless Ad Hoc & Sensor Networks Introduction -

DOOR / WINDOW SENSOR CONTACT & TEMPERATURE SENSOR Strictly Confidential Universal contact