caesar context aware event stream analytics in real time
play

CAESAR: Context-Aware Event Stream Analytics in Real time Olga - PowerPoint PPT Presentation

CAESAR: Context-Aware Event Stream Analytics in Real time Olga Poppe, Chuan Lei, Elke A. Rundensteiner, and Dan Dougherty March 18, 2016 1 Complex Event Processing CEP engine Primitive events Complex events 1 , 2 , 3 The


  1. CAESAR: Context-Aware Event Stream Analytics in Real time Olga Poppe, Chuan Lei, Elke A. Rundensteiner, and Dan Dougherty March 18, 2016 1

  2. Complex Event Processing CEP engine Primitive events Complex events 𝑅 1 , 𝑅 2 , 𝑅 3 The same workload of independent event queries is continuously evaluated 2 Worcester Polytechnic Institute

  3. Application Context β€’ Event compositions signify application contexts β€’ Most event queries are appropriate only in certain contexts β€’ They can be safely suspended otherwise Examples of application contexts: Emergency management: normal, crowded, fire β€’ Health care: safe, warning, violation β€’ Algorithmic trading: hold, buy, sell β€’ Financial fraud: approved, suspicious, fraud β€’ 3 Worcester Polytechnic Institute

  4. Traffic Management Use Case 140 hours idling in traffic due to congestion in 10-worst β€’ U.S. traffic corridors per year [The Wall Street Journal] Health cost of $18 billion due to traffic noise and pollution β€’ in the USA's 83 largest urban areas in 2010 [USA Today] 1.24 million deaths due to traffic injuries worldwide in β€’ 2010 [Wikipedia] 4 Worcester Polytechnic Institute

  5. Traffic Management Contexts Accident Congestion Clear Accident warning Toll notification Statistics Route re-computation Route re-computation Local services Goal is to leverage application contexts to speed up system responsiveness 5 Worcester Polytechnic Institute

  6. Challenges β€’ Rich semantics ─ Complex conditions implying a context ─ Unknown and unbounded context duration ─ Multiple inter-dependent event queries β€’ Readable specification β€’ Real time responsiveness 6 Worcester Polytechnic Institute

  7. State-of-the-art Approaches CEP Business Systems CAESAR Models (Esper, (BPMN, UML) StreamInsight) Expressive event queries Application contexts Context- aware optimizations 7 Worcester Polytechnic Institute

  8. Contributions & Outline CAESAR system: β€’ Graphical model β€’ Context-aware algebra β€’ Context-driven optimization techniques β€’ Execution infrastructure Performance evaluation 8 Worcester Polytechnic Institute

  9. Outline CAESAR Model 9 Worcester Polytechnic Institute

  10. Context-aware Event Stream Analytics 10 Worcester Polytechnic Institute

  11. Context-aware Event Stream Analytics 11 Worcester Polytechnic Institute

  12. Context-aware Event Stream Analytics 12 Worcester Polytechnic Institute

  13. Application Contexts 13 Worcester Polytechnic Institute

  14. Context Deriving Queries 14 Worcester Polytechnic Institute

  15. Context Processing Queries 15 Worcester Polytechnic Institute

  16. Context-aware Event Queries 16 Worcester Polytechnic Institute

  17. Outline CAESAR Algebra 17 Worcester Polytechnic Institute

  18. Context-preserving Plan Generation 18 Worcester Polytechnic Institute

  19. CAESAR Algebra Operators 1. Context initiation 𝐷𝐽 c 𝐽, 𝑋 2. Context termination π·π‘ˆ c 𝐽, 𝑋 3. Context window 𝐷𝑋 𝑑 𝐽, 𝑋 4. Filter 𝐺𝐽 πœ„ (𝐽) 5. Projection 𝑄𝑆 𝐡,𝐹 (𝐽) 6. Event pattern 𝑄(𝐽) 19 Worcester Polytechnic Institute

  20. Runtime Context Maintenance Context bit vector 𝑋 : 0 1 0 0 1 0 0 0 0 0 Context types: 𝑑 a , c b , … c z Time stamp 𝑋. 𝑒𝑗𝑛𝑓 Updated by the context initiation & termination operators β€’ Accessed by the context window operator β€’ Synchronized by the time driven scheduler β€’ 20 Worcester Polytechnic Institute

  21. Translation from Query Set to Algebra Plan DERIVE Toll(c.id, c.sec, 5) PATTERN NewCar c CONTEXT congestion DERIVE NewCar(s.id, s.xway, s.dir, s.seg, s.lane, s.pos, s.lane) PATTERN SEQ ( NOT Position f, Position s) WHERE f.sec+30=s.sec AND f.id=s.id AND f.lane β‰  β€² exitβ€² CONTEXT congestion 21 Worcester Polytechnic Institute

  22. Outline CAESAR Optimizer 22 Worcester Polytechnic Institute

  23. CAESAR Optimizer Overview Problem statement: Given a workload of context-aware event queries, our optimization problem is to find an optimized query plan for this workload with minimal CPU cost. Context-aware optimization techniques: Context window push down strategy β€’ Context workload sharing algorithm β€’ 23 Worcester Polytechnic Institute

  24. Context Window Push Down Strategy Performance benefits: Suspension of irrelevant operators β€’ Context-driven stream routing β€’ 24 Worcester Polytechnic Institute

  25. Context Workload Sharing Algorithm 25 Worcester Polytechnic Institute

  26. Context Workload Sharing Algorithm 26 Worcester Polytechnic Institute

  27. Context Workload Sharing Algorithm 27 Worcester Polytechnic Institute

  28. Outline CAESAR Infrastructure & Experiments 28 Worcester Polytechnic Institute

  29. CAESAR Architecture 29 Worcester Polytechnic Institute

  30. Experimental Setup Execution infrastructure : Java 7, 1 Linux machine with 16-core 3.4 GHz CPU and 48GB of RAM Data sets : Linear Road stream benchmark (LR) [1] β€’ 3 roads=1.7GB Physical Activity Monitoring real data set (PAM) [2] β€’ 1.6GB [1] A.Arasu et al., Linear Road: A stream data management benchmark. VLDB’04 [2] A.Reiss et al., Creating and benchmarking a new data set for physical activity monitoring. PETRA’12 30 Worcester Polytechnic Institute

  31. Context-aware Event Stream Analytics For 7 roads, context-aware (CA) event stream analytics is 9 -fold faster than context-independent (CI) approach. 31 Worcester Polytechnic Institute

  32. Context-aware Event Query Sharing If 30 context windows of length 15 minutes process 4 event queries each and overlap by 15 minutes, workload sharing wins 6 -fold. 32 Worcester Polytechnic Institute

  33. Outline Conclusions 33 Worcester Polytechnic Institute

  34. Conclusions β€’ CAESAR is first context-aware CEP system β€’ Graphical context-specification model β€’ Context-aware algebra β€’ Context-driven optimization techniques β€’ Execution infrastructure β€’ 8-fold speed up on average 34 Worcester Polytechnic Institute

  35. Acknowledgement β€’ Advisors: Elke A. Rundensteiner, Dan Dougherty β€’ Collaborator: Chuan Lei β€’ DSRG group at WPI β€’ EDBT reviewers β€’ NSF grants IIS 1018443 and IIS 1343620 35 Worcester Polytechnic Institute

Recommend


More recommend