Speculative Plan Execution for Information Agents Greg Barish - PowerPoint PPT Presentation

Speculative Plan Execution for Information Agents Greg Barish University of Southern California Information Sciences Institute Advisor : Professor Craig A. Knoblock 1

Outline 1. Review and motivating example 2. Speculative plan execution 3. Value prediction for speculative execution 4. Related work 5. Summary 2

Streaming dataflow model • Dataflow a b c d – Operations scheduled by data availability a b c d • Independent operations execute in parallel MUL MUL MUL • Maximizes horizontal parallelism MUL – Dataflow computers [Dennis 1974] [Arvind 1978] ADD ADD – Example: computing (a*b) + (c*d) • Streaming Producer – Operations emit data as soon as possible • Independent data processed in parallel • Maximizes vertical parallelism Consumer – Network query engines [Ives et al. 1999] [Naughton et al. 2000] [Hellerstein et al. 2001] 3

The CarInfo agent 1. Locate cars that meet criteria - Edmunds.com 2. Filter out Oldsmobiles 4

The CarInfo agent 1. Locate cars that meet criteria - Edmunds.com 2. Filter out Oldsmobiles 3. Gather safety reviews for each - NHSTA.gov 5

The CarInfo agent 1. Locate cars that meet criteria - Edmunds.com 2. Filter out Oldsmobiles 3. Gather safety reviews for each - NHSTA.gov 4. Gather detailed reviews of each - ConsumerGuide.com 6

ConsumerGuide navigation • ConsumerGuide requires navigation from original search results to desired answer 7

CarInfo Agent Plan 1. Get list of cars from Edmunds.com that meet specified criteria. 2. Remove any Oldsmobiles from that list. 3. Get the search results for each of those cars from NHTSA.gov, extracting the safety ratings. 4. Get the search results for each car at CG.com, extracting the link to the summary page. 5. Get the summary page for each car, extracting the link to the full review. 6. Get the full review page for each car, extracting the review itself. 8

Agent Execution Performance • Standard von Neumann model – Execute one operation at a time – Each operation processes all of its input before output is used for next operation – Assume : 1000ms per I/O op, 100ms per CPU op • Execution time = 13.4 sec CG Full CG Summary CG Search NHTSA Select Edmunds 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 time (seconds) CPU-bound operation I/O-bound operation 9

Dataflow-style CarInfo agent plan ((Dodge Stratus), (Midsize coupe/hatchback, (Pontiac Grand Am), $4000 to $12000, (Mercury Cougar)) 2002) WRAPPER (safety reports) JOIN NHTSA Search WRAPPER SELECT search (car reviews) Edmunds maker != criteria Search "Oldsmobile " WRAPPER WRAPPER WRAPPER ConsumerGuide ConsumerGuide ConsumerGuide Search Summary Full Review ((Oldsmobile Alero), (Dodge Stratus), ((http://cg.com/summ/20812.htm), (Pontiac Grand Am), other summary review URLs ) (Mercury Cougar)) ((http://cg.com/full/20812.htm), other full review URLs ) 10

Expressing the CarInfo agent plan PLAN car-info { INPUT: criteria OUTPUT: reviews-and-ratings BODY { Wrapper ("Edmunds", criteria : cars) Select (cars, "maker != 'Oldsmobile'" : filtered-cars) Wrapper ("NHTSA", filtered-cars : safety-ratings) Wrapper ("CG Search", filtered-cars : summary-urls) Wrapper ("CG Summary", summary-urls : full-urls) Wrapper ("CG Full", full-urls : car-reviews) Join (safety-ratings, car-reviews, "l.make=r.make and l.model=r.model" : reviews-and-ratings) } } 11

Streaming dataflow executor • Thread pool architecture – Enables bounded, dynamic parallelism Plan operators 3 (e.g., Wrapper, Select, etc.) 2 Thread Plan Pool Plan Output Input 1 ((Oldsmobile Olero), (Dodge Stratus), (Midsize cpe/hatchbk, (Pontiac Grand Am), $4000 to $12000, (Mercury Cougar)) 2002) WRAPPER SELECT Example: Edmunds maker != Search "Oldsmobile " 12

Streaming dataflow performance Join CG Full CG Summary CG Search Select Edmunds 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 time (seconds) • Improved, but plan remains I/O-bound (76%) • Main problem: remote source latencies – Meanwhile, local resources are wasted • Complicating factor: binding constraints – Remote queries dependent on other remote queries • Question: How can execution be more efficient? 13

Speculative plan execution • Execute operators ahead of schedule – Predict data based on past execution • Allows greater degree of parallelism – Solves the problem caused by binding constraints • Can lead to speedups > streaming dataflow Join CG Full CG Summary GOAL CG Search Select Edmunds 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 time (seconds) 14

Focus of this talk • An approach to speculative plan execution – Safe & fair – Yields arbitrary speedups – Algorithm for the automatic transformation of agent plans • An approach to value prediction – Combines caching , classification , and transduction – Better accuracy and space efficiency than strictly caching 15

Outline 1. Review and motivating example 2. Speculative plan execution 3. Value prediction for speculative execution 4. Related work 5. Summary 16

How to speculate? • General problem – Means for issuing and confirming predictions • Two new operators – Speculate : Makes predictions based on "hints" hints predictions/additions Speculate answers confirmations – Confirm : Prevents errant results from exiting plan probable results Confirm actual results confirmations 17

How to speculate? • Example: CarInfo – Make predictions about cars based on search criteria – Makes practical sense: • Same criteria will typically yield same cars BEFORE W J W S W W W 18

How to speculate? • Example: CarInfo – Make predictions about cars based on search criteria – Makes practical sense: • Same criteria will typically yield same cars AFTER predictions/additions hints W J Confirm W Speculate S W W W answers confirmations 19

Detailed example 2002 Midsize coupe $4000-$12000 W J Confirm W Speculate S W W W Time = 0.0 sec 20

Issuing predictions Oldsmobile Olero T1 Dodge Stratus T2 Pontiac Grand Am T3 Mercury Cougar T4 W J Confirm W Speculate S W W W Time = 0.1 sec 21

Speculative parallelism Dodge Stratus T2 Pontiac Grand Am T3 Mercury Cougar T4 W J Confirm W Speculate S W W W Time = 0.2 sec 22

Answers to hints W J Confirm W Speculate S W W W Oldsmobile Olero Dodge Stratus Pontiac Grand Am Time = 1.0 sec Mercury Cougar 23

Continued processing Additions (corrections), if any W J Confirm W Speculate S W W W T1 T2 T3 T4 Time = 1.1 sec 24

Generation of final results Dodge Stratus (safety) (review) T2 Pontiac Grand Am (safety) (review) T3 Mercury Cougar (safety) (review) T4 W J Confirm W Speculate S W W W Time = 4.2 sec 25

Confirmation of results Dodge Stratus (safety) (review) Pontiac Grand Am (safety) (review) Mercury Cougar (safety) (review) W J Confirm W Speculate S W W W Time = 4.3 sec 26

In practice: how it works • Speculate generates speculative tuples • These tuples are run by a separate pool of “speculative threads” – These threads only execute operator methods on speculative tuples • Thus, the Speculate operator elicits more agent run-time parallelism – Greater thread-level parallelism (TLP) – Beyond the dataflow limit 27

Safety and fairness • Safety – Confirm operator • Fairness – CPU • Speculative operations executed by "speculative threads" – Lower priority threads – Memory and bandwidth • Speculative operations allocate "speculative resources" – Drawn from "speculative pool" of memory – Other solutions exist, such as RSVP (Zhang et al 1994) 28

Getting better speedups • Cascading speculation – Single speculation allows a max speedup of 2 • Time spent either speculating or confirming – Cascading speculation allows arbitrary speedups • Up to the length of the longest plan flow S S S S S S S S S a b c d e f g h i j W W W W W W W W W W W W W W W W W W W W C 29

Automatic plan transformation • One important step is determining the set of candidate transformations • However: – Determining this set is an expensive proposition – Assuming: • A candidate transformation can include one or more speculations • A given speculation is consumed by one and only one operator – The # of possible transformations: ST( n ) = ( n -1) + n *ST( n -1), ST(1) = 0 – A single flow of 10 consecutive operators has over 3 million possible speculative schedules! 30

Automatic plan transformation • An alternative: leverage Amdahl's Law: – Focus on most expensive path ( MEP ) • Basic algorithm 1. Find MEP 2. Find best candidate speculative plan transformation 3. IF no candidate found, THEN exit 4. Transform plan accordingly 5. REPEAT (anytime property) • The "best" candidate – The one with the highest potential speedup • Algorithm assumes some addtl speculative overhead – Function of the amount of data speculated about 31

Speculative Plan Execution for Information Agents Greg Barish - PowerPoint PPT Presentation

Speculative Plan Execution for Information Agents Greg Barish University of Southern California Information Sciences Institute Advisor : Professor Craig A. Knoblock 1 Outline 1. Review and motivating example 2. Speculative plan execution 3.

Speculative Execution Outcome unknown Block 1 Predict future execution path Begin executing

Harmonizing Speculative and Non-Speculative Execution in Architectures for Ordered Parallelism

Speculative Plan Execution for Information Agents Greg Barish University of Southern California

and Effi ficient Speculative Execution JIYONG YU, NAMRATA MANTRI, JOSEP TORRELLAS, ADAM

SpeechMiner: A Framework for Investigating and Measuring Speculative Execution Vulnerabilities

Intelligent Agents Chapter 2 Intelligent Agents p.1/25 Outline Agents and environments

Speculative Defragmentation Speculative Defragmentation A Technique to Improve the

FRACTAL AN EXECUTION MODEL FOR FINE-GRAIN NESTED SPECULATIVE PARALLELISM SU SUVINAY Y SU

Data-Centric Execution of Speculative Parallel Programs MARK JEFFREY, SUVINAY SUBRAMANIAN,

Data-Centric Execution of Speculative Parallel Programs MA MARK JEFFREY, SUVINAY SUBRAMANIAN,

MASTERING STRATEGY EXECUTION 18 BEST PRACTICES FOR STRATEGY EXECUTION STRATEGY EXECUTION AS

CSC421 Intro to Artificial Intelligence UNIT 01: Intelligent Agents Agents & environments

Intelligent Agents Chapter 2 Intelligent Agents p.1/25 Outline Agents and environments

2015 OUTSTANDING YOUNG AGENTS COMMITTEE: Membership Development The Young Agents Council of the

Intelligent Driving Agents Intelligent Driving Agents Microscopic traffic simulation with

Innovative Ideas to Engage Agents Will Bickmore & Sarah-Lynne Rand Senior Account Managers

CSCI-548: Information Integration on the Web Craig Knoblock University of Southern California

Entry and the USO in the Postal Sector ACCC 2004 Regulatory Conference July 29-30, 2004 Sea

https://github.com/gglobster/trappist TRAPPIST: d e r u t a e f - l l u f n

Session 3. Mapping and Timeline Define geographical boundaries of the project (mapping) This

Data Mining Lecture 06: Bayes Theorem Theses slides are based on the slides by Tan,

Understanding the Diversity of Tweets in the Time of Outbreaks Nattiya Kanhabua and Wolfgang

How specific arguments defeat general dogmas: lack of parsimony in molecular biology Vlasta

Seminar And Workshop On Grid Computing At UT 8:30 am - Coffee and Rolls 9:00 - 11:00 The

Speculative Plan Execution for Information Agents Greg Barish - PowerPoint PPT Presentation

Speculative Plan Execution for Information Agents Greg Barish University of Southern California Information Sciences Institute Advisor : Professor Craig A. Knoblock 1 Outline 1. Review and motivating example 2. Speculative plan execution 3.

Speculative Execution Outcome unknown Block 1 Predict future execution path Begin executing

Harmonizing Speculative and Non-Speculative Execution in Architectures for Ordered Parallelism

Speculative Plan Execution for Information Agents Greg Barish University of Southern California

and Effi ficient Speculative Execution JIYONG YU, NAMRATA MANTRI, JOSEP TORRELLAS, ADAM

SpeechMiner: A Framework for Investigating and Measuring Speculative Execution Vulnerabilities

Intelligent Agents Chapter 2 Intelligent Agents p.1/25 Outline Agents and environments

Speculative Defragmentation Speculative Defragmentation A Technique to Improve the

FRACTAL AN EXECUTION MODEL FOR FINE-GRAIN NESTED SPECULATIVE PARALLELISM SU SUVINAY Y SU

Data-Centric Execution of Speculative Parallel Programs MARK JEFFREY, SUVINAY SUBRAMANIAN,

Data-Centric Execution of Speculative Parallel Programs MA MARK JEFFREY, SUVINAY SUBRAMANIAN,

MASTERING STRATEGY EXECUTION 18 BEST PRACTICES FOR STRATEGY EXECUTION STRATEGY EXECUTION AS

CSC421 Intro to Artificial Intelligence UNIT 01: Intelligent Agents Agents &amp; environments

Intelligent Agents Chapter 2 Intelligent Agents p.1/25 Outline Agents and environments

2015 OUTSTANDING YOUNG AGENTS COMMITTEE: Membership Development The Young Agents Council of the

Intelligent Driving Agents Intelligent Driving Agents Microscopic traffic simulation with

Innovative Ideas to Engage Agents Will Bickmore &amp; Sarah-Lynne Rand Senior Account Managers

CSCI-548: Information Integration on the Web Craig Knoblock University of Southern California

Entry and the USO in the Postal Sector ACCC 2004 Regulatory Conference July 29-30, 2004 Sea

https://github.com/gglobster/trappist TRAPPIST: d e r u t a e f - l l u f n

Session 3. Mapping and Timeline Define geographical boundaries of the project (mapping) This

Data Mining Lecture 06: Bayes Theorem Theses slides are based on the slides by Tan,

Understanding the Diversity of Tweets in the Time of Outbreaks Nattiya Kanhabua and Wolfgang

How specific arguments defeat general dogmas: lack of parsimony in molecular biology Vlasta

Seminar And Workshop On Grid Computing At UT 8:30 am - Coffee and Rolls 9:00 - 11:00 The

CSC421 Intro to Artificial Intelligence UNIT 01: Intelligent Agents Agents & environments

Innovative Ideas to Engage Agents Will Bickmore & Sarah-Lynne Rand Senior Account Managers