Walking the tightrope : Responsive yet stable Traffic Engineering Presented by Aparna Sundar
Problem Definition and current solutions TE problem definition Offline Methods – OSPF-TE , MPLS Online Methods – MATE, TeXCP
Inadequecy of Offline methods ● Cannot react to real-time traffic reroutes. ● Load distribution not guarenteed to be optimal. ● Suboptimal reaction to failure.
Online methods ● Should react to real-time traffic demands and failures. ● Prior approaches – centralized, assuming a global oracle, lacking stability analysis. Eg MATE. ● TeXCP – distributed and stable.
Summary of Results ● For same traffic demands, TeXCP supports same utilisation and failure resiliance with a third of the capacity as traditional offline methods. ● Network utilization is always within a few percentage points of optimal value. ● Prefers shorter routes while trimming long routes that are not useful.
Big Picture ● Two Components ● Load Balancer : multiple paths delivering demands from ingress to egress router, moving traffic from over-utilized to under utilized paths. ● Closed loop Feed Back controller: collects network feedback at faster time scale than LB to ensure traffic stability.
Diagram, Explanation of terms
LP Formulation at each IE pair
Path Selection,Probing Network state ● Path Selection: ● ISP picks set of K shortest paths that it can use. ● Probing Network state: ● Maintain probe timer, Tp, to maintain track of path utilization. Tp > RTT. ● Probe packet with updatable utilization field sent by ingress node. Egress node sends it back to app agent. ● Probe loss: estimate util to max(1,p u_sp)
Load Balancer ● Each agent maintains a decision timer, which fires every Td sec, > 5Tp. ● each time the agent,computes change in fraction of IE traffic sent on path p. ● At eqbm, x_sp is constant, traffic is conserved, no negative traffic possible,updates should decrease max utilization.
Load Balancer (contd) ● ● ● Intuition:path whose util is greater than avg shd dec its rate while path whose util isbelow avg should increase its rate. ● Change in traffic is proportional to current traffic on path (which is prop to util). ● Use of epsilon – to re-use or restart util on a path.
Preventing Oscillations , Managing Congestion ● Two agents working independantly may shift flow to link that was previously under-utilized. ● Solution (inspired by XCP) ● Compute aggregate feedback: ● Compute per IE flow feedback based on a Max- Min approach: ● ● Positive feedback added, negative multiplied.
Preventing Oscillations , Managing Congestion(contd) ● Sending feedback to agents using probe. ● g_sp is allowed rate on path p. ● Actual rate = min(g_sp, x_sp R_s). ● Prefer to use shorter paths: Use weighted max-min fairness to push a preference for shorter paths. ● Heuristic : Shorter paths better for better network utilisations
Analysis ● Computation of explicit feedback for each pair, by load balancer, that leads to more stable per- IE flow rates and subsequently utilizations. ● Effect of feedback on network, “mostly” done by the time load balancer kicks into action ie,the explicit feedback brings path util to 90% of desired value, before the next time any of the load balancers need to make a decision.
Results and Comparision
Results and comparision (contd)
Discussion ● Look at source-dest paths, instead of ingress- egress paths? ● Metric for network utilization ● ● Including estimate for egress-ingress links?
Recommend
More recommend