| 1 Machine Learning as Enabler for Cross-Layer Resource Allocation: Opportunities and Challenges with Deep Reinforcement Learning. Fatemeh Shah-Mohammadi and Andres Kwasinski, Rochester Institute of Technology.
| 2 Outline • Benefits of cross-layering. • Cognitive radios as enablers for cross-layer systems. • QoE-based resource allocation with Deep Q-learning. • Transfer learning for accelerated learning of Deep Q-Networks. • Uncoordinated multi-agent Deep Q-learning with non-stationary environments.
| 3 Why a Cross-Layer Approach? • Ubiquitous computing requires pervasive connectivity, » under different wireless environments, » with a heterogeneous network infrastructure and traffic mix. • A user-centric approach translates to QoE metrics: » an end-to-end yardstick.
| 4 Obstacle to Cross-Layer Realization • Wireless device development is divided among different teams, each specialized in implementing one layer or sub-layer in a specific processor (e.g. main CPU or baseband radio processor).
| 5 Cognitive Radios as Cross-Layer Enablers • The wireless network environment viewed as a multi-layer entity. • The cognitive engine in a cognitive radio senses and interacts with this environment by measuring and acting across its multiple layers.
| 6 Study Case: Underlay DSA • A primary network (PN) owns a portion of the spectrum. • A secondary network (SN) simultaneously transmits over the same portion of the spectrum. • Transmissions in the secondary network use a power such that the interference they create on the primary network remains below a tolerable threshold. [Figure: network diagram showing the primary network access point, secondary network terminals - SUs (cognitive radios), the secondary network access point, transmissions in the secondary network, and the interference from secondary to primary.]
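Not from the slides, but as a concrete illustration of the underlay constraint: a minimal sketch, assuming a hypothetical linear channel gain from the SU transmitter to the primary receiver and a tolerable interference threshold at that receiver, of how an SU could cap its transmit power.

```python
# Minimal sketch of the underlay DSA power constraint (illustrative only).
# Assumptions: g_sp is the linear channel gain from the SU transmitter to the
# primary receiver, i_max is the tolerable interference power at that receiver,
# and p_hw_max is the SU radio's own maximum transmit power.

def max_underlay_power(g_sp: float, i_max: float, p_hw_max: float) -> float:
    """Largest SU transmit power whose interference at the PN stays below i_max."""
    p_interference_limited = i_max / g_sp          # power at which interference equals i_max
    return min(p_interference_limited, p_hw_max)   # also respect the hardware power limit

# Example with made-up numbers: gain 1e-3, threshold 1e-6 W, 1 W radio limit.
print(max_underlay_power(g_sp=1e-3, i_max=1e-6, p_hw_max=1.0))  # -> 0.001 W
```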
| 7 User-Centric Secondary Network • Heterogeneous traffic mix: interactive video streams (high bandwidth demand, delay constraint) and regular data (FTP). • Performance is measured as Quality of Experience (QoE), following the user-centric approach to network design and management advocated in 5G systems. • Chosen QoE metric: Mean Opinion Score (MOS), a common yardstick for both data and video traffic. [Slide figure: Data MOS and Video MOS.]
MOS | Quality   | Impairment
5   | Excellent | Imperceptible
4   | Good      | Perceptible but not annoying
3   | Fair      | Slightly annoying
2   | Poor      | Annoying
1   | Bad       | Very annoying
| 8 Problem Setup • Cross-layer resource allocation problem. • For an underlay DSA SN, choose: – transmitted bit rate (i.e. source compression for video), – transmit power, • such that the QoE for end users is maximized.
| 9 Solution Based on Deep Reinforcement Learning • Use a multi-agent Deep Q-Network (DQN) to solve the problem. • An efficient realization of Reinforcement Learning (RL). • An SU learns the actions (parameter settings) by following a repetitive cycle (sketched after this slide): the agent (SU) observes the environment state, selects an action from the action space, and receives a reward from the environment. [Diagram: agent-environment loop; the state includes '0/1' SU power feasibility (target SINR), '0/1' underlay DSA interference condition, and '0/1' Layer 2 outgoing queue delay; the environment spans the radio link and the Layer 2 queue (delay), with the reward tied to MOS.]
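A minimal sketch of this observe-act-reward cycle. The toy environment, action values, and tabular learning rule below are illustrative assumptions, not the actual cognitive engine; the table-based update is replaced by the DQN of the next slide.

```python
import random

# Hypothetical action space: (video source bit rate in Mb/s, transmit power in W).
ACTIONS = [(r, p) for r in (0.5, 1.0, 2.0) for p in (0.01, 0.1, 0.5)]

class ToyEnv:
    """Stand-in environment (illustrative only): returns the binary state features
    named on the slide and a reward meant to represent the resulting MOS."""
    def observe(self):
        # ('0/1' power feasibility, '0/1' interference condition, '0/1' queue delay)
        return (1, 1, 0)
    def step(self, action):
        rate, power = action
        reward = random.uniform(1, 5)        # placeholder for the measured MOS
        return self.observe(), reward

def run_cycle(env, q_table, steps=100, eps=0.1, alpha=0.1, gamma=0.9):
    """One SU's repetitive cycle: observe state, select action, receive reward."""
    state = env.observe()
    for _ in range(steps):
        if random.random() < eps:
            a = random.randrange(len(ACTIONS))                   # explore
        else:                                                    # exploit current estimates
            a = max(range(len(ACTIONS)), key=lambda i: q_table.get((state, i), 0.0))
        next_state, reward = env.step(ACTIONS[a])
        # Q-learning update; in the DQN this table becomes a neural network.
        old = q_table.get((state, a), 0.0)
        best_next = max(q_table.get((next_state, i), 0.0) for i in range(len(ACTIONS)))
        q_table[(state, a)] = old + alpha * (reward + gamma * best_next - old)
        state = next_state

run_cycle(ToyEnv(), {})
```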
| 10 Deep Q-Network • Estimate the Q action-value function, i.e. the expected discounted reward to be received when taking action $a_t$ while the environment is in state $s_t$ at time $t$: $Q(s,a) = \mathbb{E}\left[\sum_{k=0}^{\infty} \gamma^{k} r_{t+k+1} \,\middle|\, s_t = s,\ a_t = a\right]$.
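A minimal DQN sketch of this idea, assuming PyTorch as the framework; the layer sizes, learning rate, and discount factor are illustrative and not taken from the slides. The network outputs one Q-value per action, and each update pulls Q(s,a) toward the Bellman target r + gamma * max_a' Q(s',a').

```python
import torch
import torch.nn as nn

STATE_DIM, N_ACTIONS, GAMMA = 3, 9, 0.9   # illustrative sizes, not from the slides

# The network approximates Q(s, a): one output per action in the action space.
q_net = nn.Sequential(nn.Linear(STATE_DIM, 32), nn.ReLU(), nn.Linear(32, N_ACTIONS))
optimizer = torch.optim.Adam(q_net.parameters(), lr=1e-3)

def dqn_update(state, action, reward, next_state):
    """One Bellman-target update of the Q-network."""
    s = torch.tensor(state, dtype=torch.float32)
    s_next = torch.tensor(next_state, dtype=torch.float32)
    with torch.no_grad():
        target = reward + GAMMA * q_net(s_next).max()   # bootstrapped target
    pred = q_net(s)[action]                             # current estimate of Q(s, a)
    loss = nn.functional.mse_loss(pred, target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

# Example step using the binary state features from the previous slide:
dqn_update(state=(1, 1, 0), action=4, reward=3.7, next_state=(1, 1, 1))
```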
| 11 Sharing Experience • Limited changes in the wireless environment when a newcomer SU joins an already operating network. • The environment awareness of expert SUs (reflected in the action-value parameters encoded in their DQN weights) can be transferred to the newcomer SU (see the sketch after this slide). • This technique is called "Transfer Learning".
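A minimal sketch of this experience transfer, reusing the hypothetical PyTorch-style q_net from the previous sketch: the expert SU's learned weights are copied into the newcomer's network before the newcomer continues learning on its own observations.

```python
def transfer_from_expert(expert_q_net, newcomer_q_net):
    """Initialize the newcomer SU's DQN with the expert SU's learned weights.

    The expert's awareness of the environment is encoded in its action-value
    parameters (network weights), so copying them gives the newcomer a head
    start before it fine-tunes on its own observations.
    """
    newcomer_q_net.load_state_dict(expert_q_net.state_dict())
    return newcomer_q_net

# Example (hypothetical): expert_net was trained on the operating network,
# newcomer_net is a freshly built network with the same architecture.
# newcomer_net = transfer_from_expert(expert_net, newcomer_net)
```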
| 12 Transfer Learning Results • Accelerated learning without a performance penalty.
| 13 An Issue With the Standard DQN • Scenario: uncoordinated multi-agent power allocation. CRs maximize their throughput while keeping the relative throughput change in the PN below a limit. • A standard DQN may not converge because the environment is non-stationary. [Plot: Q-values of a standard DQN failing to converge.]
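As a small illustration of the PN protection constraint in this scenario (the 5% limit and function name are assumed placeholders, not from the slides):

```python
def pn_constraint_ok(pn_throughput_before, pn_throughput_now, max_rel_change=0.05):
    """Check the PN protection constraint: the relative throughput change that the
    secondary transmissions cause in the primary network must stay below a limit."""
    rel_change = (pn_throughput_before - pn_throughput_now) / pn_throughput_before
    return rel_change <= max_rel_change

# Example: PN throughput drops from 10.0 to 9.7 Mb/s -> 3% change, constraint met.
print(pn_constraint_ok(10.0, 9.7))  # -> True
```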
| 14 Uncoordinated Multi-Agent DQN (Acknowledgement to Ankita Tondwalkar) • Exploration phase: perform action exploration only occasionally, generating a near-stationary environment. • Near-standard DQN (no replay memory; target action-values stored in an array). • Policy update with inertia (see the sketch after this slide).
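A minimal sketch of the two mechanisms named above, with assumed hyperparameters (the exploration-phase schedule, epsilon, and inertia factor are illustrative): exploration is confined to occasional phases so that the other agents see a near-stationary environment, and the followed policy moves toward the greedy policy only gradually instead of switching at every step.

```python
import random

def in_exploration_phase(step, phase_len=200, period=2000):
    """Explore only during occasional, short phases; act near-stationarily otherwise."""
    return (step % period) < phase_len

def select_action(step, state, q_values, policy, n_actions, eps=0.2, inertia=0.9):
    """Epsilon-greedy only inside exploration phases; policy updated with inertia."""
    greedy = max(range(n_actions), key=lambda a: q_values.get((state, a), 0.0))
    # Inertia: keep the previously followed policy with high probability, so the
    # followed policy drifts toward the greedy one instead of jumping to it.
    if random.random() < inertia and state in policy:
        chosen = policy[state]
    else:
        chosen = greedy
        policy[state] = greedy
    if in_exploration_phase(step) and random.random() < eps:
        chosen = random.randrange(n_actions)   # occasional exploration only
    return chosen

# Example: empty policy and Q-value array/dict; 9 actions as in the earlier sketches.
policy, q_values = {}, {}
a = select_action(step=0, state=(1, 1, 0), q_values=q_values, policy=policy, n_actions=9)
```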
| 15 Uncoordinated Multi-Agent DQN - Results [Plots: Q-values for the proposed technique and for a standard DQN (for comparison purposes, same scenario).] • Demonstrable convergence to the optimal solution as learning time goes to infinity.
| 16 Uncoordinated Multi-Agent DQN - Results • Comparison against the optimal solution obtained through exhaustive search; optimality based on maximum sum throughput in the SN.
| 17 Conclusions • Discussed the benefits of cross-layer protocols and their practical realization through cognitive radios. • Presented a QoE-based cross-layer resource allocation cognitive engine with Deep Q-learning. • Explained how learning can be accelerated for a newcomer node by transferring experience from another node. » Learning is accelerated with no discernible performance loss. • Presented a first-of-its-kind Deep Q-learning technique that converges to the optimal resource allocation in an uncoordinated, interacting multi-agent scenario (non-stationary environment).
| 18 Thank You! Questions?