video streams based on user
play

Video Streams based on User Access Pattern Ngo Quang Minh Khiem - PowerPoint PPT Presentation

Adaptive Encoding of Zoomable Video Streams based on User Access Pattern Ngo Quang Minh Khiem Guntur Ravindra Wei Tsang Ooi National University of Singapore Zoomable Video Zoomable Video with Bitstream Switching (x,y,w,h) Server Client


  1. Adaptive Encoding of Zoomable Video Streams based on User Access Pattern Ngo Quang Minh Khiem Guntur Ravindra Wei Tsang Ooi National University of Singapore

  2. Zoomable Video

  3. Zoomable Video with Bitstream Switching

  4. (x,y,w,h) Server Client GOAL: Minimize bandwidth to transmit RoIs

  5. Dynamic Cropping of ROI Encode video once Support any RoI cropping Tiled Streaming Monolithic Streaming (TS) (MS)

  6. Tiles overlapping with the RoI are One tile = k x k macroblocks transmitted Encode each tile as independantly decodable video streams Tiled Streaming

  7. Data outside RoI need for decoding RoI Single monolithic video Monolithic Streaming

  8. Trade-offs with TS and MS Bigger tile  More waste  More bits TS Smaller tile  Less compression  More bits

  9. Longer MV  More dependency  More bits MS Shorter MV  Less compression  More bits

  10. RoI Access Pattern Reduce bandwidth further, given RoI access statistics?

  11. Questions in this paper • Tiled Streaming  Different tile size in the same frame? • Monolithic Streaming  Different motion search range? • How?

  12. Adaptive Encoding Given RoI access statistics, adapt the encoding parameters such that the expected bandwidth E needed to transmit a RoI is minimized   ( ) ( ) E c r p r  r R c(r): compressed size of RoI r p(r): access probability of RoI r

  13. RoI Access Pattern Log user selection of RoI (Online) Encoded Video Adaptive Encoded Video

  14. Adaptive Encoding Adaptive Tiling Monolithic Streaming (AT) with RoI-aware Coding (MS-PB)

  15. Adaptive Tiling Given RoI access pattern, tile the video such that E is minimized   ( ) ( ) E c t p t  t T c(t): compressed size of tile t p(t): access probability of tile t

  16. Intuition Allowing tiles of different sizes can reduce bandwidth RoI accessed Merge tiles by most users 1,2,3 and 4 1 2 3 4 Regular tiling Adaptive tiling with 2x2 tiles

  17. Greedy Heuristic Tiling • Start with regular 1x1 tiles • Merge a tile with its neighbors if expected bandwidth is reduced • Merge newly-formed tile with its neighbors bandwidth is reduced

  18. t 1 t 2 c(t 1 ) = 9 c(t 2 ) = 6 p(t 1 ) = 0.8 p(t 2 ) = 0.8 t 12 c(t 12 ) = 11 p(t 12 ) = 1   p(t )c(t ) p(t )c(t ) p(t )c(t ) 1 1 2 2 12 12

  19. RoI Access Pattern Resulting tile map

  20. Monolithic Streaming with RoI-aware Coding • Referenced MBs form large region outside RoI • Short motion vector: less bandwidth efficient • Probabilistic boxing motion vector (MS-PB)

  21. Intuition R1 P(A), P(B ): sending A, B P(AB) : A and B in same RoI R2 P(A) – P(AB) : sending A independent of B A B • P(A) – P(AB) > P(B)  Increase in size of A when sending R2 is marginal • P(A) – P(AB) < P(B)  Increase in size of A when sending R2 is higher • [P(A)-P(AB)] S(A) > P(B) S(B)

  22. Motion Vector Spread after MS-PB

  23. Evaluation • Evaluate AT and MS-PB in terms of  Bandwidth efficiency  Compression efficiency • Benchmark methods  Per-RoI  Tiled Streaming  Monolithic Streaming

  24. Video Sequences Rush-Hour (500 frames) Tractor (688 frames) Bball (200 frames) Rainbow (350 frames)

  25. Experiment Setup • RoI size: 320x192 pel • Video resolution 1920x1080 pel • Evaluation is conducted by a training-testing framework  Training and test sets have the same distribution • One training and test set for each GoP

  26. Expected Data Rate for Different Videos without B-Frames 4 Expected Data Rate (Mbps) 3.5 3 PerRoI 2.5 MS-PB 2 MS 1.5 AT 1 TS4x4 0.5 0 Bball Rainbow Test Video

  27. Expected Data Rate for Different Videos with 2 B-Frames 4.5 Expected Data Rate (Mbps) 4 3.5 3 PerRoI 2.5 MS-PB 2 MS 1.5 AT 1 TS4x4 0.5 0 Bball Rainbow Test Video

  28. Compressed Video File Size with 2 B-Frames 160 140 120 File Size (MB) PerRoI 100 MS-PB 80 MS 60 AT 40 TS16x16 20 0 Bball Rainbow Test Video Compressed Video File Size without B-Frames 140 120 PerRoI File Size (MB) 100 MS-PB 80 MS 60 AT 40 TS16x16 20 0 Bball Rainbow Test Video

  29. Presence of B-frame Without B-frame With B-frame MS-PB < MS MS- PB ≈ MS Motion Vector Spread Motion Vector Spread without B-frame with 2 B-frame

  30. Conclusion & Future Work • Propose an adaptive encoding approach based on user access patterns • Reduce bandwidth by 21% (MS-PB) and 27% (AT) • Limiting motion vector is beneficial to zoomable video with wide spread of dependency • Future work:  Computational complexity  Diverse user interest of RoI  Frequency of Adaptation

  31. Thank you • Questions? • Feedback/Suggesetion?

Recommend


More recommend