synthesizing images and videos
play

Synthesizing Images and Videos Qifeng Chen Assistant Professor, - PowerPoint PPT Presentation

New Perspectives for Processing and Synthesizing Images and Videos Qifeng Chen Assistant Professor, HKUST Q&A Which company is the most valuable worldwide? Apple What is the most important product of Apple? iPhone What is


  1. New Perspectives for Processing and Synthesizing Images and Videos Qifeng Chen Assistant Professor, HKUST

  2. Q&A ◼ Which company is the most valuable worldwide? ◼ Apple ◼ What is the most important product of Apple? ◼ iPhone ◼ What is the most differentiable functionality of a smart phone today? ◼ Photography (arguably)

  3. Low-light Imaging

  4. Powerful Zoom

  5. Overview ◼ Image and Video Processing ▪ Learning to See in the Dark ▪ Zoom to Learn, Learn to Zoom ▪ Fast Image and Video Processing ▪ Reflection Removal ◼ Image and Video Synthesis ▪ Photographic Image Synthesis ▪ Semi-parametric Image Synthesis ▪ RGBD Future Video Prediction ▪ Fully Automatic Video Colorization

  6. Image and Video Processing

  7. Learning to See in the Dark

  8. Low-light Imaging Learning to See in the Dark A deep learning based Image Signal Processor Chen Chen, Qifeng Chen, Jia Xu, and Vladlen Koltun. Learning to See in the Dark, CVPR 2018

  9. Dataset

  10. Amplication Ratio

  11. Results

  12. Demo

  13. Results

  14. Zoom to Learn, Learn to Zoom

  15. Data Collection

  16. Data Collection

  17. What not just super-resolution with GANs? ◼ Existing super-resolution methods are trained on downsampled RGB images that contain little noise ◼ But in 8X digital zoom, noise is prominent ◼ RGB images are the output of ISP ▪ High frequency is removed by denoising ◼ We train our model to recover underlying high-frequency details from noisy input

  18. Contextual Bilateral Loss Contextual Loss A novel loss (CoBi) for measuring similarity of slightly misaligned image pairs

  19. Contextual Bilateral Loss

  20. Results

  21. Results

  22. Results

  23. Results https://youtu.be/xmCzET2GNk0 https://youtu.be/xmCzET2GNk0

  24. Going well

  25. A hazy day

  26. Dehazed image Nonlocal Dehazing [Berman et al. 2016]

  27. But not practical Nonlocal Dehazing takes a few seconds

  28. Alternative solutions? ◼ Use another method ▪ No state-of-the-art accuracy ◼ Accelerate implementation ▪ Time consuming ◼ Nonlinear Function Approximator ▪ Simple, general, accurate and fast

  29. Real-time performance Our approxminator runs at 30fps

  30. Fast Image Processing Qifeng Chen, Jia Xu, and Vladlen Koltun. Fast Image Processing with Fully-Convolutional Networks, ICCV 2017

  31. Results

  32. Demo

  33. Single Image Reflection Removal

  34. Data Collection

  35. Method

  36. Results

  37. Deep Image and Video Synthesis

  38. Art by Human Creation

  39. Art by Human Creation & AI

  40. Photographic image synthesis Input semantic layouts Synthesized images Qifeng Chen and Vladlen Koltun. Photographic Image Synthesis with Cascaded Refinement Networks. ICCV 2017

  41. Motivation ◼ Computer graphics ▪ Alternative route to photorealism ▪ Capture photographic appearance ▪ Fast image synthesis

  42. Motivation ◼ Artificial Intelligence ▪ Visual Imagination

  43. Our approach ◼ Cascaded refinement networks ◼ Perceptual Loss ◼ Diversity

  44. Cascaded refinement networks High Resolution

  45. Perceptual Loss

  46. Diversity

  47. Comparisons on Cityscapes

  48. Results on NYU dataset Tseung Kwan O, Kowloon

  49. User Study

  50. User study

  51. GTA5 and Demo Video

  52. Semi-parametric Image Synthesis Semantic layouts Our result Xiaojuan Qi, Qifeng Chen, Jiaya Jia, and Vladlen Koltun Semi-parametric Image Synthesis. CVPR 2018

  53. Image Synthesis Semantic layouts Our result NYU dataset [Silberman et al. ECCV 2012] ADE20K dataset [Zhou et al. 2017]

  54. Prior Work: Parametric Models Pix2pix [Isola et al. 2017] CRN [Chen and Koltun 2017]

  55. Prior Work: Non-parametric Models Scene Completion using Millions of Photographs [Hays and Efros 2007]

  56. Our Approach … Sky … Forest … Mountain … Grass External memory

  57. Our Approach … Sky Sky … Forest Mountain Forest Grass … Mountain Semantic layout … Grass External memory

  58. Our Approach … Sky Sky … Forest Mountain Forest Grass … Mountain Semantic layout … Grass External memory

  59. Our Approach Stage 1: Canvas Generation Canvas Retrieved segments

  60. Our Approach Stage 2: Image Synthesis Sky Forest Mountain Canvas Final result Grass Semantic layout

  61. SIMS: Canvas Generation Semantic layout … … … … Car Building External memory

  62. SIMS: Canvas Generation Semantic layout … … … … … Car Building External memory Retrieved segments

  63. SIMS: Canvas Generation Semantic layout … … … … Transformation … … Car Building network Transformed External memory Retrieved segments segments

  64. SIMS: Canvas Generation Semantic layout … Ordering network … … … Transformation … … Car Building Canvas network Transformed External memory Retrieved segments segments

  65. SIMS: Image Synthesis Semantic layout Canvas

  66. SIMS: Image Synthesis Semantic layout Convolution Upsampling Pooling Canvas Synthesis network f

  67. SIMS: Image Synthesis Semantic layout Output Convolution Upsampling Pooling Canvas Synthesis network f

  68. Results

  69. Semantic layout

  70. Pix2pix [Isola et al. 2017]

  71. CRN [Chen and Koltun 2017]

  72. Our result

  73. Diversified Synthesis

  74. Image Statistics Mean Power Spectrum Pix2pix [Isola et al. 2017] Real images

  75. Image Statistics CRN [Chen and Koltun 2017] Real images

  76. Image Statistics Our approach Real images

  77. Perceptual Experiments Cityscape Cityscap Cityscap NYU ADE20K Mean s es es (fine) (coarse) (coarse ) (fine) (GTA5) SIMS > 94.2% 98.1% 95.7% 94.9% 87.6% 94.1% Pix2pix SIMS > CRN 93.9% 74.1% 84.5% 89.1% 88.9% 86.1%

  78. Thank You

  79. Thank You

  80. Thank You

  81. Thank You

  82. Future Prediction

  83. Video Prediction

  84. 3D Motion Decomposition for RGBD Future Dynamic Scene Synthesis

  85. 3D Motion Decomposition for RGBD Future Dynamic Scene Synthesis

  86. Results

  87. Results

  88. Results

  89. Video Colorization

  90. Fully Automatic Video Colorization with Self Regularization and Diversity

  91. Diversity

  92. Results

  93. Thank You https://cqf.io

Recommend


More recommend