Department of Computer Science IV, University of Mannheim, Germany

Stephan Kopf Department of Computer Science IV University of Mannheim, Germany

• Motivation
• Part I: Basic Retargeting Operations
  ◦ Scaling and cropping
  ◦ Regions of interest
  ◦ Automatic crop & scale
  ◦ Sports video adaptation
• Part II: Seam Carving
  ◦ Seam carving for images
  ◦ Preservation of straight lines
  ◦ Fast seam carving for videos
• Summary
Stephan Kopf 15.02.2011

• Mobile phones are multimedia devices that can
  ◦ browse the Web
  ◦ display images and videos
  ◦ support novel input technologies (multi-touch)
• But they still have limitations:
  ◦ Small screen size
  ◦ Wireless connection (bandwidth)
  ◦ Computational power (CPU, memory)
  ◦ Battery

• Typical resolutions of images and videos
  ◦ Digital camera: 10 megapixels (3,600 x 2,700 pixels)
  ◦ Camcorder: high definition (1,920 x 1,080 pixels)
  ◦ Mobile phone: 240 x 320 pixels
[Figure: HD video, mobile phone]
• Bitrate: 24 Mbit/s
• Distortions caused by scaling (aspect ratio)

Goals of media retargeting
• Shrink photos and videos for presentation on a mobile phone (this automatically limits the bitrate)
• Keep aspect ratio
• Preserve the most important visual content
• Algorithms for image and video retargeting

• Shrink image (merge pixels) by a fixed scale factor (uniform scaling)
• Different scale factors for each axis change the aspect ratio (non-uniform scaling)
• Relevance of image content is ignored
• "Letterboxing" is used to preserve aspect ratio
• Example: [figure]
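Uniform scaling with letterboxing can be sketched as follows. This is a minimal illustration, not code from the talk; the function name `letterbox` and the choice to center the scaled image are assumptions.

```python
def letterbox(src_w, src_h, dst_w, dst_h):
    """Uniform scaling with letterboxing (hypothetical helper).

    Returns (new_w, new_h, pad_x, pad_y): the scaled image size and the
    width of the bars on the left/right and top/bottom of the display.
    """
    # One scale factor for both axes keeps the aspect ratio.
    scale = min(dst_w / src_w, dst_h / src_h)
    new_w = max(1, round(src_w * scale))
    new_h = max(1, round(src_h * scale))
    # The remaining display area is filled with bars (assumed centered).
    pad_x = (dst_w - new_w) // 2
    pad_y = (dst_h - new_h) // 2
    return new_w, new_h, pad_x, pad_y


# HD video shown on a 320 x 240 display
print(letterbox(1920, 1080, 320, 240))
```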

• Crop image borders until aspect ratios of image and display match
• Relevance of image content is ignored: important content may be lost
• Typically use scaling to convert to target size
• Example: [figure]
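A small sketch of content-agnostic cropping to the display's aspect ratio. The helper name `center_crop_box` and the decision to crop symmetrically around the image center are assumptions made for illustration.

```python
def center_crop_box(src_w, src_h, dst_w, dst_h):
    """Largest centered crop with the target aspect ratio (hypothetical helper).

    Returns (left, top, width, height) in source pixel coordinates.
    """
    # Compare aspect ratios using integer cross-multiplication.
    if src_w * dst_h > src_h * dst_w:
        # Source is wider than the display: crop left and right borders.
        crop_h = src_h
        crop_w = src_h * dst_w // dst_h
    else:
        # Source is taller than the display: crop top and bottom borders.
        crop_w = src_w
        crop_h = src_w * dst_h // dst_w
    left = (src_w - crop_w) // 2
    top = (src_h - crop_h) // 2
    return left, top, crop_w, crop_h


print(center_crop_box(1920, 1080, 320, 240))
```

The cropped region would then be scaled uniformly to the target size.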

Idea
• Identify most relevant image regions (regions of interest)
• Crop borders but preserve regions of interest
• Use automatic algorithms to identify regions of interest:
  ◦ Saliency maps
  ◦ Faces
  ◦ Text regions

• Assumption: image regions that are relevant for an observer have a high contrast
• Step 1: Contrast map of an image of size $n \times m$: $C_{i,j} = \sum_{q \in \Theta} d(p_{i,j}, q)$
  ◦ color of a pixel: $p_{i,j}$
  ◦ pixel in local neighborhood $\Theta$ of $p_{i,j}$: $q$
  ◦ distance function: $d(\cdot)$
• Step 2: Quantize contrast map
• Step 3: Find connected regions
• Step 4: Mark region of interest
*Source: Ma and Zhang: Contrast-based image attention analysis by using fuzzy growing, ACM Intl. Conf. on Multimedia, 2003
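The four steps can be sketched on a grayscale image stored as a list of rows. This is a simplified illustration under stated assumptions, not the method of Ma and Zhang: the distance function is taken to be the absolute gray-value difference, the neighborhood is the 8 adjacent pixels, quantization is a plain binary threshold (the cited paper uses fuzzy growing), and the region of interest is the bounding box of the largest 4-connected region.

```python
from collections import deque


def contrast_map(img):
    """Step 1: sum of absolute gray-value differences to the 8 neighbours."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            total = 0
            for di in (-1, 0, 1):
                for dj in (-1, 0, 1):
                    ni, nj = i + di, j + dj
                    if (di or dj) and 0 <= ni < h and 0 <= nj < w:
                        total += abs(img[i][j] - img[ni][nj])
            out[i][j] = total
    return out


def quantize(cmap, threshold):
    """Step 2 (simplified assumption): binary quantization by a threshold."""
    return [[1 if v >= threshold else 0 for v in row] for row in cmap]


def largest_region_bbox(mask):
    """Steps 3 and 4: find 4-connected regions, return the bounding box
    (top, left, bottom, right) of the largest one, or None if mask is empty."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    best = []
    for si in range(h):
        for sj in range(w):
            if mask[si][sj] and not seen[si][sj]:
                region, queue = [], deque([(si, sj)])
                seen[si][sj] = True
                while queue:  # breadth-first flood fill
                    i, j = queue.popleft()
                    region.append((i, j))
                    for ni, nj in ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1)):
                        if (0 <= ni < h and 0 <= nj < w
                                and mask[ni][nj] and not seen[ni][nj]):
                            seen[ni][nj] = True
                            queue.append((ni, nj))
                if len(region) > len(best):
                    best = region
    if not best:
        return None
    rows = [i for i, _ in best]
    cols = [j for _, j in best]
    return min(rows), min(cols), max(rows), max(cols)


# One bright pixel on a dark background (made-up test image)
img = [[0] * 5 for _ in range(5)]
img[2][2] = 100
cmap = contrast_map(img)
print(largest_region_bbox(quantize(cmap, 100)))
```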

[Figure: contrast map, quantized contrast map, region of interest, bounding box]

• Use automatic face detection algorithms to localize face regions
• Frontal face detection algorithms are very robust (in contrast to face recognition)

• Characteristic features of text:
  ◦ horizontal alignment
  ◦ significant luminance difference between text and background
  ◦ the character size is within a certain range
  ◦ single-colored
  ◦ text is visible in consecutive frames (video)
  ◦ horizontal or vertical motion is possible (video)
• Calculate a horizontal projection profile to detect the boundaries of text lines
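A horizontal projection profile counts the candidate text pixels in each row; text lines show up as runs of rows with high counts. The sketch below assumes a binary mask of candidate text pixels is already available; the function name and the `min_count` parameter are hypothetical.

```python
def text_line_bounds(mask, min_count):
    """Detect text-line boundaries from a horizontal projection profile.

    mask: binary image (list of rows, 1 = candidate text pixel).
    Returns a list of (first_row, last_row) for each run of rows whose
    pixel count is at least min_count.
    """
    profile = [sum(row) for row in mask]  # one count per image row
    bounds, start = [], None
    for y, count in enumerate(profile):
        if count >= min_count and start is None:
            start = y                      # a text line begins
        elif count < min_count and start is not None:
            bounds.append((start, y - 1))  # the text line ended one row above
            start = None
    if start is not None:                  # line reaches the bottom border
        bounds.append((start, len(profile) - 1))
    return bounds


mask = [
    [0, 0, 0, 0],
    [1, 1, 1, 0],
    [1, 1, 0, 1],
    [0, 0, 0, 0],
    [0, 1, 1, 1],
]
print(text_line_bounds(mask, min_count=3))
```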

• Calculate importance value $V$ for each region of size $H$:
  ◦ minimum perceptible size: $H_{min}$
  ◦ maximum reasonable size: $H_{max}$
• Find optimal target region $W$ based on regions of interest $S_i$:

[Figure: selection of one feature, combination of two features, … three features, full image]

[Figure: scaling, crop & scale, cropping]

[Figure: scaled video, modify video content]
Automatically detect:
• Court lines
• Players
• Ball
*Source: Kopf, Guthier, Farin, Han: Analysis and Retargeting of Ball Sports Video, IEEE Workshop on Applications of Computer Vision, 2011

• Step 1: Mark bright pixels (line pixels)
• Step 2: Algorithm to detect straight lines (based on RANSAC)
  1. Randomly select two line pixels and calculate line parameters
  2. Count the number of white pixels N located on the line
  3. If N > threshold, stop
  4. Go to 1.
• Step 3: Remove line pixels and detect next line (Step 2)
RANSAC: Fischler, Bolles: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography, Communications of the ACM, vol. 24(6), 1981.
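Steps 2 and 3 can be sketched as below. The list of line pixels is assumed to come from Step 1. The distance tolerance `tol`, the iteration cap `max_iter`, the fixed random seed and all function names are assumptions added so that the loop terminates and is reproducible; the slide itself only specifies the threshold on N.

```python
import math
import random


def ransac_line(points, threshold, tol=0.5, max_iter=1000, rng=None):
    """Step 2: return ((a, b, c), inliers) for a line a*x + b*y + c = 0
    supported by more than `threshold` points, or None if none is found."""
    rng = rng or random.Random(0)  # fixed seed: an assumption for repeatability
    for _ in range(max_iter):
        # 1. Randomly select two line pixels and calculate line parameters.
        (x1, y1), (x2, y2) = rng.sample(points, 2)
        a, b = y2 - y1, x1 - x2
        norm = math.hypot(a, b)
        if norm == 0:  # identical points, no line defined
            continue
        c = -(a * x1 + b * y1)
        # 2. Count pixels located on the line (within distance tol).
        inliers = [(x, y) for x, y in points
                   if abs(a * x + b * y + c) / norm <= tol]
        # 3. Stop if enough pixels support the line; 4. otherwise try again.
        if len(inliers) > threshold:
            return (a / norm, b / norm, c / norm), inliers
    return None


def detect_lines(points, threshold, max_lines=10):
    """Step 3: repeatedly detect a line and remove its pixels."""
    remaining = list(points)
    lines = []
    while len(remaining) >= 2 and len(lines) < max_lines:
        found = ransac_line(remaining, threshold)
        if found is None:
            break
        params, inliers = found
        lines.append(params)
        inlier_set = set(inliers)
        remaining = [p for p in remaining if p not in inlier_set]
    return lines


# Synthetic line pixels: one horizontal and one vertical court line
pixels = [(x, 0) for x in range(20)] + [(5, y) for y in range(1, 20)]
print(len(detect_lines(pixels, threshold=15)))
```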

• Problem: The position of the lines changes from frame to frame
• Solution: use a reference court model to estimate camera motion
  ◦ Step 1: Calculate intersection points of two lines
  ◦ Step 2: Transform lines to court model
• How many intersection points do we need for the transformation?
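The intersection point of two detected lines can be computed with Cramer's rule. The sketch assumes each line is given in the implicit form a*x + b*y + c = 0; this representation and the function name are illustrative choices, not taken from the talk.

```python
def intersect(line1, line2, eps=1e-9):
    """Intersection of two lines given as (a, b, c) with a*x + b*y + c = 0.

    Returns (x, y), or None if the lines are (nearly) parallel.
    """
    a1, b1, c1 = line1
    a2, b2, c2 = line2
    det = a1 * b2 - a2 * b1
    if abs(det) < eps:
        return None
    x = (b1 * c2 - b2 * c1) / det
    y = (a2 * c1 - a1 * c2) / det
    return x, y


# Vertical line x = 5 and horizontal line y = 2
print(intersect((1, 0, -5), (0, 1, -2)))
```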

• Translation (horizontal/vertical shift) → 1 intersection point
• Translation and scaling → 2 intersection points
• Affine transform (translation, scaling, rotation) → 3 intersection points
• Perspective transform → 4 intersection points
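Each point correspondence contributes two linear equations, so the eight unknowns of a perspective transform require four intersection points. The sketch below estimates the 3x3 matrix from exactly four correspondences by solving the resulting 8x8 system. The normalization h33 = 1 and all function names are assumptions, and the small Gaussian-elimination solver is only there to keep the example free of external libraries.

```python
def solve_linear(a, b):
    """Solve a * x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("degenerate point configuration")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= f * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]


def homography(src, dst):
    """Perspective transform H (with h33 = 1) mapping src[i] to dst[i]."""
    a, b = [], []
    for (x, y), (u, v) in zip(src, dst):
        # u = (h1*x + h2*y + h3) / (h7*x + h8*y + 1), same for v
        a.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        b.append(u)
        a.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        b.append(v)
    h = solve_linear(a, b)
    return [h[0:3], h[3:6], [h[6], h[7], 1.0]]


def apply_h(hm, x, y):
    """Map point (x, y) with the 3x3 matrix hm (homogeneous division)."""
    w = hm[2][0] * x + hm[2][1] * y + hm[2][2]
    return ((hm[0][0] * x + hm[0][1] * y + hm[0][2]) / w,
            (hm[1][0] * x + hm[1][1] * y + hm[1][2]) / w)


# Made-up example: image intersection points -> court model corners
image_pts = [(10, 20), (30, 22), (28, 45), (8, 40)]
court_pts = [(0, 0), (1, 0), (1, 1), (0, 1)]
H = homography(image_pts, court_pts)
print(apply_h(H, 30, 22))
```

With more than four points, a least-squares fit would be the usual choice; that is outside this sketch.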

[Figure: cropping, scaling, crop & scale (zoom on largest player), modify lines & ball]

• If important content is located near image borders, crop & scale is not applicable
Idea of seam carving*
• Systematic removal of less important pixels
• Use an energy function as a measure of the "importance" of single pixels
*Source: Shai Avidan and Ariel Shamir: Seam Carving for Content-Aware Image Resizing. ACM SIGGRAPH, 2007

Image width should be reduced by 40 percent
[Figure: original image, energy map]

• Remove the N pixels with the lowest energy from each row
[Figure: source image; result after removing N=200 pixels from each row based on energy values]

• Sum up the energy in each column of the image and remove the N columns with the lowest energy
[Figure: original image; result after removing 200 columns based on energy values of columns]
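Both naive strategies are easy to state in code, which also makes their weaknesses visible: per-row removal shifts each row independently, and column removal ignores where in the column the energy is concentrated. The sketch assumes the image and its energy map are lists of rows of equal size; the function names are hypothetical.

```python
def remove_lowest_per_row(image, energy, n):
    """Remove the n lowest-energy pixels from every row independently."""
    out = []
    for pixels, e_row in zip(image, energy):
        order = sorted(range(len(e_row)), key=lambda x: e_row[x])
        drop = set(order[:n])
        out.append([p for x, p in enumerate(pixels) if x not in drop])
    return out


def remove_lowest_columns(image, energy, n):
    """Remove the n columns with the lowest summed energy."""
    width = len(energy[0])
    col_energy = [sum(row[x] for row in energy) for x in range(width)]
    order = sorted(range(width), key=lambda x: col_energy[x])
    drop = set(order[:n])
    return [[p for x, p in enumerate(row) if x not in drop] for row in image]


# Made-up 2 x 3 image and energy map
image = [[1, 2, 3], [4, 5, 6]]
energy = [[5, 1, 9], [0, 7, 3]]
print(remove_lowest_per_row(image, energy, 1))
print(remove_lowest_columns(image, energy, 1))
```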

• A vertical seam is an 8-connected path of pixels from top to bottom that contains one and only one pixel in each row.
• Formal definition:
  $s^x = \{s_i^x\}_{i=1}^{n} = \{(x(i), i)\}_{i=1}^{n}$, subject to $\forall i: |x(i) - x(i-1)| \le 1$
• Horizontal seams are defined analogously.

• Advantage of seams compared to columns or rows:
  ◦ Pixels of low energy are removed
  ◦ Relevant objects are preserved

• Remove the vertical seam with the lowest energy
• Repeat this step N times
[Figure: source image; result after removing N=200 seams based on lowest energy]

• Seam carving uses an energy function that characterizes the relevance of each pixel (similar to saliency maps).
• The optimal seam minimizes the cumulated pixel energy of all seam pixels.
• Method to find optimal seam: dynamic programming

• $M(i, j)$ specifies the cost of the optimal (vertical) seam from the upper image border to pixel position $(i, j)$
• Calculate $M(i, j)$ recursively:
  $M(i, j) = e(i, j) + \min\bigl( M(i-1, j-1),\; M(i-1, j),\; M(i-1, j+1) \bigr)$
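The recurrence translates directly into a dynamic-programming sketch, with i as the row index and j as the column index. The first row of M is initialized with the energy values, and the seam is recovered by backtracking from the cheapest entry in the last row. Function names and the small energy map are made up for illustration; ties are broken toward the leftmost candidate, which is an arbitrary choice.

```python
def cumulative_energy(energy):
    """Fill M(i, j) = e(i, j) + min of the three upper neighbours."""
    w = len(energy[0])
    m = [energy[0][:]]  # first row: M(0, j) = e(0, j)
    for i in range(1, len(energy)):
        prev = m[-1]
        m.append([energy[i][j] + min(prev[max(j - 1, 0):min(j + 2, w)])
                  for j in range(w)])
    return m


def optimal_seam(energy):
    """Return the column index of the optimal vertical seam for each row."""
    m = cumulative_energy(energy)
    h, w = len(m), len(m[0])
    j = min(range(w), key=lambda c: m[-1][c])  # cheapest seam end point
    seam = [j]
    for i in range(h - 1, 0, -1):              # backtrack to the top row
        lo, hi = max(j - 1, 0), min(j + 2, w)
        j = min(range(lo, hi), key=lambda c: m[i - 1][c])
        seam.append(j)
    seam.reverse()
    return seam


def remove_seam(image, seam):
    """Delete one pixel per row; the image gets one column narrower."""
    return [row[:x] + row[x + 1:] for row, x in zip(image, seam)]


# Made-up 3 x 3 energy map
e = [[1, 4, 3],
     [5, 2, 6],
     [7, 8, 1]]
print(cumulative_energy(e))  # [[1, 4, 3], [6, 3, 9], [10, 11, 4]]
print(optimal_seam(e))       # [0, 1, 2]
```

Removing N seams means repeating energy computation, `optimal_seam` and `remove_seam` N times, since the energy map changes after each removal.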

• Example of how to calculate the optimal seam:
  2 5 1 4 2 5 1 1 4 1 2 3 4 3 3 3 4 5 1 2 3 3 4 5 6 6 7 5 4 4 1 9 8 9 7 7
  [Figure: energy map, cumulated energy map M(i, j)]
  $M(i, j) = e(i, j) + \min\bigl( M(i-1, j-1),\; M(i-1, j),\; M(i-1, j+1) \bigr)$

• Image gradient: simple energy function that calculates the luminance difference to adjacent pixels:
  $e(I(x, y)) = \left| \frac{\partial}{\partial x} I(x, y) \right| + \left| \frac{\partial}{\partial y} I(x, y) \right|$
• Assumption: Luminance values do not differ much in image regions of low relevance
• This simple energy function gives good results in many cases
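A discrete version of this gradient energy can be sketched on a luminance image stored as a list of rows. Approximating the partial derivatives by differences between the left/right and upper/lower neighbours, with indices clamped at the image borders, is an assumption; other difference schemes work as well.

```python
def gradient_energy(lum):
    """e = |dI/dx| + |dI/dy| using neighbour differences, clamped at borders."""
    h, w = len(lum), len(lum[0])
    energy = []
    for y in range(h):
        row = []
        for x in range(w):
            dx = lum[y][min(x + 1, w - 1)] - lum[y][max(x - 1, 0)]
            dy = lum[min(y + 1, h - 1)][x] - lum[max(y - 1, 0)][x]
            row.append(abs(dx) + abs(dy))
        energy.append(row)
    return energy


# Made-up image with a vertical edge between the second and third column
lum = [[0, 0, 10],
       [0, 0, 10],
       [0, 0, 10]]
print(gradient_energy(lum))
```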

• Problem: The lighthouse is an important region, but its pixel values are very similar
[Figure: original image, optimal seams, result]

• Combine energy function with saliency map
  $e_{sal}(I(x, y)) = w_s \cdot saliency(x, y) + e(I(x, y))$
[Figure: saliency map, optimal seams, result ($e_{sal}$ is used as energy function)]
Source: Hwang and Chien. Content-Aware Image Resizing using Perceptual Seam Carving with Human Attention Model. IEEE Conference on Multimedia and Expo, 2008.

• Use results from face detection as additional saliency:
  $e_{sal+face}(I(x, y)) = w_s \cdot saliency(x, y) + w_f \cdot face(x, y) + e(I(x, y))$
[Figure: saliency map, face map, seams based on $e_{sal+face}$ as energy function, result]
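The combined energy is a per-pixel weighted sum of the gradient energy, the saliency map and, optionally, the face map. The sketch assumes all maps have the same size and comparable value ranges; the function name and the default weights of 1.0 are placeholders, not values from the talk.

```python
def combined_energy(gradient, saliency, face=None, w_s=1.0, w_f=1.0):
    """Per-pixel e + w_s * saliency (+ w_f * face if a face map is given)."""
    out = []
    for y, g_row in enumerate(gradient):
        row = []
        for x, g in enumerate(g_row):
            value = g + w_s * saliency[y][x]
            if face is not None:
                value += w_f * face[y][x]
            row.append(value)
        out.append(row)
    return out


# Made-up 1 x 2 maps
gradient = [[1, 2]]
saliency = [[0.5, 0.0]]
face = [[0, 1]]
print(combined_energy(gradient, saliency, face, w_s=2.0, w_f=3.0))
```

The resulting map can be used in place of the plain gradient energy when searching for the optimal seam.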
