Learning Human Pose from Unaligned Data through Image Translation - PowerPoint PPT Presentation

Learning Human Pose from Unaligned Data through Image Translation Tomas Jakab Ankush Gupta Andrea Vedaldi Hakan Bilen Presented by Triantafyllos Afouras

Goal Learn human-body landmark detectors from unlabelled videos and unaligned annotations Human images Unaligned poses … , … Pose estimate

Model architecture

Autoencoding reconstruction input image code encoder decoder

Autoencoding reconstruction input image code encoder decoder not interpretable L

Filtering geometric information reconstruction input image 2D keypoints encoder decoder

Filtering geometric information reconstruction input image 2D keypoints encoder decoder no appearance information for image reconstruction L

Conditional generation reconstruction input images 2D keypoints geometry decoder encoder appearance encoder

Result: unsupervised 2D keypoints discovery Unsupervised learning of object landmarks through conditional image generation . Jakab, Gupta, Bilen, Vedaldi. Proc. NeurIPS, 2018

Unsupervised 2D keypoints discovered landmarks what we actually want vs. Unsupervised learning of object landmarks through conditional image generation . Jakab, Gupta, Bilen, Vedaldi. Proc. NeurIPS, 2018

Learning to label as image translation reconstruction input image bottleneck encoder decoder discriminator looks like a skeleton?

Image Translation = CycleGAN rgb rgb rgb skeleton skeleton (reconstruction) Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks . Zhu and Park et al., 2017.

needs to smuggle appearance to facilitate Cheating CycleGANs the reconstruction reconstruction input image bottleneck encoder decoder discriminator looks like a skeleton?

Cheating CycleGANs source reconstruction bottleneck log-bottleneck The model cheats and encodes appearance information together with geometry smuggling the appearance

Tightening the screw pre-trained offline skeleton reconstruction input image handcrafted bottleneck image keypoint analytical encoder decoder detector renderer style image discriminator appearance encoder looks like a skeleton?

Our model in detail clean skeleton skeleton detected reconstruction input image images image keypoints analytical keypoint encoder decoder renderer detector unpaired style image skeleton images discriminator appearance encoder looks like a skeleton?

Results

Human pose estimation Simplified Human3.6M Dataset prediction Human3.6M prediction Pennaction prediction

Human pose estimation Pennaction Human3.6M

Unsupervised to labeled keypoints unsupervised methods our method what we actually want directly predicting labelled keypoints discovered landmarks supervised linear regression

Human pose estimation unsupervised discovery + supervised regression 20.0 8.0 %-MSE norm. by image size 7.0 19.5 6.0 19.0 5.0 18.5 MSE in pixels 4.0 no paired data 18.0 3.0 17.5 2.0 17.0 1.0 0.0 16.5 hourglass Thewlis et al. Zhang et al. ours hourglass ours ours (supervised) (supervised) (supervised) Simplified Human3.6M Human3.6M

Ablations 4.5 %-MSE norm. by image size Simplified Human3.6M 4.0 3.5 3.0 2.5 CycleGAN + apperance + clean (analytical) - 2nd cycle = ours conditioning skeleton renderer bottleneck

Disentangling style and geometry

Disentangling style and geometry Mixing appearance and geometry by conditioning on a different identity geometry style reconstruction

Conclusion Learn landmark detectors from unlabeled videos and unaligned pose annotations . Using no paired data / labelled images . Prevent appearance leakage in CycleGAN through: (a) novel bottleneck with a differentiable sketch renderer . (b) Conditioning the generator on an appearance image. Outperform state-of-the-art supervised and unsupervised landmark detectors for human pose. Method factorizes object appearance and geometry → transfer style / pose.

Learning Human Pose from Unaligned Data through Image Translation www.robots.ox.ac.uk/~vgg/ research/unsupervised_pose/ Tomas Jakab Ankush Gupta Andrea Vedaldi Hakan Bilen Presented by Triantafyllos Afouras

Learning Human Pose from Unaligned Data through Image Translation - PowerPoint PPT Presentation

Learning Human Pose from Unaligned Data through Image Translation Tomas Jakab Ankush Gupta Andrea Vedaldi Hakan Bilen Presented by Triantafyllos Afouras Goal Learn human-body landmark detectors from unlabelled videos and unaligned

Human Pose Estimation by Yannic Jnike - 04.11.2019 https://www.youtube.com/watch?v=mxKlUO_tjcg

Chicken Human 1 Human 2 Rat Chicken Human 1 Human 2 Rat Chicken Human 1 Human 2 Rat

Hand Pose Estimation Matthew Krenik Advisor: Fabrizio Pece Agenda What is Hand Pose

LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking Authors: Guanghan Ning,

Image Restoration Image Enhancement and Image Restoration both deal with improving images. Image

Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image Denis Tom

Tsinghua University Monocular Depth-Pose Prediction [R, t] Depth and Pose RGB PoseNet

Helping RE with LLVM lionel@lse.epita.fr 1) Reverse Engineering 2) Obfuscation objectives -

Human Pose Estimation and Action Recognition Gang Yu, Megvii (Face++) Junsong Yuan, SUNY Buffalo

Human Pose Search using Deep Poselets Nataraj Jammalamadaka * Andrew Zisserman C. V. Jawahar *

Chirality Nets for Human Pose Regression Raymond A. Yeh, Yuan-Ting Hu, Alexander G. Schwing

Fields of Parts & Friends peter.gehler.net p i Detection + Geometry p i Human Pose

Human Pose Recovery And Gesture Recognition CS365 : Artificial Intelligence Khandesh

FORUM REPORT - Grandparents and the Law BACKGROUND COTA NSW is a politically unaligned consumer

Unaligned Rebound Attack Application to K ECCAK Alexandre Duc 1 , Jian Guo 2 , Thomas Peyrin 3 and

Recovering dialect geography from an unaligned comparable corpus Yves Scherrer LATL, Department

Understanding the Role of IO as a Bottleneck Morgan Tocker firstname @percona.com 1 Wednesday,

Role of Pricing in Leveraging Market Power Role of Pricing in Leveraging Market Power Tom Hird

Freight Advisory Council June 20, 2014 1 Freight planning input Two exercises: National

FREEING UP MELBOURNE'S BIGGEST BOTTLENECK PRESENTATION TO THE TOURISM INDUSTRY 29 AUGUST 2017

Motivation Pursuing product variety and innovation is an important competition strategy (Adner and

SANDVIK MATERIALS TECHNOLOGY PRIMARY PRODUCTS 1 SAFETY FIRST Sandviks objective is zero harm

MIDAS MANAGED INTELLIGENT DECONFICTION AND SCHEDULING FOR SATELLITE COMMUNICATION IEEE Aerospace

A new way to pro fi le Node . js Matteo Collina Maximum number of servers sales traf fi c

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Learning Human Pose from Unaligned Data through Image Translation - PowerPoint PPT Presentation

Learning Human Pose from Unaligned Data through Image Translation Tomas Jakab Ankush Gupta Andrea Vedaldi Hakan Bilen Presented by Triantafyllos Afouras Goal Learn human-body landmark detectors from unlabelled videos and unaligned

Human Pose Estimation by Yannic Jnike - 04.11.2019 https://www.youtube.com/watch?v=mxKlUO_tjcg

Chicken Human 1 Human 2 Rat Chicken Human 1 Human 2 Rat Chicken Human 1 Human 2 Rat

Hand Pose Estimation Matthew Krenik Advisor: Fabrizio Pece Agenda What is Hand Pose

LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking Authors: Guanghan Ning,

Image Restoration Image Enhancement and Image Restoration both deal with improving images. Image

Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image Denis Tom

Tsinghua University Monocular Depth-Pose Prediction [R, t] Depth and Pose RGB PoseNet

Helping RE with LLVM lionel@lse.epita.fr 1) Reverse Engineering 2) Obfuscation objectives -

Human Pose Estimation and Action Recognition Gang Yu, Megvii (Face++) Junsong Yuan, SUNY Buffalo

Human Pose Search using Deep Poselets Nataraj Jammalamadaka * Andrew Zisserman C. V. Jawahar *

Chirality Nets for Human Pose Regression Raymond A. Yeh*, Yuan-Ting Hu*, Alexander G. Schwing

Fields of Parts &amp; Friends peter.gehler.net p i Detection + Geometry p i Human Pose

Human Pose Recovery And Gesture Recognition CS365 : Artificial Intelligence Khandesh

FORUM REPORT - Grandparents and the Law BACKGROUND COTA NSW is a politically unaligned consumer

Unaligned Rebound Attack Application to K ECCAK Alexandre Duc 1 , Jian Guo 2 , Thomas Peyrin 3 and

Recovering dialect geography from an unaligned comparable corpus Yves Scherrer LATL, Department

Understanding the Role of IO as a Bottleneck Morgan Tocker firstname @percona.com 1 Wednesday,

Role of Pricing in Leveraging Market Power Role of Pricing in Leveraging Market Power Tom Hird

Freight Advisory Council June 20, 2014 1 Freight planning input Two exercises: National

FREEING UP MELBOURNE'S BIGGEST BOTTLENECK PRESENTATION TO THE TOURISM INDUSTRY 29 AUGUST 2017

Motivation Pursuing product variety and innovation is an important competition strategy (Adner and

SANDVIK MATERIALS TECHNOLOGY PRIMARY PRODUCTS 1 SAFETY FIRST Sandviks objective is zero harm

MIDAS MANAGED INTELLIGENT DECONFICTION AND SCHEDULING FOR SATELLITE COMMUNICATION IEEE Aerospace

A new way to pro fi le Node . js Matteo Collina Maximum number of servers sales traf fi c

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Chirality Nets for Human Pose Regression Raymond A. Yeh, Yuan-Ting Hu, Alexander G. Schwing

Fields of Parts & Friends peter.gehler.net p i Detection + Geometry p i Human Pose