Fine-grained Visual Analysis: From Classification to Retrieval - PowerPoint PPT Presentation

Fine-grained Visual Analysis: From Classification to Retrieval Yi-Zhe Song SketchX Lab, CVSSP, University of Surrey, UK http://sketchx.ai

Why fine-grained? Dog Dog Dog I am not just a “dog”   

Why fine-grained? Husky Chihuahua Bulldog Better ☺ At the very heart of human and computer vision!!

What is fine-grained? • Surveys + Seminars exist • a good survey [1] • First Edition of 见微知著 (2019 年 12 月 11 日 ) • Classification + Retrieval most studied • Classification being the favourite child • Images → video, 3D, text • Recent branching to generation, transfer learning, hashing… [1] [1] Deep Learning for Fine-Grained Image Analysis: A Survey. Xiu-Shen Wei, Jianxin Wu, and Quan Cui. arXiv: 1907.03069, 2019.

Classification vs. Retrieval • “ The Curse of the Labels ” • Classification → hard to obtain expert labels • Retrieval → one can not retrieve without knowing the label The only two that I know!

Problem with Classification • Dataset! Dataset! Dataset! → Label! Label! Label! • Obsession with parts • Explicit to start with • Now implicit as well → part is not everything B-CNN (ICCV15) Pairwise confusion (ECCV18) MA-CNN (ICCV17) [1] PMG (ECCV20) MC-Loss (TIP20) NTS-Net (ECCV18) Explicit Models Implicit Models

Problem with Retrieval • Ill-posed to start with → where do we get the labels? • Retrieval dictates expert knowledge to start with! • Best input modality? • Yes, there is image (but is it the only choice?) • Human subjectivity → text best for that (?) • There is just not enough work!

All about Retrieval • Is the old “fine - grained” enough? → more than just names (labels)! • Pose, instance-level details • “a Labrador standing on two feet, looking at the camera with a smile ” • Latent sub-classes • Labrador → English Labrador and American Labrador • Flexibility to meet human subjectivity • as flexible as text? • What would be the best input modality ? • More practical with real application scenarios?

Sketch for Retrieval NO FLEXIBLE & IMPRECISE FLEXIBILITY EXACT Text Image Sketch Customised list of closely Many irrelevant results Lots of very similar images relevant images To be explored

Sketch for Retrieval • Specific challenges • Cross-modal • Human subjectivity • Learning under small data

Sketch for Retrieval

FG-SBIR : F ine- G rained S ketch- B ased I mage R etrieval FG-SBIR 1.0 – pose correspondence FG-SBIR 3.0 – on-the-fly retrieval (BMVC’15) (CVPR’20 Oral) Ours Baseline FG-SBIR 2.0 – instance correspondence (CVPR’16 Oral, SIGGRAPH’16, ICCV’17, 3xECCV’18, Ours CVPR’19 Oral, CVPR’20) Baseline

FG-SBIR : F ine- G rained S ketch- B ased I mage R etrieval • Dataset usually very small • ImageNet pre-training is thus a must + fine-tuning. • Triplet Ranking Network • pushing positive sketch-photo pairs near, and negatives apart. [1] Qian Yu, Feng Liu, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, Chen Change Loy, Sketch Me That Shoe , CVPR 2016 Oral

FG-SBIR : The Role of Jigsaw • Jigsaw puzzles helps with fine-grained [1] • See also [2] for classification [1] Kaiyue Pang, Yongxin Yang, Timothy Hospedales, Tao Xiang, Yi-Zhe Song, Solving Mixed-modal Jigsaw Puzzle for Fine-Grained Sketch-Based Image Retrieval , CVPR 2020 [2] Ruoyi Du, Dongliang Chang, Ayan Kumar Bhunia, Jiyang Xie, Yi-Zhe Song, Zhanyu Ma, Jun Guo. Fine-Grained Visual Classification via Progressive Multi- Granularity Training of Jigsaw Patches , ECCV 2020

FG-SBIR : The Role of Jigsaw • Solving a mixed-modality jigsaw model requires learning to: • Bridge the domain discrepancy • Understand holistic object configuration • Encode fine -grained detail. • A permutation inference problem • Normalisation via Sinkhorn iterations • Great performance boost to long standing practice of ImageNet pre-training.

FG-SBIR : The Role of Jigsaw NOTE: opposite conclusions for category-level task!

FG-SBIR : The Role of Jigsaw Effect of jigsaw modality Effect of jigsaw granularity • mixed-modal Jigsaw is the best • granularity of jigsaw not crucial

FG-SBIR : On-the-Fly Gallery Images Sketch Problem – “I can’t sketch” • Time taken to draw a complete sketch • Drawing skill of the user [1] Ayan Kumar Bhunia, Yongxin Yang, Timothy Hospedales, Tao Xiang, Yi-Zhe Song, Sketch Less for More: On-the-Fly Fine-Grained Sketch Based Image Retrieval , CVPR 2020 Oral

FG-SBIR : On-the-Fly Old Setup: sketch first, then retrieve OLD New On-the-fly Setup: retrieve as you sketch NEW Bingo! Less is more!

FG-SBIR : On-the-Fly • Natural : incomplete sketches can already retrieve! • Faster : no need to sketch the whole thing • More accurate : modelling the sketching process does help In most cases, we can retrieve with ~30% less strokes!

FG-SBIR : On-the-Fly • Reinforcement Learning (RL) for cross-modal modelling. • Reward design to encourage early retrieval • Rank optimization over a complete sketch drawing episode

FG-SBIR : On-the-Fly Quantitative Results vs Different Baselines (A@q, m@A, and m@B) Percentage-wise Results for Shoe-V2 (m@A, and m@B) Percentage-wise Results for Chair-V2 (m@A, and m@B)

Classification  Retrieval • Classification → Retrieval • Obvious • Retrieval → Classification [1] • Cure for web data? • Sub-class discovery? [1] Zhang C, Yao Y, Liu H, et al. Web-Supervised Network with Softly Update-Drop Training for Fine-Grained Visual Classification, AAAI. 2020

Conclusion • Fine-grained is important! • Classification bottlenecked • Retrieval needs more work • Unique challenges • Practical applications • Can help classification • Beyond 2D!

Fine-grained Visual Analysis: From Classification to Retrieval - PowerPoint PPT Presentation

Fine-grained Visual Analysis: From Classification to Retrieval Yi-Zhe Song SketchX Lab, CVSSP, University of Surrey, UK http://sketchx.ai Why fine-grained? Dog Dog Dog I am not just a dog Why fine-grained? Husky

Fine Grained Access Control Fine-Grained Access Control Fine Grained Access Control

Fine-Grained Access Control Fine Grained Access Control Fine-grained access control examples:

Fine-Grained Geographic Communication (Geocast) Nexus Workshop Frank Drr 23.07.2003 1

Average-Case Fine-Grained Hardness Marshall Ball Alon Rosen Manuel Sabin Prashant Nalini

Addressing Inter-Class Similarity in Fine-Grained Visual Classification Abhimanyu Dubey

Mechanized Verification of Fine-grained Concurrent Programs Ilya Sergey Aleks Nanevski

Junfeng Fan ESAT/COSIC ECC implementation methods Multi-core systems Coarse-Grained

Learning from Fine-Grained and Long-Tailed Visual Data Yin Cui Google Research Dec 11 2019

Phase Transition in 3SAT Yi Zhou Phase Transition in 3SAT Phase Transition in 3SAT Fine Grained

Combining Data-Intense and Compute-Intense Methods for Fine-Grained Morphological Analyses Petra

Fine-Grained Power Modeling for Smartphones Using System Call Tracing Based on paper and

Fine-Grained Tracking of Grid Infections Ashish Gehani SRI Basim Baig, Salman Mahmood, Dawood

Fine-grained Image Recognition Lei Wang VILA group School of Computing and Information

On the Correctness Criteria of Fine-Grained Access Control in Relational Databases Qihua Wang,

Fine Grained Coordinated Parallelism in a Real World Application Mohammad Rezaei, PhD June 2012

Enhancing Fine- Grained Parallelism Loop vectorization, Loop distribution, Scalar expansion

Lucity Specific IT/GIS Services Presenter: Luke Savage, GISP lsavage@lucity.com System Design

Teresa Oliviere, Head of Legal INTRODUCING RE:FIT Public sector bodies Partnership Energy E

Transforming SEND Services in Hertfordshire Schools Forum 26 June 2019 David Butcher Head

https://www.gov.uk/register-to-vote Can register here and at home, vote in both local elections

Charlie Garrod Chris Timperley 17-214 1 Administrivia Homework 4b due next Thursday, October

www.drupaleurope.org How to grow up skills by contribution to community Andriy Iun Drupal

PDDL PDDL Introduction Introduction WordPad WordPad PDDL PDDL

Greedy Algorithms Kleinberg and Tardos, Chapter 4 1 Selecting gas stations Road trip from

Sambuz

Useful Links

Newsletter

Mail Us