
Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models



  1. Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models. Jiuxiang Gu, Jianfei Cai, Shafiq Joty, Li Niu, Gang Wang

  2. Goal. Text-to-Image Retrieval: a text query such as "Bright room with a couch and various different dressers" is used to retrieve images. Image-to-Text Retrieval: an image query is used to retrieve captions such as "A young man doing a skateboard trick while others watch"; "A man doing a skate trick during a competition event with a audience"; "Guys on a course made for skate boarding"; "A group of people doing skateboarding tricks on a car"; …; "A boy riding on his skateboard at a skate park while other guys watch"; …

  3. Classical Pipeline. An Image Encoder maps the image i to an Image Feature v; a Text Encoder maps the caption c (e.g. "Bright room with a couch and various different dressers") to a Text Feature t; the two features are compared by a Similarity score.

  4. Motivation: Look → Imagine → Match. Text-to-Image Retrieval: Global Similarity and Local Similarity over i, v, t, c, plus Imagine î. Image-to-Text Retrieval: Global Similarity and Local Similarity over i, v, t, c, plus Imagine ĉ.

  5. Look → Imagine

  6. Match

  7. Look → Imagine

  8. Match

  9. Proposed Approach

  10. Cross-Modal Retrieval with Generative Learning

  11. Cross-Modal Retrieval with Generative Learning

  12. Results

  13. Results (Classical Pipeline)

  14. Results (Ours)

  15. Additional details at the poster: quantitative results; discussion.
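
The classical pipeline on slide 3 (encode the image, encode the text, compare the two features with a similarity score) can be illustrated with a small sketch. This is not the authors' implementation: the slides do not name the similarity function, so cosine similarity is assumed here, and the feature vectors are made-up numbers standing in for encoder outputs. The function names `cosine_similarity` and `rank_by_similarity` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_by_similarity(query, candidates):
    """Return candidate indices ordered from most to least similar to the query."""
    scores = [cosine_similarity(query, c) for c in candidates]
    return sorted(range(len(candidates)), key=lambda k: scores[k], reverse=True)


# Toy stand-ins for encoder outputs; the values are invented for illustration.
text_feature = [0.9, 0.1, 0.0]   # text feature t for one caption
image_features = [
    [0.0, 1.0, 0.2],             # image 0
    [1.0, 0.2, 0.1],             # image 1
    [0.3, 0.3, 0.9],             # image 2
]

# Text-to-image retrieval: rank all images against the text query.
ranking = rank_by_similarity(text_feature, image_features)
print(ranking)  # [1, 2, 0]
```

Image-to-text retrieval follows the same pattern with the roles swapped: the image feature becomes the query and the caption features become the candidates.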
