recording the visual mind
play

Recording the Visual Mind Appu Shaji Head of Research & - PowerPoint PPT Presentation

Recording the Visual Mind Appu Shaji Head of Research & Development @panteanaghavi EyeEm The Evolution of Photography The Boulevard du Temple, (1837) by Louis Daguerre (public domain). A Harvest of Death, Gettysburg,


  1. Recording the Visual Mind Appu Shaji Head of Research & Development @panteanaghavi EyeEm

  2. The Evolution of Photography “The Boulevard du Temple”, (1837) by Louis Daguerre (public domain).

  3. “A Harvest of Death, Gettysburg, Pennsylvania”, (1863) by Timothy H. O'Sullivan (public domain).

  4. “Migrant Mother”, (1936) by Dorothy Lange (public domain).

  5. “Earth rise”, (1969) by William Anders (public domain).

  6. "Revenge of the goldfish”, (1981) by Sandy Skoglund (fair use).

  7. “Paris, Montparnasse”, (1993) by Andreas Gurksy (fair use).

  8. AT THE CORE Storytelling A story behind every image.

  9. EYEEM Vision Towards uncovering the stories within an image. https://www.eyeem.com/tech

  10. EyeEm Vision will organize your visual content Tags & Caption portrait of young woman Identify all relevant concepts See the story in a 89% headline Head shot Aesthetics Contemplation The aesthetic score ranks the quality of images Trained on visual trends Human body and feedback from EyeEm community Young woman Adult Front view One person

  11. WE ARE A Community 20 million Marketplace PHOTOGRAPHERS AUTHENTIC PHOTOGRAPHY 150 Technology COUNTRIES SEARCH & DISCOVERY @BettinaDarger

  12. IN SHORT EyeEm is EyeEm is a photography company. We build the world's leading computer vision technology to connect our global creative community with iconic brands. @LAX2NRT

  13. AT THE CORE Storytelling A story behind every image.

  14. EYEEM Vision Towards uncovering the stories within an image.

  15. http://www.eyeem.com/tech

  16. http://www.eyeem.com/tech

  17. Understanding Aesthetics. @Aadnan

  18. Can we learn from the masters ?

  19. Bombay Churchgate Station, (1994) by Raghu Rai (fair use).

  20. Bombay Churchgate Station, (1995) by Sebastião Salgado (fair use).

  21. Bombay Churchgate Station, (2011) by Randy Olson (fair use).

  22. Bombay Churchgate Station, (unknown ) from Google Image Search

  23. What is common among them ? What differentiates them ? Steve McCurry David Uzochukwu Me :-)

  24. Photographs by EyeEm users @nicanorgarcia, @cocu_liu , and me.

  25. Data • Aesthetics are hard to express. • Aesthetics differ from person to person. • However, experts often find a common language to define and communicate about aesthetics, after considerable GOOD BAD conversation and debate. 4 million Photographs curated by Experts 3 years and ongoing Crowdsourced Social Data Community Expertly Curated

  26. PERSONALIZED Aesthetics Isn't Aesthetics Subjective ?

  27. @ken_kou @jackyczj2010 @KatePhellini @itchban @idjphotography @sayjor @ArifNurhakim @svanteberg

  28. Dense Predictions Layer Convolutional Layers + Non Linearity Image Feature Representations

  29. Feature Representations All Images BCG Kayak

  30. Personalized Dense Predictions Rank Layer Non-linear Ranker Convolutional Layers (Multi-layer Perceptron) + Non Linearity Image Feature Representations

  31. Energy Surface All Images BCG Kayak

  32. 3 ~ 4 secs 200,000 Train a New Personalized Layer ( in Titan GPU ) Photos can be scored in a sec.

  33. https://www.youtube.com/watch?v=lYFKqoekP_0

  34. COMPUTER VISION AND MACHINE LEARNING RESEARCH At EyeEm ( Hint : It is about answering questions with no easy answers ) Data Algorithms Efficiency • What kind of data should we collect ? • How do you pose the problem ? • How do you make your networks faster ? • What do you look for in good feature • How do you understand the data ? • How do you make your networks Human thinking is non-linear, and any representations ? smaller ? given selection is prone to biases. • Can we transfer learn effectively what we • Will the architecture engineered for • How do you sample the data ? have learned in the past ? classification task be the best for aesthetics ? • How do you evaluate & analyze the results ? • What is the network learning ? • How much data is enough ? • How do you fill in data gaps ?

  35. CNN Zero-shot GANs RNN/LSTM Many-shot Fill-in-the blanks Attention Mechanisms Transfer Learning style model Semi/Weakly Supervised Unsupervised Reinforcement Learning Learning Research Directions Understanding Personalization Reasoning Models In production system

  36. Technology Product ASK LORENZ

  37. Photos Conversations Technology Community

  38. Joint Work With • Dr. Gökhan Yildrim • Harsimrat Sandhawalia • Ludwig Schmidt-Hackenberg • Dr. Hicham Badri Thanks • Dr. Praveen Kulkarnai • Nour Karessli • Photography Team @ EyeEm • Dr. Josip Krapac • Core Engineering @ EyeEm • Fulya Neubert • Clients @ EyeEm And yes, we are hiring ( https://www.eyeem.com/jobs ).

  39. A photograph exists in past, present and future; and magic can happen in any of these times! @evanscsmith appu@eyeem.com

Recommend


More recommend