visual disability
play

Visual disability Low vision 2015 Estimated blind people 2020 - PowerPoint PPT Presentation

Visual disability Low vision 2015 Estimated blind people 2020 Visually impaired 285 M Blind 54 M Blind 39 M Global data souce: WHO, IBU See the world through the eyes of a visually impaired person Normal vision Cataract Glaucoma


  1. Visual disability Low vision 2015 Estimated blind people 2020 Visually impaired 285 M Blind 54 M Blind 39 M Global data souce: WHO, IBU

  2. See the world through the eyes of a visually impaired person

  3. Normal vision

  4. Cataract

  5. Glaucoma

  6. Macular degeneration

  7. Diabetic Retinopathy

  8. Complete blindness

  9. One goal: independence

  10. Text recognition

  11. Object recognition

  12. Mobility assistance

  13. Scene and photos description

  14. Face recognition

  15. Possible approaches Smartphone Single Stereo Stereo Stereo App Camera Camera CPU Camera FPGA Camera GPU Day-lasting battery OTA Upgrades Real time performance (offline) Obstacle perception

  16. How Horus works 1 Externalinput identification Image are transferred 3 to the computing unit Cameras 2 aquire images 6 Sound output Audio is transferred back 5 to the headset 4 Information extraction

  17. What Horus does Horus can help the user with: Scene description Face recognition Text reading Object recognition Mobility assistance

  18. User interaction Navigation menu Scene description Text reading Navigationbuttons Face recognition Object recognition ON/OFF button Mobility assistance After powering Horus, the user can choose the desired functionality by navigating a vocal menu using the navigation buttons.

  19. Image description with Deep Learning A sunset over the mountains Language model Convolutional network Whole process runs on NVIDIA TK1

  20. Results on TK1 CPU vs GPU (CNN + LSTM) Processing time on Memory footprint Processing time on GPU GPU CNN: 22ms CNN: 595ms 300 MB LSTM: 498ms LSTM: 1200ms TOTAL: 520ms TOTAL: 1795ms ~3.5X faster on GPU

  21. Reporting obstacles Low-pitched sound center High-pitched sound High-pitched sound left right User Horus uses 3D sound to report the presence of obstacles during movement. The space in front of the user is divided in different sectors: lateral obstacles generate high-pitched sounds in one of the two speakers, while central obstacles generate low pitched centered sounds. These sounds are repetitive and they increase in repetition frequency as the obstacle gets closer.

  22. Results on TK1 CPU vs GPU (SGBM @480p) Processing time on Processing time on GPU (Visionworks) CPU 116 ms 5650 ms ~48X faster on GPU

  23. Example of audio feedback If the text is located in the upper part of the fieldof If the text is located in the lower part of the fieldof view, Horus emits a high-pitched sound to tell the view, Horus emits a low-pitched sound to tell the userto lower the text userto raise the text

  24. Face recognition Source frame Face detection CNN Classification Tracking

  25. Future improvements 3D reconstruction of faces 3D undistortion of sheets Object recognition Multi language LSTM models

  26. www.horus.tech info@horus.tech

Recommend


More recommend