dynamic control of magnified image for low vision
play

Dynamic Control Of Magnified Image For Low Vision Observers R.B. - PowerPoint PPT Presentation

1 Dynamic Control Of Magnified Image For Low Vision Observers R.B. Goldstein 1 , E.Peli 1 , H.Apfelbaum 1 , R.Hier 2 . 1 Schepens Eye Research Institute, Boston, MA; 2 DigiVision, Inc., San Diego, CA. Poster Board Number: B787 Grant Identification:


  1. 1 Dynamic Control Of Magnified Image For Low Vision Observers R.B. Goldstein 1 , E.Peli 1 , H.Apfelbaum 1 , R.Hier 2 . 1 Schepens Eye Research Institute, Boston, MA; 2 DigiVision, Inc., San Diego, CA. Poster Board Number: B787 Grant Identification: NIH EY05957and EY12890 ARVO 2002 Presentation Number: 3803 Purpose: Magnification is an effective aid for people with conditions causing resolution loss. However, it inherently restricts the field of view. We have developed a system of magnifying television images while addressing the field restriction in two ways. Methods: The system requires a point in each video frame that is to be centered on the screen when the image is magnified. This selection should maintain the most relevant details in the scene. The coordinates of this “center of interest” point can potentially be included with each frame. The eye movement patterns of normal-vision observers were used to determine the centers of interest within scenes. Six ten-minute video segments were selected from a broad range of scenes. To the extent possible, centers of interest were defined in terms of objects. For example, if a view fixated on a person’s nose and then moved fixation to an ear, the object of fixation in both cases was defined as the person’s head. In typical use, the visually impaired person uses a remote control to adjust the magnification (“zoom”) and can also override the pre-set center of interest (“roam”). To maintain wide field context, an edge-enhanced outline of the full scene can be superimposed on the magnified image. Results: A system implementing the magnification and shifting was built and integrated to use the preset center coordinates. A real-time outline derivation system that superimposes an outline of the full frame on the magnified image was developed. Eye movement patterns were analyzed off line. A comparison across groups allowed us to compare differences in areas of interest or eye movement patterns based on age or gender. The location of the center of each object was translated to the center of interest attached to the frame. Modified videos were shown to low vision patients with and without the full field outline. Magnification preferences, override frequency, and satisfaction were recorded. Patients were also interviewed regarding the impact of information in the scene that might have been missed due to the magnification. Conclusion: A demonstration system will be available for use during the poster session.

  2. Abstract 2 3 Purpose: Magnification is an effective aid for people with low vision conditions causing loss Background of resolution. However, it inherently restricts the field of view. We have developed a system of magnifying television images while addressing the field restriction in two ways. Methods: • Visually Impaired people benefit from magnification 1. Magnified image is displayed centered on “Point of Interest” or “Point of Regard” (POR) of the frame. This POR is determined on a frame-by-frame basis from the response of • Electronic zoom can magnify TV normally sighted viewers. • Magnification reduces field of view 2. To maintain context, a real-time outline derivation system that superimposes an outline of – Solutions the full frame on the magnified image was developed. This POP (Picture Over Picture) technique provides spatial multiplexing of the full view. • Center magnification at point of interest • Provide spatial multiplexing of full view Determination of POR Eye recording data was used to identify, across normally sighted viewers, fixations that – Picture Over Picture (POP) overlap in time and position. These overlaps define the center of interest in the scenes (POR). Conclusion: A demonstration of a prototype system is available for use during this session 4 5 Picture Over Picture (POP) Implementation Original Image • Point of Interest is provided with video 16:9 HDTV Letterbox on 4:3 NTSC Display frame User’s control turns magni- Computer syncs control of fication and enhanced edges • Magnification is centered at Point of Zoom and Roam with DVD on screen on and off, and can playback, based upon the 35” Monitor Interest (POR) adjust magnification level Point of Regard data • Maximum magnification limit is also provided • Edge enhanced image (original size) Edge superimposed on magnified image Internal Filter (POP) DVD • Edge-detection of original size image in real time Video Computer • NTSC NTSC • User controls level of magnification and IDE Mixer on/off of edge-detected image Zoom Frame #, filtered and eye fixation (x,y) RS232 Data File coordinates, and Roam magnification POP Magnified Image

  3. 6 7 Point of Interest of Observers System Diagram of Recording Phase • Computer “plays” DVD while • Calibration Remote ISCAN reading eye position data from – 5 point ISCAN calibration ISCAN device – Calibration rechecked and redone (if TV Width=22” necessary) between video segments • ISCAN Specifications • Synchronization with DVD – Pupil/Corneal Reflection Video RS232 System – MSWebDVD only provides an actual 16.9 o frame count at the beginning of an 0.4 to – Sampling Rate 60Hz IDE Bus Internal 74 ” Computer 27” Monitor 1 second block of video data on a DVD. SVGA to NTSC DVD – Accuracy of 0.5 o over ± 20 o range We use a VB timer event (interrupt • MS DirectX 8.1Technology every 33ms ) to interpolate to frame- DVD sends SVGA 16x9 level precision. (Microsoft) HDTV on a to monitor at same time it sends 4x3 NTSC – The MSWebDVD object provides frame number to the Visual Basic 6 interface used to computer control the DVD Data File Data file contains Frame #, x,y coordinates, magnification POR Across Observers 8 9 Categories of Videos and Observer Groups Sample Velocity Trace Velocity (deg/sec) 50 40 30 20 10 0 60483 62483 64483 66483 68483 70483 72483 74483 6 Segments Drawn from Frame Number Different Categories of Video • Artifact Removal Eye Traces Superimposed on Representative Video Frame • Zero data (loss of tracking, blinks) are removed Determination of Position Overlap (3 Observers) Category Title Time Saccades - Data rejected if velocity exceeds 30 o per second • 600 (hh:mm:ss) • Data rejected if pupil diameter too large or too small Talk Show Quiz Show 00:06:40 500 Romance Shakespeare in Love 00:07:06 Vertical 400 Sports Any Given Sunday 00:04:12 • Remaining data are characterized as Fixations or Smooth 300 Documentary Blue Planet 00:08:14 Pursuits Outlier Vertical News Network 00:04:02 • Data is considered a Pursuit unless either 200 4 Fixations Overlap Comedy Big 00:06:29 • Horizontal range and Vertical range is less than 50 pixels 100 Total 00:37:29 (Visual angle 1.6 o ) • Correlation coefficient between X and Y values is less than 0.5 0 100 200 300 400 500 Horizontal Group (Gender-Age) N hh:mm:ss After outliers are removed, the average (Useful data) 0 100 200 300 400 500 <x,y> position is found. A fixation Definition of Overlap in Time Male < 40 7 04:09:23 Horizontal sequence is defined to be an overlap if it Male > 45 3 01:49:56 is within the bounding box centered at <x,y> of side 128 pixels (Visual angle Female < 40 5 03:21:33 Fixation of Observer 1 Fixation of Observer 2 4.2 o ) Female >45 4 02:26:42 Overlap (Minimum 2 Frames) SEE DEMONSTRATION OF DVD PLAYBACK OF EYE MOVEMENTS

  4. 10 11 Hardware Systems Preliminary Recording Phase Results Other Artifacts include %Other % Zero % Pupil %Saccade % # Group Artifact #Pursuits -Removal of first and last frames of Frames Rejects Rejects Accepted Fixations Rejects* every “accepted” sequence Subject Viewing Video M<40 9.2% 8.5% 7.6% 26.3% 48.3% 16032 1404 -Duplicate or non-monotonic frame numbers (due to interactions between the F<40 10.8% 24.8% 6.4% 25.6% 32.3% 9644 752 timing of the DVD and ISCAN) F>45 21.4% 17.6% 5.6% 23.8% 31.5% 6777 471 M>45 11.9% 27.8% 5.2% 25.6% 29.6% 4914 465 -Rejection of sequences with fewer than 4 frames The number of segments that have fixations that overlap in time and of the number that have overlaps in time and position Edge Filter Process Time Overlaps (All Videos) for 7 Males < 40 Position Overlaps (All Videos) for 7 Males < 40 Y = c(E-Ê ) 800 800 700 700 600 600 Y = output image 500 500 Count Count 400 400 300 300 c = contrast 200 200 100 100 DZ1-Zoom And Roam E = input image 0 0 0 1 2 3 4 5 6 7 8 9 10 11 12 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Ê = local average luminance Number of Overlaps in Segment Number of Overlaps In a Segment 12 13 System Demonstrations Future Work • Eye positions superimposed on video • Analysis of POR variation within and across groups • POR Controlled Zoom System • Better determination of POR to use with the display system (smoothed, threshold POR changes, handle pursuits) – For sections of video where no position overlap • Conduct “satisfaction and use” study with low vision population exists, center of magnification is kept at prior center • Integrate POR into DVD and other video formats (broadcast POR • Real-time edge enhanced image with frame • Data Analysis System Acknowledgements • Supported by NIH Grants EY05957 and EY12890 • Thanks to Gang Luo for extensive data collection efforts

Recommend


More recommend