classification of raster maps for automatic feature
play

Classification of Raster Maps for Automatic Feature Extraction - PowerPoint PPT Presentation

Classification of Raster Maps for Automatic Feature Extraction Yao-Yi Chiang and Craig A. Knoblock University of Southern California 1 Motivation Raster map is a bitmap image of a map Raster maps are easily accessible Contain


  1. Classification of Raster Maps for Automatic Feature Extraction Yao-Yi Chiang and Craig A. Knoblock University of Southern California 1

  2. Motivation  Raster map is a bitmap image of a map  Raster maps are easily accessible  Contain information that is difficult to find elsewhere  Contain historical data USGS topographic map of St. Louis, MO Travel map of Tehran, Iran 2

  3. Exploit the geospatial information in raster maps  Extracting geographic features from raster maps  Road Extraction  Text Extraction and Recognition  Building Extraction  … Text Original map Roads 3

  4. Pre-Processing for feature extraction  Much of the feature extraction work relies on user input to extract the foreground pixels from the maps as a preprocessing step  Pre-Processing examples:  Convert to grayscale Text Line  Thresholding the grayscale histogram Background The map profile for pre-processing Convert road pixels to road vectors Convert text pixels to machine-editable text 4

  5. Automatic determine an applicable map profile !  Can we automatically select a map profile for new input map? Map Profiles New map Map repository 5

  6. Automatic feature extraction with map classification We can eliminate the manual pre-processing task using the map classification component 6

  7. Can we use meta-data to determine a map profile?  Meta-data such as map source, is not always available  Maps from the same source can be very different  Two USGS topographic maps covering two different cities 7

  8. Content-based Image Retrieval (CBIR)  CBIR is the technique to find images with similar ‘content’  Content similarity defined by the comparison features  In our case, similar content means two raster maps shared the same map profile for extracting their foreground pixels  Comparison feature – Luminance-Boundary Histogram  Classifier – Nearest-Neighbor Classifier 8

  9. Luminance or Color  Luminance is chosen instead of using one or all of the Red, Green, and Blue components  One-dimensional features is more computational efficient  Luminance is the most representative component by design Color images Grayscale images 9

  10. Luminance-Boundary Histogram (LBH)  LBH captures the spatial relationships between neighboring luminance levels in the map  The two example maps have similar spatial relationship between their luminance levels Blacks are surrounded by gray and white pixels in both maps Map One Map Two 10

  11. High/Low Luminance-Boundary Histogram  A set of LBH contain a High Luminance-Boundary Histogram (HLBH) and a Low Luminance-Boundary Histogram (LLBH)  HLBH and LLBH are based on the high and low luminance-boundary values (LBV)  For a luminance level in a map image How to generate the HLBH and LLBH?  The High LBV represents the least luminous upper bound among its surrounding luminance levels  The Low LBV represents the greatest luminous lower bound among its surrounding luminance levels  Together, the High and Low LBH represent the comparative importance of the luminance level in a raster map  A highlighted luminance level has higher values of High and Low LBV Surrounded by luminance levels that have high contrast against the highlighted level 11

  12. Nearest-Neighbor Classification  Use L1 Distance to compare two sets of LBH  A smaller distance indicates that the spatial relationships between luminance levels in one map are similar to the ones in the other map 12

  13. Experiments  Compare luminance-boundary histogram with  Color Histogram (CH):  Record the number of pixels of each color in a given color space  Color Moments (CM):  Based on statistical analysis of CH, i.e., average, standard deviation, and skewness  Color-Coherence Vectors (CCV):  Similar to CH, and further incorporates sizes of color regions into CH  Two types of experiment:  Image retrieval queries  Evaluate the robustness of test features  Map classification tasks  Simulate a map classification component in a map feature extraction system 13

  14. Test Data  60 test maps from 11 different sources  Manually separated test maps into 12 class based on their luminance usage  Insert the test maps to a map repository contained 1,495 raster maps Map Profiles 14

  15. Experiments on Image Retrieval  Test on Robustness  Remove a test class from the repository, such as a class of five test maps from Google Maps, namely G1, G2, G3, G4, and G5.  Insert one test map, say G1, into the repository (there is only one correct answer for each query in the repository)  Use G2 as the query image  Record the rank of G1 in the returned query results  Next, we used G3, G4, and G5 in turn as the query image  Remove G1 from the repository, insert G2, and repeat the experiments 15

  16. Image Retrieval Sample Results Query map Target map Query map Target map 1/15/269/724 Rank : LBH/CCV/CH/CM 1/289/713/275 Query map Target map Non-shared luminance levels have strong luminance-boundary values -> Lower the the comparative importance for the shared luminance levels Rank : LBH/CCV/CH/CM 3/1/1/231 16

  17. Experiments on Simulating Map Classification  Simulate a real map classification task  Example:  Remove one test map, such as G1, to query the repository (i.e., G1 represents a new input map and there are 4 correct answers)  If the first returned map was G2, G3, G4, or G5, then we had a correct classification  The accuracy is defined as the number of successful classifications divided by the total number of tested classifications 17

  18. Computation time on feature generation  We implemented our experiments using Microsoft .Net running on a Microsoft Windows 2003 Server powered by a 3.2 GHz Intel Pentium 4 CPU with 4GB RAM  Compare the top two features in the experiments  With 1,949 images  428 seconds to generate the luminance-boundary histograms  805 seconds to generate color-coherence vectors  The smallest test image in pixels is 130-by-350 and the largest image is 3000-by-2422 18

  19. Related Work  Map Classification using Meta-data (Gelernter, 09)  Answer queries such as finding the historical raster maps of a specific region for a specific year  Image Comparison Features  Shape :  Histogram of oriented gradient - HoG (Dalal and Triggs, 05) for human detection  T exture :  Tamura texture features (Tamura et al., 78), Gabor wavelet transform features (Manjunath and Ma, 96)  Represent the overall texture of an image does not fit our goal  Color :  Color Histogram and Color Moments (Stricker and Orengo, 95) do not generate robust results  Color-Coherence Vectors (Pass et al., 96) requires threshold tuning 19

  20. Discussion and Future Work  Achieve 95% accuracy on map classification task  Make it possible to extract geographic features (e.g., roads and text) automatically on new input maps  LBH generation is efficient  Future Work  Test with modern classifiers (e.g., SVM) or off-the-shelf content-based image retrieval (CBIR) systems  Integrate with our current system of map feature extraction 20

  21. Normalized HLBH and LLBH (Cont’d) 21

  22. High/Low luminance-boundary histogram  High/Low luminance-boundary histogram (HLBH/LLBH)  X-axis represents the luminance spectrum  Y-axis represents the the comparative importance of the luminance level in a raster map  A highlighted luminance level is surrounded by luminance levels that have high contrast against the highlighted level  Luminance-boundary value  The luminous differences between adjacent luminance levels  HLBH value  A higher boundary in the grayscale histogram that separates the luminance level from its adjacent luminance levels in the raster map  LLBH value indicates a lower boundary 22

  23. Content-based Image Retrieval (CBIR)  Find images with similar ‘content’  Content similarity defined by the comparison features  Shape features 23

  24. CBIR Cont’d  T exture features  Represent visual patterns in images and their spatial relationship (how they are defined spatially) The Near-regular Texture Database from Penn Stats Univ. 24

  25. CBIR Cont’d  Color Features Use Color Information Only Color Histogram and Color Moments Color Coherence Vectors (Pass et al., 96) Use Spatial Information of the Color Pixels Only 25

  26. Extracting features from raster maps  Aligning raster maps with other geospatial data  Labeling other geospatial data with map features  Creating map context, e.g., georeferenced road names 26

  27. High/Low luminance-boundary histogram  High/Low luminance-boundary histogram (HLBH/LLBH)  X-axis represents the luminance spectrum  Y-axis represents the the comparative importance of the luminance level in a raster map  A highlighted luminance level is surrounded by luminance levels that have high contrast against the highlighted level 27

  28. LBH Values Low Luminance-Boundary value The greatest lower bound among the 64 128 255 surrounding luminance levels 0 64 128 64 – 0 = 64 0 0 64 Luminance Levels High Luminance-Boundary value 64 X X 128 128 X 255 255 X The least upper bound among the surrounding luminance levels X 0 0 64 64 64 128 128 X 128 – 64 = 64 X 0 0 X 0 0 64 X X 28

  29. Normalized HLBH and LLBH  The comparative importance of the luminance level in a raster map 29

Recommend


More recommend