Classification of Raster Maps for Automatic Feature Extraction Yao-Yi Chiang and Craig A. Knoblock University of Southern California 1
Motivation Raster map is a bitmap image of a map Raster maps are easily accessible Contain information that is difficult to find elsewhere Contain historical data USGS topographic map of St. Louis, MO Travel map of Tehran, Iran 2
Exploit the geospatial information in raster maps Extracting geographic features from raster maps Road Extraction Text Extraction and Recognition Building Extraction … Text Original map Roads 3
Pre-Processing for feature extraction Much of the feature extraction work relies on user input to extract the foreground pixels from the maps as a preprocessing step Pre-Processing examples: Convert to grayscale Text Line Thresholding the grayscale histogram Background The map profile for pre-processing Convert road pixels to road vectors Convert text pixels to machine-editable text 4
Automatic determine an applicable map profile ! Can we automatically select a map profile for new input map? Map Profiles New map Map repository 5
Automatic feature extraction with map classification We can eliminate the manual pre-processing task using the map classification component 6
Can we use meta-data to determine a map profile? Meta-data such as map source, is not always available Maps from the same source can be very different Two USGS topographic maps covering two different cities 7
Content-based Image Retrieval (CBIR) CBIR is the technique to find images with similar ‘content’ Content similarity defined by the comparison features In our case, similar content means two raster maps shared the same map profile for extracting their foreground pixels Comparison feature – Luminance-Boundary Histogram Classifier – Nearest-Neighbor Classifier 8
Luminance or Color Luminance is chosen instead of using one or all of the Red, Green, and Blue components One-dimensional features is more computational efficient Luminance is the most representative component by design Color images Grayscale images 9
Luminance-Boundary Histogram (LBH) LBH captures the spatial relationships between neighboring luminance levels in the map The two example maps have similar spatial relationship between their luminance levels Blacks are surrounded by gray and white pixels in both maps Map One Map Two 10
High/Low Luminance-Boundary Histogram A set of LBH contain a High Luminance-Boundary Histogram (HLBH) and a Low Luminance-Boundary Histogram (LLBH) HLBH and LLBH are based on the high and low luminance-boundary values (LBV) For a luminance level in a map image How to generate the HLBH and LLBH? The High LBV represents the least luminous upper bound among its surrounding luminance levels The Low LBV represents the greatest luminous lower bound among its surrounding luminance levels Together, the High and Low LBH represent the comparative importance of the luminance level in a raster map A highlighted luminance level has higher values of High and Low LBV Surrounded by luminance levels that have high contrast against the highlighted level 11
Nearest-Neighbor Classification Use L1 Distance to compare two sets of LBH A smaller distance indicates that the spatial relationships between luminance levels in one map are similar to the ones in the other map 12
Experiments Compare luminance-boundary histogram with Color Histogram (CH): Record the number of pixels of each color in a given color space Color Moments (CM): Based on statistical analysis of CH, i.e., average, standard deviation, and skewness Color-Coherence Vectors (CCV): Similar to CH, and further incorporates sizes of color regions into CH Two types of experiment: Image retrieval queries Evaluate the robustness of test features Map classification tasks Simulate a map classification component in a map feature extraction system 13
Test Data 60 test maps from 11 different sources Manually separated test maps into 12 class based on their luminance usage Insert the test maps to a map repository contained 1,495 raster maps Map Profiles 14
Experiments on Image Retrieval Test on Robustness Remove a test class from the repository, such as a class of five test maps from Google Maps, namely G1, G2, G3, G4, and G5. Insert one test map, say G1, into the repository (there is only one correct answer for each query in the repository) Use G2 as the query image Record the rank of G1 in the returned query results Next, we used G3, G4, and G5 in turn as the query image Remove G1 from the repository, insert G2, and repeat the experiments 15
Image Retrieval Sample Results Query map Target map Query map Target map 1/15/269/724 Rank : LBH/CCV/CH/CM 1/289/713/275 Query map Target map Non-shared luminance levels have strong luminance-boundary values -> Lower the the comparative importance for the shared luminance levels Rank : LBH/CCV/CH/CM 3/1/1/231 16
Experiments on Simulating Map Classification Simulate a real map classification task Example: Remove one test map, such as G1, to query the repository (i.e., G1 represents a new input map and there are 4 correct answers) If the first returned map was G2, G3, G4, or G5, then we had a correct classification The accuracy is defined as the number of successful classifications divided by the total number of tested classifications 17
Computation time on feature generation We implemented our experiments using Microsoft .Net running on a Microsoft Windows 2003 Server powered by a 3.2 GHz Intel Pentium 4 CPU with 4GB RAM Compare the top two features in the experiments With 1,949 images 428 seconds to generate the luminance-boundary histograms 805 seconds to generate color-coherence vectors The smallest test image in pixels is 130-by-350 and the largest image is 3000-by-2422 18
Related Work Map Classification using Meta-data (Gelernter, 09) Answer queries such as finding the historical raster maps of a specific region for a specific year Image Comparison Features Shape : Histogram of oriented gradient - HoG (Dalal and Triggs, 05) for human detection T exture : Tamura texture features (Tamura et al., 78), Gabor wavelet transform features (Manjunath and Ma, 96) Represent the overall texture of an image does not fit our goal Color : Color Histogram and Color Moments (Stricker and Orengo, 95) do not generate robust results Color-Coherence Vectors (Pass et al., 96) requires threshold tuning 19
Discussion and Future Work Achieve 95% accuracy on map classification task Make it possible to extract geographic features (e.g., roads and text) automatically on new input maps LBH generation is efficient Future Work Test with modern classifiers (e.g., SVM) or off-the-shelf content-based image retrieval (CBIR) systems Integrate with our current system of map feature extraction 20
Normalized HLBH and LLBH (Cont’d) 21
High/Low luminance-boundary histogram High/Low luminance-boundary histogram (HLBH/LLBH) X-axis represents the luminance spectrum Y-axis represents the the comparative importance of the luminance level in a raster map A highlighted luminance level is surrounded by luminance levels that have high contrast against the highlighted level Luminance-boundary value The luminous differences between adjacent luminance levels HLBH value A higher boundary in the grayscale histogram that separates the luminance level from its adjacent luminance levels in the raster map LLBH value indicates a lower boundary 22
Content-based Image Retrieval (CBIR) Find images with similar ‘content’ Content similarity defined by the comparison features Shape features 23
CBIR Cont’d T exture features Represent visual patterns in images and their spatial relationship (how they are defined spatially) The Near-regular Texture Database from Penn Stats Univ. 24
CBIR Cont’d Color Features Use Color Information Only Color Histogram and Color Moments Color Coherence Vectors (Pass et al., 96) Use Spatial Information of the Color Pixels Only 25
Extracting features from raster maps Aligning raster maps with other geospatial data Labeling other geospatial data with map features Creating map context, e.g., georeferenced road names 26
High/Low luminance-boundary histogram High/Low luminance-boundary histogram (HLBH/LLBH) X-axis represents the luminance spectrum Y-axis represents the the comparative importance of the luminance level in a raster map A highlighted luminance level is surrounded by luminance levels that have high contrast against the highlighted level 27
LBH Values Low Luminance-Boundary value The greatest lower bound among the 64 128 255 surrounding luminance levels 0 64 128 64 – 0 = 64 0 0 64 Luminance Levels High Luminance-Boundary value 64 X X 128 128 X 255 255 X The least upper bound among the surrounding luminance levels X 0 0 64 64 64 128 128 X 128 – 64 = 64 X 0 0 X 0 0 64 X X 28
Normalized HLBH and LLBH The comparative importance of the luminance level in a raster map 29
Recommend
More recommend