Computational Scientometrics: Mapping the Structure and Evolution of Science Cyberinfrastructure for Network Science Center, Director Information Visualization Laboratory, Director School of Library and Information Science Indiana University, Bloomington, IN katy@indiana.edu Maps of Science help answer questions such as: � What are the major research areas, experts, institutions, regions, nations, grants, publications, journals in xx research? � Which areas are most insular? � What are the main connections for each area? � What is the relative speed of areas? � Which areas are the most dynamic/static? � What new research areas are evolving? � Impact of xx research on other fields? � How does funding influence the number and quality of publications? Answers are needed by funding agencies, companies, researchers & society. • Shiffrin, Richard M. and Börner, Katy (Eds.) (2004). Mapping Knowledge Domains. Proceedings of the National Academy of Sciences of the United States of America, 101(Suppl_1). • Börner, Katy, Chen, Chaomei, and Boyack, Kevin. (2003). Visualizing Knowledge Domains. In Blaise Cronin (Ed.), Annual Review of Information Science & Technology, Volume 37, Medford, NJ: Information Today, Inc./American Society for Information Science and Technology, chapter 5, pp. 179-255. • Börner, Katy, Sanyal, Soma and Vespignani, Alessandro. (in press) Network Science: A Theoretical and Practical Framework. In Blaise Cronin (Ed.), Annual Review of Information Science & Technology, Volume 41, Medford, NJ: Information Today, Inc./American Society for Information Science and Technology.
Dec 1 & 2, 2006: Mapping Science Workshop at Thomson Scientific, Philadelphia, PA.
April 4, 2006: Mapping Science Workshop at the New York Academy of Science, New York City, NY.
May 21, 2006: Modeling Science Workshop at Indiana University, Bloomington, IN.
May 29 & 30, 2006: Integrating Semantic and Linkage-Based Data Mining Approaches Albert Mons & Marc Weber, Knewco visit our Lab/Center.
Mapping the Evolution of Co-Authorship Networks in Information Visualization, 1988 - 2004 Ke, Visvanath & Börner, (2004) Won 1st price at the IEEE InfoVis Contest. 6
7
Spatio-Temporal Information Production and Consumption of Major U.S. Research Institutions Börner & Penumarthy. (2005) Does Internet lead to more global citation patterns, i.e., more citation links between papers produced at geographically distant research instructions? Analysis of top 500 most highly cited U.S. institutions. γ 82-86 = 1.94 (R 2 =91.5%) Each institution is γ 87-91 = 2.11 (R 2 =93.5%) assumed to produce and γ 92-96 = 2.01 (R 2 =90.8%) consume information. γ 97-01 = 2.01 (R 2 =90.7%) 8
Mapping Medline Papers, Genes, and Proteins Related to Melanoma Research Boyack, Mane & Börner. (2004) IV Conference, pp. 965-971. 9
Mapping Indiana’s Intellectual Space (Ke, Börner & Mei, 2005) Identify � Pockets of innovation � Pathways from ideas to products � Interplay of industry and academia 10
Mapping Topic Bursts Co-word space of the top 50 highly frequent and bursty words used in the top 10% most highly cited PNAS publications in 1982-2001. Mane & Börner. (2004) PNAS, 101(Suppl. 1): 5287-5290. 11
Comparison of Similarity Metrics � ISI file year 2000, SCI and SSCI: 7,121 journals. � Different similarity metrics • Inter-citation (raw counts, cosine, modified cosine, Jaccard, RF, Pearson) • Co-citation (raw counts, cosine, modified cosine, Pearson) � Maps were compared based on • regional accuracy, • the scalability of the similarity algorithm, and • the readability of the layouts. Boyack, Kevin W., Klavans, R. and Börner, Katy. (2005). Mapping the Backbone of Science. Scientometrics. 64(3), 351-374. 12
Selecting the similarity measure with the best regional accuracy � For each similarity measure, the VxOrd layout was subjected to k- 400 means clustering using different numbers of 380 clusters. � Resulting cluster/category 360 Z-score memberships were IC Raw IC Cosine compared to actual 340 IC Jaccard category memberships IC Pearson IC RFavg 320 using entropy/mutual CC Raw CC K50 information method by CC Pearson 300 Gibbons & Roth, 2002. � Increasing Z-score 280 indicates increasing 100 150 200 250 Number of k-means clusters distance from a random solution. � Most similarity measures are within several percent Boyack, Kevin W., Klavans, R. and Börner, Katy. (2005). of each other. Mapping the Backbone of Science. Scientometrics. 64(3), 351-374. 13
A ‘Backbone’ Map of Science & Social Science � The map is comprised of LIS Comp Sci 7,121 journals from year Geogr 2000. Robot PolySci Law � Each dot is one journal Oper Res Econ Social Sci Math � An IC-Jaccard similarity Comm Appl Sociol Math Hist AI measure was used. Stat Elect Eng Mech Eng Geront � Journals group by Psychol Physics Aerosp Educ MatSci discipline. Constr Neurol Psychol Anthrop CondMat Nuc Psychol Pharma Fuels � Groups are labeled by Elect Sport Analyt Radiol Chem Sci P Chem Chem Env Health hand. Care Emerg Gen/Org Med Gen Med Neuro GeoSci Chem Eng Astro � Large font size labels Sci Biomed Chemistry Rehab Polymer identify major areas of Nursing Genet BioChem Cardio OtoRh Meteorol Oncol Ped science. Marine GeoSci Endocr Surg Env Immun Medicine Hemat Ecol � Small labels denote the Nutr Urol Virol Paleo Earth Sciences Gastro Plant disciplinary topics of Ob/Gyn Soil Endocr Derm nearby large clusters of Dairy Dentist Food Sci Pathol journals. Zool Parasit Agric Ophth Vet Med Ento 14
LIS Comp Sci Geogr Robot PolySci Law Oper Res Econ Social Sci Math Comm Appl Sociol Math Hist AI Stat Elect Eng Mech Eng Geront Psychol Physics Aerosp Educ MatSci Neurol Constr Psychol Anthrop CondMat Nuc Psychol Pharma Fuels Elect Sport Analyt Radiol Chem Sci P Chem Chem Env Health Care Emerg Gen/Org Med Gen Med Neuro GeoSci Chem Eng Astro Sci Chemistry Biomed Rehab Polymer Nursing Genet BioChem Cardio OtoRh Meteorol Oncol Ped Marine GeoSci Endocr Surg Env Immun Medicine Ecol Hemat Nutr Urol Virol Paleo Earth Sciences Gastro Plant Ob/Gyn Soil Endocr Derm Dairy Dentist Pathol Food Sci Zool Parasit Agric Ophth Vet Med Ento 15
Latest ‘Base Map’ of Science Kevin W. Boyack & Richard Klavans, unpublished work. � Uses combined SCI/SSCI Math from 2002 Law • 1.07M papers, 24.5M Computer Tech Policy Statistics references, 7,300 journals Economics • Bibliographic coupling of CompSci Phys-Chem papers, aggregated to Vision Chemistry Education Physics journals Psychology � Initial ordination and clustering Brain Environment GeoScience of journals gave 671 clusters Psychiatry MRI � Coupling counts were Biology BioChem Bio- reaggregated at the journal Materials cluster level to calculate the Microbiology Plant • (x,y) positions for each Cancer Animal journal cluster Disease & Treatments • by association, (x,y) Infectious Diseases Virology positions for each journal 16
Science map applications: Identifying core competency Kevin W. Boyack & Richard Klavans, unpublished work. Funding patterns of the US Department of Energy (DOE) Math Law Computer Tech Policy Statistics Economics CompSci Phys-Chem Vision Chemistry Education Physics Psychology Brain Environment GeoScience Psychiatry MRI Biology BioChem GI Bio- Materials Microbiology Plant Cancer Animal Infectious Diseases Virology 17
Science map applications: Identifying core competency Kevin W. Boyack & Richard Klavans, unpublished work. Funding Patterns of the National Science Foundation (NSF) Math Law Computer Tech Policy Statistics Economics CompSci Phys-Chem Vision Chemistry Education Physics Psychology Brain Environment GeoScience Psychiatry MRI Biology BioChem GI Bio- Materials Microbiology Plant Cancer Animal Infectious Diseases Virology 18
Science map applications: Identifying core competency Kevin W. Boyack & Richard Klavans, unpublished work. Funding Patterns of the National Institutes of Health (NIH) Math Law Computer Tech Policy Statistics Economics CompSci Phys-Chem Vision Chemistry Education Physics Psychology Brain Environment GeoScience Psychiatry MRI Biology BioChem GI Bio- Materials Microbiology Plant Cancer Animal Infectious Diseases Virology 19
Places & Spaces: Cartography of the Physical and the Abstract This science exhibit aims to demonstrate the power of maps to navigate physical places and abstract knowledge spaces. http://vw.indiana.edu/places&spaces/
The Power of Maps The Power of Maps Four Early Maps of Our World Four Early Maps of Our World VERSUS VERSUS Six Early Maps of Science Six Early Maps of Science (1st Iteration of Places & Spaces Exhibit - - 2005) 2005) (1st Iteration of Places & Spaces Exhibit
The Power of Reference Systems The Power of Reference Systems Four Existing Reference Systems Four Existing Reference Systems VERSUS VERSUS Six Potential Reference Systems of Science Six Potential Reference Systems of Science For Sale! For Sale! nd Iteration of Places & Spaces Exhibit (2 nd Iteration of Places & Spaces Exhibit - - 2006) 2006) (2
Recommend
More recommend