Building an IT Taxonomy with Co-occurrence Analysis, Hierarchical Clustering, and Multidimensional Scaling

Chia-jung Tsui, Ping Wang, Kenneth R. Fleischmann, Asad B. Sayeed, Amy Weinberg, and Douglas Oard

The abundance of IT is a challenge for both IT management and information management.
[Cartoon by Sidney Harris]
We Have Lots of IT, But …
[Figure: word cloud of IT terms, including SaaS, Ajax, RFID, Portable Personality, Ultramobile Devices, BPO, Application Quality Dashboards, SOA, Mashup, Chatbots, VoIP, DRM, Identity Management, OSS, Thin Provisioning, Business Intelligence, Semantic Web, Cloud Computing, SCM, Tera-architectures, Web 2.0, CRM, Distributed, and Encryption.]

… Little and Dated Understanding
[Figure labels: 1993, 1998]
Extant Approach to IT Taxonomy
- Compile a list of ITs through empirical surveys.
- Experts rate the ITs according to their assessments of the technologies' functions or features.

Limitations
- Narrow representation: arbitrary and limited choices of features and functions; few ITs
- Static: snapshots are few and far between
- Not scalable: the more ITs, the lower the reliability

Scalable Computational Approach
- Downloaded full-text articles published in 1998-2007 from six magazines:
  - ComputerWorld and InformationWeek
  - BusinessWeek and The Economist
  - Newsweek and US News and World Report
- Extracted ~220,000 paragraphs containing 50 IT concepts.
IT Concepts Included in Analysis

Label           Concept
AI              Artificial intelligence
ASP             Application service provider
BI              Business intelligence
Blog            Blog
Bluetooth       Bluetooth
BizProReen      Business process reengineering
CloudCom        Cloud computing
CRM             Customer relationship management
DigiCam         Digital camera
DLearn          Distance learning
DSL             Digital subscriber line
DecisionSS      Decision support system
DW              Data warehouse
eBiz            Electronic business
eCom            Electronic commerce
EDI             Electronic data interchange
ERP             Enterprise resource planning
ExpertSys       Expert system
GPS             Global positioning system
Grpware         Groupware
IM              Instant messaging
iPhone          iPhone
iPod            iPod
KM              Knowledge management
Linux           Linux
Multimedia      Multimedia
MP3             MP3 player
MySpace         MySpace
NeuralNet       Neural net
OLAP            Online analytical processing
OSS             Open source software
Outsource       Outsourcing
PDA             Personal digital assistant
RFID            Radio frequency identification
SmartCard       Smart card
SCM             Supply chain management
SFA             Salesforce automation
SocNet          Social networking
SOA             Service oriented architecture
Telecommute     Telecommuting
TabletPC        Tablet PC
UtiComp         Utility computing
Virtualization  Virtualization
VPN             Virtual private network
Web2            Web 2.0
WebServ         Web services
WiFi            Wi-Fi
Wiki            Wiki
Wikipedia       Wikipedia
YouTube         YouTube

Scalable Computational Approach (continued)
- Counted co-occurrence of IT concepts in paragraphs.
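The counting step can be illustrated with a minimal Python sketch. The slides do not say how concept mentions were matched, so the regular-expression patterns in `CONCEPT_PATTERNS`, the helper names, and the three sample paragraphs are all hypothetical; only the idea of counting pairs of concepts that appear in the same paragraph comes from the talk.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical matching rules (assumption): label -> regex patterns.
# Only three of the 50 concepts are shown to keep the sketch short.
CONCEPT_PATTERNS = {
    "ERP": [r"\bERP\b", r"enterprise resource planning"],
    "SCM": [r"\bSCM\b", r"supply[- ]chain management"],
    "Grpware": [r"groupware"],
}


def concepts_in(paragraph):
    """Return the set of concept labels mentioned in one paragraph."""
    found = set()
    for label, patterns in CONCEPT_PATTERNS.items():
        if any(re.search(p, paragraph, flags=re.IGNORECASE) for p in patterns):
            found.add(label)
    return found


def count_cooccurrence(paragraphs):
    """Count, for each unordered concept pair, the paragraphs containing both."""
    pair_counts = Counter()
    for paragraph in paragraphs:
        labels = sorted(concepts_in(paragraph))
        for a, b in combinations(labels, 2):
            pair_counts[(a, b)] += 1
    return pair_counts


# Made-up sample paragraphs, for illustration only.
paragraphs = [
    "The next big thing beyond ERP is supply-chain management.",
    "Links between groupware and ERP applications speed access to business data.",
    "Linux adoption keeps growing.",
]
counts = count_cooccurrence(paragraphs)
```

Each paragraph contributes at most one count per pair, regardless of how many times a concept is repeated in it. Pair keys are stored as sorted tuples so that `("ERP", "SCM")` and `("SCM", "ERP")` are never counted separately.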
Co-Occurrence of IT Concepts

Example 1:
"Over the past few years, we have seen the ERP vendors, led by SAP, move into different business areas," says Byron Miller, an analyst with the Giga Information Group. "The competitive advantage of just having ERP has diminished. The next big thing beyond ERP is supply-chain management."

Example 2:
Links between groupware and ERP applications speed users' access from within a groupware application to key business data, such as purchase orders, inventory, customer histories, and other supply-chain information.

Hierarchical Clustering
[Figure: hierarchical clustering results, with groups labeled Cluster 1 through Cluster 8.]
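To show how co-occurrence counts can feed hierarchical clustering, here is a small pure-Python sketch of agglomerative clustering. The slides name hierarchical clustering but do not specify the distance measure or linkage rule, so the Jaccard distance, the average linkage, the function names, and all counts below are assumptions chosen for illustration, not the authors' settings or data.

```python
from itertools import combinations


def jaccard_distance(pair_count, count_a, count_b):
    """1 - (paragraphs with both) / (paragraphs with either)."""
    union = count_a + count_b - pair_count
    return 1.0 - (pair_count / union if union else 0.0)


def build_distances(concept_counts, pair_counts):
    """Map each unordered concept pair to a distance."""
    dist = {}
    for a, b in combinations(sorted(concept_counts), 2):
        together = pair_counts.get((a, b), pair_counts.get((b, a), 0))
        dist[frozenset((a, b))] = jaccard_distance(
            together, concept_counts[a], concept_counts[b]
        )
    return dist


def average_linkage(cluster_a, cluster_b, dist):
    """Mean pairwise distance between members of two clusters."""
    total = sum(dist[frozenset((a, b))] for a in cluster_a for b in cluster_b)
    return total / (len(cluster_a) * len(cluster_b))


def agglomerate(labels, dist, n_clusters):
    """Merge the two closest clusters until n_clusters remain."""
    clusters = [[label] for label in labels]
    while len(clusters) > n_clusters:
        best = None
        for i, j in combinations(range(len(clusters)), 2):
            d = average_linkage(clusters[i], clusters[j], dist)
            if best is None or d < best[0]:
                best = (d, i, j)
        _, i, j = best
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return clusters


# Made-up paragraph counts (assumption), not figures from the study.
concept_counts = {"ERP": 100, "SCM": 80, "CRM": 90, "iPod": 60, "MP3": 70}
pair_counts = {
    ("ERP", "SCM"): 40,
    ("CRM", "ERP"): 45,
    ("CRM", "SCM"): 30,
    ("MP3", "iPod"): 35,
}

dist = build_distances(concept_counts, pair_counts)
clusters = agglomerate(sorted(concept_counts), dist, n_clusters=2)
```

With these toy counts, the enterprise-software concepts (ERP, SCM, CRM) end up in one cluster and the consumer-device concepts (iPod, MP3) in the other. For the full set of 50 concepts, a library routine such as SciPy's `scipy.cluster.hierarchy.linkage` would be a more practical choice than this hand-written loop.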
Face Validity of Our Approach

Benefits of This Approach
Representative
- More IT concepts to study
- Monitor and understand popularity
- More data sources
- Represent reality by pooling data
- Compare to examine segments of communities
Dynamic
- Multiple periods
- Reveal what exactly is diffusing
- Visualize species and speciation of innovations
Scalable
Popularity of E-commerce & E-business
[Figure: number of paragraphs per year, 1998-2007, for eBiz and eCom. Source: InformationWeek]

Popularity of Web Services & SOA
[Figure: number of paragraphs per year, 1998-2007, for SOA and WebServ. Source: InformationWeek]
Implications for IT Management
- When expert knowledge is not readily available, this approach offers maps of IT domains or sub-domains.
- A new technology's cluster membership may suggest its broader type.
- Taxonomy is useful for vendors in product/service labeling and for adopters in IT portfolio management.

Takeaways
- Computational discourse analysis based on co-occurrence and hierarchical clustering can help us explore complex relationships among IT concepts in a representative, dynamic, and scalable way.
- Socio-technical approach: we used social artifacts (language/discourse) to chart technological terrains.
- Effective information management and effective IT management go hand in hand.
Thank You from the PopIT Team
Thanks to the National Science Foundation for grants IIS-0729459 and SBE-0915645.
http://terpconnect.umd.edu/~pwang/PopIT/
pwang@umd.edu