Fedelucio Narducci*, Matteo Palmonari *, Giovanni Semeraro° *DISCO, University of Milan-Bicocca, Italy ° Department of Computer Science, University of Bari Aldo Moro, Italy ! !"#$%&' " !('' AI for Smart Cities Workshop # &&!))'*'''''''''''''''''' AI*IA 2013 - 25th Year Anniversary $ !+),$#-./#%,$'' XIII Conference +!)!#+&0'1+,23'45$6,$.,'7!--,8' Turin (Italy), December 5, 2013 093:;;<<<=>.=2$.(#=.6;?)<#3''
http://www.smart-cities.eu/model.html 2
• A large amount of Open Government Data in many languages*: o 1,000,000+ datasets published online (February 2013) o 40 different countries o 24 different languages *http://logd.tw.rpi.edu/iogds_data_analytics 3
• Government service catalogs are part of the LOD cloud o Effective Service Delivery ( ESD )-toolkit o European Local Government Service List ( LGSL ) • 2000+ interlinked public services in 6 languages 4
• Advantages for PAs o Compare local service offerings with best practices in other countries o Support interoperability among PAs of different countries and other service providers o Enrich service descriptions with additional information via links to LGSL (e.g., link to life event ontologies) • Advantages for citizens o Find eGov services when in a foreign country o Towards cross-language service access Costly and Error Prone Activity Catalogs of several hundreds of services 5
Challenging cross-language matching • ! sameAs links problem Most of the approaches: • use structural information [Spohr et al. Semantic heterogeneity • 2011, Fu et al. 2011, Wang et al. 2009] or not a mere “translation” o long textual descriptions [Knoth et al. problem 2011] cultural bias o or report problems when automatic • translation returns descriptions with heterogeneous vocabulary [Hertling & Ultra-short descriptions Paulheim 2012] 6
• CroSeR o A tool to support the linkage of a source eGov service catalog represented in any language to a target catalog represented in English o Based on automatic translation and Explicit Semantic Analysis Web tool to support the linkage of a source eGov service catalog represented in any language to a target catalog represented in English Based on Machine Translation and Explicit Semantic Analysis (ESA) TRY IT @ http://siti-rack.siti.disco.unimib.it:8080/croser/ 7
• CroSeR o A tool to support the linkage of a source eGov service catalog represented in any language to a target catalog represented in English o Based on automatic translation and Explicit Semantic Analysis Web tool to support the linkage of a source eGov service catalog represented in any language to a target catalog represented in English 8 Based on Machine Translation and Explicit Semantic Analysis (ESA)
• CroSeR o A tool to support the linkage of a source eGov service catalog represented in any language to a target catalog represented in • Load a catalog English o Based on automatic translation and Explicit Semantic Analysis Web tool to support the linkage of a source eGov service catalog represented in any language to a target catalog represented in English 9 Based on Machine Translation and Explicit Semantic Analysis (ESA)
• CroSeR o A tool to support the linkage of a source eGov service catalog represented in any language to a target catalog represented in • Load a catalog English o Based on automatic translation and Explicit Semantic Analysis • Select a source service Web tool to support the linkage of a source eGov service catalog represented in any language to a target catalog represented in English 10 Based on Machine Translation and Explicit Semantic Analysis (ESA)
• CroSeR o A tool to support the linkage of a source eGov service catalog represented in any language to a target catalog represented in • Load a catalog English o Based on automatic translation and Explicit Semantic Analysis • Select a source service • Look at the retrieved services (link recommendations) Web tool to support the linkage of a source eGov service catalog represented in any language to a target catalog represented in English 11 Based on Machine Translation and Explicit Semantic Analysis (ESA)
• CroSeR o A tool to support the linkage of a source eGov service catalog represented in any language to a target catalog represented in • Load a catalog English o Based on automatic translation and Explicit Semantic Analysis • Select a source service • Look at the retrieved services (link recommendations) • Link SKOS broader / exact / narrower match Web tool to support the linkage of a source eGov service catalog represented in any language to a target catalog represented in English 12 Based on Machine Translation and Explicit Semantic Analysis (ESA)
• CroSeR o A tool to support the linkage of a source eGov service catalog represented in any language to a target catalog represented in English o Based on automatic translation and Explicit Semantic Analysis Web tool to support the linkage of a source eGov service catalog represented in any language to a target catalog represented in English 13 Based on Machine Translation and Explicit Semantic Analysis (ESA)
Machine Translation of Service Descriptions Extraction of ESA-based Top-k representations and indexing Service Retrieval (Vector Space Model) by Cosine Similarity 14
15 Technique able to provide a fine-grained semantic representation of natural language texts in a high-dimensional space of comprehensible concepts derived from Wikipedia [GM06] A,+->' A.E.3!>.#'F.!<!>'#)'#$' %&'%(%)* 'G' A#+'BB' @#$60!+#' #'&,--!&%,$',H'? +, '&,$&!36)' C#$!' D,$>#' B)-#$>' [GM06] E. Gabrilovich and S. Markovitch. Overcoming the Brittleness Bottleneck using Wikipedia: Enhancing Text Categorization with Encyclopedic Knowledge. In Proceedings of the 21th National Conf. on Artificial Intelligence and 15 the 18th Innovative Applications of Artificial Intelligence Conference , pages 1301–1306. AAAI Press, 2006.
CroSeR finds matchings that cannot be discovered by • machine translation + keyword comparison CroSeR’s recommendations can support the users to refine the • links GT:“Absentee Ballot” 16
• Model and Experimental Evaluation @ISWC 2013 o F. Narducci, M. Palmonari, G. Semeraro. Cross-Language Semantic Retrieval and Linking of E-Gov Services . " The Semantic Web - ISWC 2013 - 12th International Semantic Web Conference, Sydney, NSW, Australia, October 21-25, 2013, LNCS 8219, 130-145, Springer, 2013 • System Demo @ECIR 2014 o F. Narducci, M. Palmonari, G. Semeraro. CroSeR: Cross-language Semantic Retrieval of Open Government . " 36 th European Conference on Information Retrieval, Amsterdam, the Netherlands, April 13-16, 2014. To Appear 17
ECIR 2014 live demo: http://siti-rack.siti.disco.unimib.it:8080/croser/ 18
Recommend
More recommend