Semantic Resources and Machine Learning for Quality, Efficiency - PowerPoint PPT Presentation

MultilingualWeb, Luxembourg, 16 th of March, 2012: Results of a theme session on Semantic Resources and Machine Learning for Quality, Efficiency and Personalisation of Accessing Relevant Information over Language Borders (different languages and different uses of a same language)

Participants Timo Honkela, Aalto University (rapporteur) ● Peter Schmitz, Publications Office of the EU ● Elena Montanes, Oviedo University ● Tasos Koutoumanos, AgroKnow Tech., Greece ● Corinne Frappart, Publications Office of the EU ● Poul Andersen, WEB translation unit, EU Commission ● Ghassan Haddad, Facebook ● Spyridon Pilos, Language applications, European Commission ● Jose Emilio Labra Gayo, University of Oviedo, Spain ● Maria Pia Montoro, Intrasoft International, Luxembourg ● Daaniel Garcia Magarinos, European Central Bank ●

Quality and consistency versus accessibility and contextual appropriateness of terminology ● Terms good for experts in different domains versus laypersons ● Case: “member state” versus “EU country” ● Case: “human trafficking” versus “modern slavery” ● Case: Bank note security features ● A thesaurus was created as a mapping from technical terms to colloquial language (“iridescent stripe” to “glossy stripe”) ● Case: legislation (Asturias region in Spain): mapping of colloquial terms to official terms, new project: library of congress in Chile

Quality and consistency versus accessibility and contextual appropriateness of terminology ● Convergent and divergent processes in language use ● Ontologies: carefully crafted resources that require considerable resources for implementation and use ● Folksonomies: resources that provide information on the variation and are constructed by the crowds > Possibility to model the crowdsourced data using machine learning techniques

Multilingual contents and thesauri: trust and quality ● Use of EU-generated resources such as ● Eurovoc ● JRC-Names ● Importance of linked open data (LOD) ● Choosing keywords from a controlled vocabulary ● Connecting different term versions with an ontology (or folksonomy) ● Determining a proper contexts using LOD ● Multilingual content: provenance of data ● Quality assurance of LOD

Effect of context in translation: need for context-rich representations ● Often the variation in translation of terminology stems from contextual factors ● It would be important to store enough contextual information in order to facilitate appropriate choices

Social and cognitive levels of language use ● Push and pull of terminology ● Regulation and market economy of language ● Different levels of expertise ● Experts in different domains versus laypersons ● Take home messages: ● Variation among language in conceptual structures (challenges for ontology translation) ● Semantic variation among languge users

Space under Construction Language-Specific Spatial Categorization In First Language Acquisition Melissa Bowerman Max Planck Institute for Psycholinguistics Lund University Cognitive Science 2003

DUTCH OP AAN IN OP AAN IN

Categorization of `opening’ in English and Korean . TTUT YEL TA A 'remove barrier 'tear away PELLITA to interior space' from base' 'separate two parts symmetrically' open take off spread wallpaper open box legs apart open door open unwrap open clamshell mouth open bag package envelope open pair of shutters OPEN open latched eyes open take off drawer open hand ring sun rises open book take cassette open TTUTA out of case fan ‘rise’ spread blanket out PPAYTA peacock spreads tail ‘unfit’ PHYELCHITA 'spread out flat thing'

(Pye 1995, 1996) PLATE STICK ROPE CLOTHES ENGLISH break break break tear, rip duàn MANDARIN può può può (long rigid thing) K’ICHE’ -paxi:j -tóqopi’j -q’upi:j rach’aqij MAYAN (rock, glass, (long, flexible (other hard (“tear”) clay thing) thing) thing) http://www.mpi.nl/people/bowerman-melissa http://www.mpi.nl/people/bowerman-melissa/publications

User-specific difficulty measure Paukkeri, Ollikainen & Honkela, submitted

GICA analysis: Word 'health' in State of the Union Addresses GICA: Grounded Intersubjectivity Concept Analysis Timo Honkela, Juha Raitio, Krista Lagus, Ilari T. Nieminen, Nina Honkela, and Mika Pantzar. Subjects, objects and contexts: Using GICA method to quantify epistemological subjectivity . In Proceedings of IJCNN 2012, International Join Conference on Neural Networks, to appear.

Core of GICA: Subject-Object-Context Tensors Timo Honkela, Nina Janasik, Krista Lagus, Tiina Lindh-Knuutila, Mika Pantzar, and Juha Raitio. GICA: Grounded intersubjective concept analysis - a method for enhancing mutual understanding and participation. Technical Report TKK-ICS-R41, AALTO-ICS, ESPOO, December 2010. http://users.ics.tkk.fi/tho/info/TKK-ICS-R41.shtml http://users.ics.tkk.fi/tho/publications.shtml

Guidelines are needed on how to publish data in multiple languages ● Different versions in different languages ● Alternative language versions ● A standard way of describing how how different versions are related to each other ● Case FAO: Translations should refer back to the original documents

Linport has related objectives

Semantic Resources and Machine Learning for Quality, Efficiency - PowerPoint PPT Presentation

MultilingualWeb, Luxembourg, 16 th of March, 2012: Results of a theme session on Semantic Resources and Machine Learning for Quality, Efficiency and Personalisation of Accessing Relevant Information over Language Borders (different languages

Machine Learning for Annotating Semantic Web Services Andreas He, Nicholas Kushmerick

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

Creating Semantic Mashups: Bridging Web 2.0 and the Semantic Web Jamie Taylor, Colin Evans, Toby

Align, Disambiguate, and Walk A Unified Approach for Measuring Semantic Similarity Semantic

Module 13 Introduction to Semantic Technology, Ontologies and the Semantic Web Module 13 Outline

: on the Semantic Web : on the Semantic Web Building a Semantic Prototype for Danish Building a

Semantic Processing Augmenting CFGs Currying Quantifier scope Semantic Grammars L445 / L545

Semantic Similarity MultiJEDI ERC 259234 Semantic Similarity Semantic Similarity Mostly

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

Application: Semantic Role Labeling CS 6956: Deep Learning for NLP Overview What is semantic

Welcome to the Machine Learning Toolbox! Machine Learning Toolbox Supervised learning caret

Human and Machine Learning Tom Mitchell Machine Learning Department Carnegie Mellon University

Development of Tungsten Monoblock Technology for ITER Full-Tungsten Divertor in Japan 25th

Sc a nning a nd T o ssing I ma g ing Re q uire me nts fo r Pa pe r Ba se d Re c o rds

6 th February, 2018 BEE LABELLING PROGRAM UPDATE 1 st Jan 2016 31 st Dec 2017 1 st Jan 2018

Soft-Close Undermount Drawer Slides The Next Evolution in Undermount Technology The next

Development of molecular cassettes for the excitation energy transfer in the red region of the

Downsizing Your Home, A Three Week Guide to Making the Most of Moving On WEEK O ONE Slide 1

Knape &Vogt Slides Last Updated: 07/02/10 M averick Hardware KV Slides Medium Duty Slides

Gotcha!* Upgrading PDF plugins to DITA OT 2.x *and some helpful hints too Leigh White, DITA

Sambuz

Useful Links

Newsletter

Mail Us

Semantic Resources and Machine Learning for Quality, Efficiency - PowerPoint PPT Presentation

MultilingualWeb, Luxembourg, 16 th of March, 2012: Results of a theme session on Semantic Resources and Machine Learning for Quality, Efficiency and Personalisation of Accessing Relevant Information over Language Borders (different languages

Machine Learning for Annotating Semantic Web Services Andreas He, Nicholas Kushmerick

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

Creating Semantic Mashups: Bridging Web 2.0 and the Semantic Web Jamie Taylor, Colin Evans, Toby

Align, Disambiguate, and Walk A Unified Approach for Measuring Semantic Similarity Semantic

Module 13 Introduction to Semantic Technology, Ontologies and the Semantic Web Module 13 Outline

: on the Semantic Web : on the Semantic Web Building a Semantic Prototype for Danish Building a

Semantic Processing Augmenting CFGs Currying Quantifier scope Semantic Grammars L445 / L545

Semantic Similarity MultiJEDI ERC 259234 Semantic Similarity Semantic Similarity Mostly

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

Application: Semantic Role Labeling CS 6956: Deep Learning for NLP Overview What is semantic

Welcome to the Machine Learning Toolbox! Machine Learning Toolbox Supervised learning caret

Human and Machine Learning Tom Mitchell Machine Learning Department Carnegie Mellon University

Development of Tungsten Monoblock Technology for ITER Full-Tungsten Divertor in Japan 25th

Sc a nning a nd T o ssing I ma g ing Re q uire me nts fo r Pa pe r Ba se d Re c o rds

6 th February, 2018 BEE LABELLING PROGRAM UPDATE 1 st Jan 2016 31 st Dec 2017 1 st Jan 2018

Soft-Close Undermount Drawer Slides The Next Evolution in Undermount Technology The next

Development of molecular cassettes for the excitation energy transfer in the red region of the

Downsizing Your Home, A Three Week Guide to Making the Most of Moving On WEEK O ONE Slide 1

Knape &amp;Vogt Slides Last Updated: 07/02/10 M averick Hardware KV Slides Medium Duty Slides

Gotcha!* Upgrading PDF plugins to DITA OT 2.x *and some helpful hints too Leigh White, DITA

Sambuz

Useful Links

Newsletter

Mail Us

Knape &Vogt Slides Last Updated: 07/02/10 M averick Hardware KV Slides Medium Duty Slides