A FRAMEWORK FOR MULTILINGUAL AND SEMANTIC ENRICHMENT OF DIGITAL - PowerPoint PPT Presentation

Co-funded by the Horizon 2020 Framework Programme of the European Union Grant Agreement Number 644771 A FRAMEWORK FOR MULTILINGUAL AND SEMANTIC ENRICHMENT OF DIGITAL CONTENT (NEW L10N BUSINESS OPPORTUNITIES) FREME WEBINAR HELD FOR GALA, 28 APRIL 2016 Presented by Tatjana Gornostaja (Tilde) and www.freme-project.eu Felix Sasaki (DFKI / W3C Fellow) WWW.FREME-PROJECT.EU 1 FREME Webinar for GALA – April 2016

OVERVIEW • Introduction • Technological aspects of the framework • Localization and other FREME business cases • Q&A WWW.FREME-PROJECT.EU 2 FREME Webinar for GALA – April 2016

Cou Coupl pling ing Know Kn owledge ledge an and Lan d Languag guage vi via e a e-Ser Service vice Ec Ecos osys ystem tem WWW.FREME-PROJECT.EU 3 FREME Webinar for GALA – April 2016

Language Knowledge WWW.FREME-PROJECT.EU 4 FREME Webinar for GALA – April 2016

Knowledge Language WWW.FREME-PROJECT.EU 5 FREME Webinar for GALA – April 2016

Knowledge Language WWW.FREME-PROJECT.EU 6 FREME Webinar for GALA – April 2016

FRE FREME ME Picture: coloringpageswallpaper.com WWW.FREME-PROJECT.EU 7 FREME Webinar for GALA – April 2016

THE FREME PROJECT • Two year H2020 Innovation action; start February 2015 • Industry partners leading four business cases around digital content and (linked) data • Technology development bridging language and data • Outreach and business modelling demonstrating monetization of the multilingual data value chain WWW.FREME-PROJECT.EU 8 FREME Webinar for GALA – April 2016

CURRENT STATE OF SOLUTIONS Machine Linked data translation, creation & terminology processing annotation, ... GAPS THAT HINDER BUSINESS: • Plethora of formats • Adaptability and platform dependency • Language coverage • Usability “The right tool for the right person in given and new enterprises”: technology influences job profiles WWW.FREME-PROJECT.EU 9 FREME Webinar for GALA – April 2016

FREME TO THE RESCUE: ENRICHING DIGITAL CONTENT Machine Linked data translation, creation & terminology processing annotation, ... A SET OF INTERFACES* - DESIGN DRIVEN BY BUSINESS CASES LT and LD for various * Graphical interfaces user types: (application) * Software Interfaces LT and LD as first class developer, content citizens on the Web architect, content author, … WWW.FREME-PROJECT.EU 10 FREME Webinar for GALA – April 2016

WWW.FREME-PROJECT.EU 11 FREME Webinar for GALA – April 2016

FREME FROM A TECHNICAL PERSPECTIVE A framework for multilingual and semantic enrichment of digital content that provides access via a set of APIs and GUIs to six E- services. • e-Entity for enriching content with information on named entities; • e-Link for enrichment with linked data sources; • e-Terminology for detecting terms and enriching them with term related information; • e-Translation for providing custom machine translation systems; • e-Internationalisation for processing a variety of digital content formats; and • e-Publishing for exporting the outcome of enrichment processes in the ePub format. WWW.FREME-PROJECT.EU 13 FREME Webinar for GALA – April 2016

FREME FROM A TECHNICAL PERSPECTIVE How to access FREME – several options: • A life version 0.5 (0.6 soon to be released!) including documentation at http://api.freme-project.eu/doc/current/ • A development version at http://api-dev.freme-project.eu/doc/ • A Java / maven software package; see the documentation for installation instructions • Source code in a GitHub project https://github.com/freme-project/ • The framework is available under Apache 2.0 license to ease commercial use • Underlying services have various licensing conditions WWW.FREME-PROJECT.EU 14 FREME Webinar for GALA – April 2016

LINGUISTIC LINKED DATA AND OTHER STANDARDS PUT IN ACTION VIA FREME • NIF (Natural Language Processing Interchange Format) for representing digital content and enrichment information in a format agnostic manner, based on the linked data stack; • OntoLex lemon for representing lexical information, to be used e.g. for improving machine translation output; • Internationalization Tag Set 2.0 for representing various types of enrichment information in a standardized manner, related e.g. to terminology named entities; and • The general linked data technology stack (RDF, SPARQL etc.) FREME is built on outcomes of standard driving projects in FP7 in the area of linguist linked data: LIDER and FALCON Cf. http://lider-project.eu/ and http://falcon-project.eu/ WWW.FREME-PROJECT.EU 15 FREME Webinar for GALA – April 2016

EXAMPLE API CALL • The request is made to the API for the e-Entity service, a service that enriches content with named entities. • The input format of content is plain text; the output format is turtle. • The content to enrich is “Welcome to the city of Prague”. • The language or the content is English. • The dataset used for the enrichment is DBpedia. WWW.FREME-PROJECT.EU 16 FREME Webinar for GALA – April 2016

EXAMPLE OUTPUT: USING NIF TO STORE CONTENT … (1) <http://freme-project.eu/#char=0,29> (2) a nif:String , nif:Context , nif:RFC5147String ; (3) nif:beginIndex "0"^^xsd:int ; (4) nif:endIndex "29"^^xsd:int ; (5) nif:isString "Welcome to the city of Prague"^^xsd:string . 1) Identifying the content via a URI 2) Adding certain types from NIF* 3) Identifying the start offset of the content 4) Identifying the end offset of the content 5) Providing the string content itself. * For More on NIF: see a dedicated tutorial http://de.slideshare.net/m1ci/nif-tutorial WWW.FREME-PROJECT.EU 17 FREME Webinar for GALA – April 2016

… AND ENRICHMENT INFORMATION (1) <http://freme-project.eu /#char=23,29> … (2) nif:anchorOf "Prague"^^xsd:string ; (3) nif:beginIndex "23"^^xsd:int ; (4) nif:endIndex "29"^^xsd:int ; (5) nif:referenceContext <http://freme-project.eu/#char=0,29> ; (6) itsrdf:taClassRef <http://dbpedia.org/ontology/City>. 1) Identifying the annotation via a URI 2) Providing the string content of the annotation 3) Identifying the start offset of the content 4) Identifying the end offset of the content 5) Relating the content to annotations 6) Enrichment with ITS 2.0 class information (“Prague” = a city) WWW.FREME-PROJECT.EU 18 FREME Webinar for GALA – April 2016

SIMPLIFIED OUTPUT HELPS API DEVELOPERS TO CONSUME LINKED DATA • FREME provides user specified filter mechanism to simply the output • Supports CVS, XML or JSON • Example output as CSV http://dbpedia.org/resource/Prague,50.0878367932108,14.424132200 1241 For more infos on filtering, see http://api.freme-project.eu/doc/current/knowledge-base/filtering.html WWW.FREME-PROJECT.EU 19 FREME Webinar for GALA – April 2016

FORMAT COVERAGE • Processing of various content formats ◦ NIF, RDF, Text, HTML, OpenOffice, XLIFF 1.2, various XML formats, … • Many formats are processed via e-Internationalization services • Format specified in API call as input and (partially supported) output WWW.FREME-PROJECT.EU 20 FREME Webinar for GALA – April 2016

USING E-TERMINOLOGY WITH HTML OUTPUT <!DOCTYPE html> … <body> Welcome to the city of Prague. </body> … </html> Call of e-Terminology <!DOCTYPE html> … Welcome to the city of Prague. …</html> WWW.FREME-PROJECT.EU 21 FREME Webinar for GALA – April 2016

TRANSLATING XLIFF CONTENT WITH E-TRANSLATION ...<trans-unit> <source>This is car</source> </trans-unit> ... Call of e-Translation <http://freme-project.eu/#char=0,13> nif:isString "This is a car"@en itsrdf:target "Dies ist ein Auto"@de . WWW.FREME-PROJECT.EU 22 FREME Webinar for GALA – April 2016

IMPROVING E-TRANSLATION OUTPUT VIA E-TERMINOLOGY “The EU in brief. The EU is a unique economic and political partnership between 28 European countries that together cover much of the continent.” Call of e-Terminology: detection of translation suggestions continent, partnership, briefing, economics, covering Call of e-Translation: improved output! De voorschriften in DE EU. De EU is een uniek partnerschap tussen politiek en economie in de Europese landen, die gezamenlijk 28 verpakking van het continent. WWW.FREME-PROJECT.EU 23 FREME Webinar for GALA – April 2016

MOTIVATION • Aid translators ◦ Supplement typical linguistic support tools like glossary look-up with entity recognition and term disambiguation ◦ Possibility to introduce proprietary and domain-specific semantic datasets • Provide “Value - Add” to customers ◦ Make their content more interactive, compelling and discoverable ◦ Open up service offerings to new customers from existing and new channels WWW.FREME-PROJECT.EU 25 FREME Webinar for GALA – April 2016

A FRAMEWORK FOR MULTILINGUAL AND SEMANTIC ENRICHMENT OF DIGITAL - PowerPoint PPT Presentation

Co-funded by the Horizon 2020 Framework Programme of the European Union Grant Agreement Number 644771 A FRAMEWORK FOR MULTILINGUAL AND SEMANTIC ENRICHMENT OF DIGITAL CONTENT (NEW L10N BUSINESS OPPORTUNITIES) FREME WEBINAR HELD FOR GALA, 28

Hardware Observability Framework Hardware Observability Framework Hardware Observability

Play Framework One Web Framework to rule them all Felix Mller Agenda Yet another web

A Spatial Cloaking Framework A Spatial Cloaking Framework A Spatial Cloaking Framework A Spatial

A framework for linking land use and A framework for linking land use and A framework for linking

LEGAL FRAMEWORK FOR STRENGTHENING LEGAL FRAMEWORK FOR STRENGTHENING LEGAL FRAMEWORK FOR

Technical Framework Technical Framework Technical Framework Supporting eBusiness eBusiness

National Improvement Framework for Scottish Education The National Improvement Framework Aim

Implementation of global framework agreements in transnational Implementation of global framework

Northeast Conservation h Framework Framework What is it and why do we need it? National LCC

Spring 2013 1 A Framework for Junior Cycle The Framework Published by Minister Quinn on 4

ACADEMIC PERFORMANCE FRAMEWORK 1 AGENDA Why an Academic Performance Framework? What is

Framework & update Dr. Clara Bocchino SADC TFCA M&E Framework & Piloting Consultant

April 2009 Wes J. Lloyd 1 Framework Invasiveness Coupling between application code and

Unit T esting Framework for T cl Unit T esting Framework for T cl What is Unit T

A Linear Logical A Linear Logical A Linear Logical Framework Framework Framework Iliano

A Recareering Framework @fuzzing_panda A Recareering Framework 1. Framework a. Recon b.

Learning Representations of Relational Data Sebastijan Dumani DTAI, CS Department, KU Leuven

Ramsey regularity, MAD families, and their relatives David Schrittesser (KGRC) Joint work with

ARNOLD STABILITY of TIME-OSCILLATING FLOWS Legacy of Vladimir Arnold Fields Institute, November,

Logistic Regression, Gradient Descent, and Newton Method Matthieu R. Bloch 1 Maximum Likelihood

Random matrix ensembles for quantum spins and decoherence Franois David IPhT Saclay & CNRS

The Total Least Squares Problem with Multiple Right-Hand Sides A X B Martin Ple singer in

Promoting Education under Distortionary Taxation: A Comparison between Equality of Opportunity

Random Processes DS GA 1002 Probability and Statistics for Data Science

Sambuz

Useful Links

Newsletter

Mail Us

A FRAMEWORK FOR MULTILINGUAL AND SEMANTIC ENRICHMENT OF DIGITAL - PowerPoint PPT Presentation

Co-funded by the Horizon 2020 Framework Programme of the European Union Grant Agreement Number 644771 A FRAMEWORK FOR MULTILINGUAL AND SEMANTIC ENRICHMENT OF DIGITAL CONTENT (NEW L10N BUSINESS OPPORTUNITIES) FREME WEBINAR HELD FOR GALA, 28

Hardware Observability Framework Hardware Observability Framework Hardware Observability

Play Framework One Web Framework to rule them all Felix Mller Agenda Yet another web

A Spatial Cloaking Framework A Spatial Cloaking Framework A Spatial Cloaking Framework A Spatial

A framework for linking land use and A framework for linking land use and A framework for linking

LEGAL FRAMEWORK FOR STRENGTHENING LEGAL FRAMEWORK FOR STRENGTHENING LEGAL FRAMEWORK FOR

Technical Framework Technical Framework Technical Framework Supporting eBusiness eBusiness

National Improvement Framework for Scottish Education The National Improvement Framework Aim

Implementation of global framework agreements in transnational Implementation of global framework

Northeast Conservation h Framework Framework What is it and why do we need it? National LCC

Spring 2013 1 A Framework for Junior Cycle The Framework Published by Minister Quinn on 4

ACADEMIC PERFORMANCE FRAMEWORK 1 AGENDA Why an Academic Performance Framework? What is

Framework &amp; update Dr. Clara Bocchino SADC TFCA M&amp;E Framework &amp; Piloting Consultant

April 2009 Wes J. Lloyd 1 Framework Invasiveness Coupling between application code and

Unit T esting Framework for T cl Unit T esting Framework for T cl What is Unit T

A Linear Logical A Linear Logical A Linear Logical Framework Framework Framework Iliano

A Recareering Framework @fuzzing_panda A Recareering Framework 1. Framework a. Recon b.

Learning Representations of Relational Data Sebastijan Dumani DTAI, CS Department, KU Leuven

Ramsey regularity, MAD families, and their relatives David Schrittesser (KGRC) Joint work with

ARNOLD STABILITY of TIME-OSCILLATING FLOWS Legacy of Vladimir Arnold Fields Institute, November,

Logistic Regression, Gradient Descent, and Newton Method Matthieu R. Bloch 1 Maximum Likelihood

Random matrix ensembles for quantum spins and decoherence Franois David IPhT Saclay &amp; CNRS

The Total Least Squares Problem with Multiple Right-Hand Sides A X B Martin Ple singer in

Promoting Education under Distortionary Taxation: A Comparison between Equality of Opportunity

Random Processes DS GA 1002 Probability and Statistics for Data Science

Sambuz

Useful Links

Newsletter

Mail Us

Framework & update Dr. Clara Bocchino SADC TFCA M&E Framework & Piloting Consultant

Random matrix ensembles for quantum spins and decoherence Franois David IPhT Saclay & CNRS