jorge gr jo gracia ia jose labra jo labra
play

Jorge Gr Jo Gracia ia Jose Labra Jo Labra Ontology Engineering - PowerPoint PPT Presentation

Jorge Gr Jo Gracia ia Jose Labra Jo Labra Ontology Engineering Group (OEG) Web Semantics Oviedo (WESO) Universidad Politcnica de Madrid (UPM) University of Oviedo jgracia@fi.upm.es labra@uniovi.es Multilingual Web Workshop Madrid


  1. Jorge Gr Jo Gracia ia Jose Labra Jo Labra Ontology Engineering Group (OEG) Web Semantics Oviedo (WESO) Universidad Politécnica de Madrid (UPM) University of Oviedo jgracia@fi.upm.es labra@uniovi.es Multilingual Web Workshop Madrid (Spain) 7-8 May 2014

  2. } Motivation } The group } Main goals } Activities } Where are we now? Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 2

  3. Moti tivati tion Multilingual Web Workshop Madrid, May 2014 3

  4. Monolingual Multilingual datasets datasets 349 635 676 1,906 2,201 1,984 January 2012 June 2012 December 2012 A. Gómez-Pérez, D. Vila-Suero, E. Montiel-Ponsoda, J. Gracia, and G. Aguado-de Cea, "Guidelines for multilingual linked data," in Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics, ser. WIMS '13. New York, NY, USA: ACM, Jun. 2013. Multilingual Web Workshop Madrid, May 2014 4

  5. RDF literals without RDF literals with language tag language tag 2,567,324 3,154,779 3,365,930 10,250,936 10,594,338 12,272,806 January 2012 June 2012 December 2012 A. Gómez-Pérez, D. Vila-Suero, E. Montiel-Ponsoda, J. Gracia, and G. Aguado-de Cea, "Guidelines for multilingual linked data," in Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics, ser. WIMS '13. New York, NY, USA: ACM, Jun. 2013. Multilingual Web Workshop Madrid, May 2014 5

  6. Multilingual Web Workshop Madrid, May 2014 6

  7. Vocabulary selection RDF generation Data Interlinking Web Publishing Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 7

  8. http ttp://example.org/Spain http ttp://example.org/I23AX4 X45 http ttp://example.org/Es España Multilingual Web Workshop Madrid, May 2014 8

  9. Multilingual Web Workshop Madrid, May 2014 9

  10. Multilingual Web Workshop Madrid, May 2014

  11. Th The g e grou roup Multilingual Web Workshop Madrid, May 2014 11

  12. W3C community group on Best Practises for Multilingual Linked (Open) Data https://www.w3.org/community/bpmlod 
 Started on June 2013 bi-weekly telcos 3 chairs. Currently: José Labra Jorge Gracia John McCrae 67 members from academia and industry Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 12

  13. and many others … Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 13

  14. Main Main g goals oals Multilingual Web Workshop Madrid, May 2014 14

  15. Crowdsourcing ideas from the community regarding best practices to produce multilingual linked (open) data. Documenting patterns and best practices for the creation, linking, and use of multilingual linked data. Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 15

  16. Linked Data for Language Ontology lexica (Ontolex) Technologies (LD4LT) BP for lemon BP for LD in LT using lemon Use Cases specification BPMLOD BP for Multlingual Data on the Web BP for Data on the Web Data on the Web Best Practices Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 16

  17. Acti tiviti ties Multilingual Web Workshop Madrid, May 2014 17

  18. TOPIC classification USE CASES PATTERNS BEST PRACTISES & GUIDELINES Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 18

  19. TOPIC classification NAMING Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 19

  20. � Naming Opaque URIs, Descriptive URIs, IRIs, … � Textual information Language tags, linguistic information, … � Linking Interlanguage links, owl:sameAs, … � Ontologies and vocabularies Mono/multilingual vocabularies, ontology localisation… � Quality of MLOD � Tools and examples of MLOD � Other related aspects licensing, legal aspects, … https://www.w3.org/community/bpmlod/wiki/Topic_classification Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 20

  21. TOPIC classification USE CASES Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 21

  22. USE CASES 1. Localization workflow [D. Lewis] 2. Lexicalisation of RDF Datasets [E. Montiel, G. Dunshire] 3. Ontology localisation [E. Montiel, L. Aguado, G. Dunsire] 4. Crosslingual linked data matching [J. Gracia] 5. Machine translation [T. Heuss] 6. Application localization [J. McCrae] CASE STUDIES 1. Translations of multilingual terminologies for libraries [G. Dunsire] https://www.w3.org/community/bpmlod/wiki/Use_cases_definition Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 22

  23. TOPIC classification USE CASES PATTERNS Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 23

  24. Difficult to establish a boundary between Patterns vs Best Practices vs Bad smells By now: we identify the main practices Bad/Good may depend on the context/use case Examples: ◦ Patterns for naming and dereferencing Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 24

  25. } Example: URI for Armenia? Descriptive URIs http://example.org/Armenia ¡ Human-readable May be unreadable for non-Latin alphabet users Opaque URIs http://example.org/I23AX45 ¡ Good tool support Difficult to be descriptive enough in some contexts %-encoding non-ASCII characters Independence between concept and language Non Human-readable http:// օրինակ . օրգ # Հայաստան ¡ Full IRIs http://example.org/Espa%3Fa Maintenance: changes in text don't affect URI Difficult to handle by developers ¡ Suitable for LD generation Readable (for one language) Security issues (spoofing) Internationalized paths only http://example.org# Հայաստան ¡ Unreadable for speakers of other languages Tool support Unreadable for speakers of other languages Less security issues http://hy.example.org# Հայաստան ¡ Language in host name Path readable (for one language) http://en.example.org#Armenia ¡ Where should we put the language tag? Practical reasons http://example.org/Armenia.en ¡ Dialects can become unwieldy Language in Path Independent development of http://example.org/en/Armenia ¡ Example: languages & sublanguages datasets by language http://example.org/Armenia?lang=en ¡ hy-­‑Latin-­‑IT-­‑arevela ¡ Compatible with content negotiation Dialects ¡ Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 25

  26. } Which data should I return when accessing a URI? No language content negotiation http://example.org/Armenia ¡ Ignore Accept-language...all the data ¡<> ¡rdfs:label ¡"Armenia"@en, ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡" Հայաստան "@hy ¡. Easy to develop Clients have to filter triples in other languages Language content negotiation http://example.org/Armenia ¡ Consistency of data Bandwidth overhead Accept-­‑language:hy ¡ Accept-­‑language:en ¡ :<> ¡rdfs:label ¡" Հայաստան "@hy ¡. ¡ <> ¡rdfs:label ¡"Armenia"@en ¡. ¡ http://example.org/Armenia ¡ Language content redirection Less network overhead Difficult to implement Looses data Accept-­‑language:hy ¡ Accept-­‑language:en ¡ 303 ¡ 303 ¡ See ¡also: ¡http://example.org/Armenia.en ¡ See ¡also:http://example.org/Armenia.hy ¡ Keeps difference between concept More difficult to implement and language representation Not always feasible Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 26

  27. TOPICS classification USE CASES PATTERNS BEST PRACTISES & GUIDELINES Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 27

  28. Some (future) EXAMPLES. Guidelines for: Linguistic Linked Data generation RDF and Ontology translation Multilingual Linked Data generation, publication and exploitation ... Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 28

  29. Wh Where are w ere are we n e now ow? Multilingual Web Workshop Madrid, May 2014 29

  30. TOPICS classification We are here (Patterns for textual information) USE CASES PATTERNS BEST PRACTISES & GUIDELINES Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 30

  31. Thanks… and get involved! Next telco: Thursday 22nd May 10:00 CEST https://www.w3.org/community/bpmlod 
 Multilingual Web Workshop Madrid, May 2014 Multilingual Web Workshop Madrid, May 2014 31

Recommend


More recommend