Darmstadt Knowledge Processing Repository Based on UIMA Iryna - PowerPoint PPT Presentation

Darmstadt Knowledge Processing Repository Based on UIMA Iryna Gurevych, Max Mühlhäuser, Christof Müller, Jürgen Steimle, Markus Weimer, Torsten Zesch Ubiquitous Knowledge Processing Group Telecooperation, Computer Science Department Darmstadt University of Technology

Telecooperation

THESEUS Darmstadt Knowledge Processing Software Ubiquitous Knowledge Processing Repository AQUA SIR

A utomatic Qu ality A ssessment and Feedback in eLearning 2.0 (AQUA) AQUA 5

User Generated Discourse in Web 2.0

AQUA – Anoto pen

AQUA - System Architecture Natural Language Processing Machine Learning

AQUA – System Architecture

SIR (in cooperation with Prof. Hinrichs) • Semantic Information Retrieval Natural language low level expression of communication information need interface Bridge the human – computer gap Semantic search (SIR) based on semantic relatedness Natural language low level expression of communication information need interface

Information Retrieval (IR) Boolean, Vector Space, ... Document ... Keywords Document 2 � Document 1 Document ... Document ... Document ... Document 3

SIR-Project baker, to program, Semantic Relatedness quality assurance Profession ... Essay Profession 2 Profession 1 cake, computer, Profession ... to read, ... Profession ... Profession ... Semantic search (SIR) based on semantic relatedness Profession 3 Natural language low level expression of communication information need interface

SIR Example find good index terms Compound Splitting Negation Detection WSD compute semantic relatedness

THESEUS - TEXO • Large-scale BMBF-Project, industry (SAP, Siemens, etc.) • Service Marketplaces in Web 2.0 � Find services, both users and machines • Problem: � Only keyword-based search � Lack of ontologies for semantic search • Solution: � Use natural language descriptions of web services � Apply Semantic Information Retrieval � Community Mining for optimized service selection � Darmstadt Knowledge Processing Repository

UIMA components SIR AQUA THESEUS Wikipedia reader, Forum reader , Plain text reader Data import Tokenizer, Sentence splitter, Stopword tagger Linguistic preprocessing Stemmer, Lemmatizer, Compound Splitter Morphological analysis PoS-Tagger, Parser Syntactic analysis NE tagger, Sentiment detector, WSD component Semantic analysis Swear word tagger (AQUA), Negation detection (SIR) Project specific analysis Indexer (Lucene, Terrier), ARFF export Data export

Advantages of UIMA • Components can be shared between projects • Shared model of thinking � “Reader + Annotators + Consumer” � Configuration of components • Descriptive component orchestration

Challenges • Agree on a type system � No automatic type mapping • Some rough edges in UIMA � No real plug’n’work with PEAR packages � Using constraints to align annotations seems to be slow

Wish list • Automatic type matching • Better tool support � Improving Eclipse plug-ins (robustness, features) � Refactoring of UIMA components � CPE runner ++ (automatic logging, performance monitor, etc.) • Plug’n’work approach • “Import by name” in CPEs � Or make ${CPM_HOME}/path also work for readers/consumers • Construct XML descriptors from Java annotations • More intuitive API

Thank you very much! Thank you very much! • Acknowledgements: � DFG for funding “Semantic Information Retrieval” � DFG for funding “Automatic Quality Assessment and Feedback in eLearning 2.0” http://www.ukp.tu-darmstadt.de/

Darmstadt Knowledge Processing Repository Based on UIMA Iryna - PowerPoint PPT Presentation

Darmstadt Knowledge Processing Repository Based on UIMA Iryna Gurevych, Max Mhlhuser, Christof Mller, Jrgen Steimle, Markus Weimer, Torsten Zesch Ubiquitous Knowledge Processing Group Telecooperation, Computer Science Department

Development of IBM Watson with UIMA DUCC Eddie Epstein eae@apache.org Apache UIMA PMC Member

GATE and UIMA in Language Technology Teaching Graham Wilcock University of Helsinki

Iterative Learning of Relation Patterns for Market Analysis with UIMA Sebastian Blohm , Jrgen

Advanced GATE Embedded Additional material: UIMA/GATE integration Fifth GATE Training Course

Processing Dialogue-Based Data in the UIMA Framework Milan Gnjatovi , Manuela Kunze, Dietmar

An UIMA-based Tool Suite for Semantic Text Processing Katrin Tomanek, Ekaterina Buyko, Udo Hahn

Knowledge-Based Agents knowledge knowledge representation, knowledge base, types of knowledge

Three-nucleon forces and exotic nuclei Javier Menndez Institut fr Kernphysik (TU Darmstadt)

UIMA: Unstructured Information Management Architecture Alessandro Moschitti Department of

UIMA-based Annotation Type System for a Text Mining Architecture Udo Hahn, Ekaterina Buyko ,

Advanced GATE Embedded Track II, Module 8 Second GATE Training Course May 2010 Advanced GATE

Plan for today Knowledge-based systems 1 Explicit knowledge Knowledge Representation Inferred

Plan for today Knowledge-based systems 1 Tacit knowledge Knowledge Representation Inferred

Limited Use Repository Updates Citizens Coordination Council April 18, 2018 Craig Cameron U.S.

Repository (IDR) Dr. Chris Harle Becky Liao Integrated Data Repository (IDR) Mar. 3, 2020

Status of the Repository at Status of the Repository at Yucca Mountain Presented to: DOE-EM

Information Retrieval Evaluation (COSC 488) Nazli Goharian nazli@cs.georgetown.edu @ Goharian,

Utilizing Knowledge Bases for Text Retrieval: A Wishlist for Text Retrieval: A Wishlist

Digital preservation at Wellcome Alex Chan ~ a.chan@wellcome.ac.uk ~ they/them Senior

XML Out-Of-Band Data Retrieval Timur Yunusov Alexey

Natural Language Processing with Deep Learning Neural Information Retrieval Navid Rekab-Saz

Heterogenous Private Information Retrieval Hamid Mozaffari, Amir Houmansadr University of

How to Read Paintings: Semantic Art Understanding with Multi-Modal Retrieval Noa Garcia &

Introduction to Information Retrieval http://informationretrieval.org IIR 1: Boolean Retrieval

Sambuz

Useful Links

Newsletter

Mail Us

Darmstadt Knowledge Processing Repository Based on UIMA Iryna - PowerPoint PPT Presentation

Darmstadt Knowledge Processing Repository Based on UIMA Iryna Gurevych, Max Mhlhuser, Christof Mller, Jrgen Steimle, Markus Weimer, Torsten Zesch Ubiquitous Knowledge Processing Group Telecooperation, Computer Science Department

Development of IBM Watson with UIMA DUCC Eddie Epstein eae@apache.org Apache UIMA PMC Member

GATE and UIMA in Language Technology Teaching Graham Wilcock University of Helsinki

Iterative Learning of Relation Patterns for Market Analysis with UIMA Sebastian Blohm , Jrgen

Advanced GATE Embedded Additional material: UIMA/GATE integration Fifth GATE Training Course

Processing Dialogue-Based Data in the UIMA Framework Milan Gnjatovi , Manuela Kunze, Dietmar

An UIMA-based Tool Suite for Semantic Text Processing Katrin Tomanek, Ekaterina Buyko, Udo Hahn

Knowledge-Based Agents knowledge knowledge representation, knowledge base, types of knowledge

Three-nucleon forces and exotic nuclei Javier Menndez Institut fr Kernphysik (TU Darmstadt)

UIMA: Unstructured Information Management Architecture Alessandro Moschitti Department of

UIMA-based Annotation Type System for a Text Mining Architecture Udo Hahn, Ekaterina Buyko ,

Advanced GATE Embedded Track II, Module 8 Second GATE Training Course May 2010 Advanced GATE

Plan for today Knowledge-based systems 1 Explicit knowledge Knowledge Representation Inferred

Plan for today Knowledge-based systems 1 Tacit knowledge Knowledge Representation Inferred

Limited Use Repository Updates Citizens Coordination Council April 18, 2018 Craig Cameron U.S.

Repository (IDR) Dr. Chris Harle Becky Liao Integrated Data Repository (IDR) Mar. 3, 2020

Status of the Repository at Status of the Repository at Yucca Mountain Presented to: DOE-EM

Information Retrieval Evaluation (COSC 488) Nazli Goharian nazli@cs.georgetown.edu @ Goharian,

Utilizing Knowledge Bases for Text Retrieval: A Wishlist for Text Retrieval: A Wishlist

Digital preservation at Wellcome Alex Chan ~ a.chan@wellcome.ac.uk ~ they/them Senior

XML Out-Of-Band Data Retrieval Timur Yunusov Alexey

Natural Language Processing with Deep Learning Neural Information Retrieval Navid Rekab-Saz

Heterogenous Private Information Retrieval Hamid Mozaffari, Amir Houmansadr University of

How to Read Paintings: Semantic Art Understanding with Multi-Modal Retrieval Noa Garcia &amp;

Introduction to Information Retrieval http://informationretrieval.org IIR 1: Boolean Retrieval

Sambuz

Useful Links

Newsletter

Mail Us

How to Read Paintings: Semantic Art Understanding with Multi-Modal Retrieval Noa Garcia &