Background Framework Implementation Demo Evaluation Conclusion Extraction of Author’s Definitions Using Indexed Reference Identification Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory 18 September 2009, RANLP 2009 Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Outline 1 Background 2 Framework 3 Implementation 4 Demo 5 Evaluation 6 Conclusion Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion 1 Background 2 Framework 3 Implementation 4 Demo 5 Evaluation 6 Conclusion Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Studies on definition in the LaLIC laboratory (E. Cartier, 2004; T. Hacene 2008; C. Teissedre 2008) Implementation: several tools for segmentation and semantic annotation: SegaTex: G. Mourad 2001, B. Djioua 2006; Excom annotation platform: B. Djioua and J.-P. Descles 2006, M. Alrahabi 2008. Work in the field of Bibliosemantics (M. Bertin 2006-2009): identification and annotation of relations between authors based on bibliographic links. Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion 1 Background 2 Framework 3 Implementation 4 Demo 5 Evaluation 6 Conclusion Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Our aim is to establish links between authors by using indexed references in the text, and then identify the definitions and relate them to the authors. The method that we propose is based on the indexed references which allow us, in the case when we identify a definition in the research scope determined by the segmentation, to link this definition to the author cited in the text. Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Two relations: 1 relation between the definiendum , what is to be defined, and the definiens , what defines it. 2 relation between the definition itself and the author. We can associate a definition to an author. In this case we can talk about signed definitions . The bibliographic links give us a starting context or scope for the research of definitions. Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Linguistic Study of the Definition We have used the semantic map proposed by T. Hacene (2008). In the implementation we have used a part of this semantic map according to our purpose. The linguistic study of our corpus has led us to a better understanding of the distinction between a definition and a definatory characteristic , which has been taken in consideration for the construction of our linguistic resources. We define a definatory characteristic as a sentence that gives only some essential properties of the defined object. Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Linguistic Study of the Definition We have distinguished three sub-categories of the definatory characteristics: 1 identification 2 determined categorization 3 pseudo-definition Two sub-categories of the definition: 1 general definitions 2 axiomatic definitions Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion (Taouise Hacene, 2008) Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion 1 Background 2 Framework 3 Implementation 4 Demo 5 Evaluation 6 Conclusion Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Processing Overview Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Segmentation Segmentation tools: SegaTex (G. Mourad, 2001; B. Djioua 2006), Excom-2 (M. Alrahabi, 2008) Segmentation into sentences, paragraphs, sections. Segmentation rules based on the punctuation and capitalisation. Different languages (French, English, Bulgarian, Arabic, ... ) Input: text files Output: DocBook format, UTF8 encoding Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Segmentation Output Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Processing Overview Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Processing Overview Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Indexed Reference Identification - 1 Norms: ISO-690, ISO 690-2, AFNOR NF Z 44-005, AFNOR NF Z 44-005-2 Examples: (Hoc, 1990a), (Thom, 1970), (Dingwall et al., 1995; Hartmann and G¨ orlich, 1995), [24], Pickett-Heaps et al. (1990), (like other authors e.g. Raven, 1983), (Cwuc and SPRAGUE 1989), (18, 53, 56) Finite state automata and identification of known names entities Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Indexed Reference Identification - 2 Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Annotation Automatic annotation through exploration of the context: The Contextual Exploration Method (Descl´ es, 1997, 2006) Based on linguistic resources, which are manually constructed Resources: surface linguistic markers (indicators and clues) and contextual exploration rules Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Annotation Excom annotation system (B. Djioua, 2006; M. Alrahabi, 2008). Available online: www.excom.fr Input: segmented XML files Output: annotated XML files Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Contextual Exploration Rule: Example Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion Annotated sentence: Example Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion What can we do with the annotations? Information retrieval of definitions. Identify the definitions of a given notion. Sometimes the same notion has several different definitions, esp. in humanitarian sciences. For a given keyword, identify the domains in which it is used. Find the definitions related to an author. Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion System Overview: Interface and Exploitation Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion 1 Background 2 Framework 3 Implementation 4 Demo 5 Evaluation 6 Conclusion Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Background Framework Implementation Demo Evaluation Conclusion 1 Background 2 Framework 3 Implementation 4 Demo 5 Evaluation 6 Conclusion Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009
Recommend
More recommend