Searching Sanskrit Texts in SARIT Patrick McAllister June 6, 2017 This work is licensed under the Creative Commons Attribution-ShareAlike 4.0 International License . To view a copy of this license, visit http://creativecommons.org/licenses/by-sa/4.0/ or send a letter to Creative Commons, PO Box 1866, Mountain View, CA 94042, USA.
Searching Sanskrit . . . . . . . . . . . . . . . . . . . . Searching Sanskrit Texts in SARIT SARIT’s search facilities Patrick McAllister Institute for the Cultural and Intellectual History of Asia (IKGA) . . Texts in SARIT . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . . . . . . 2017-05-23
Searching Sanskrit . . . . . . . . . . . . . . . . . . . . Outline Introduction SARIT’s full text search Indexing SARIT’s texts . . Texts in SARIT . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . . . . . . Conclusions
Searching Sanskrit . . . . . . . . . . . . Texts in SARIT . . . . . . . . The TEI Guidelines and SARIT humanities Not a technology to create/edit/display TEI . . . . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . . . . . documents ▶ Toolset for the analysis of many texts common in
Searching Sanskrit . . . . . . . . . . . . Texts in SARIT . . . . . . . . The TEI Guidelines and SARIT humanities . . . . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . . . . . documents ▶ Toolset for the analysis of many texts common in ▶ Not a technology to create/edit/display TEI
Searching Sanskrit . . . . . . . . . . . . . Texts in SARIT . . . . . . What might make or break SARIT? Various editorial systems possible, e.g.: Series-, area-, text-editor Open source software development models 2. basic toolset for dealing with TEI encoded texts . . . . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . . . . . . Toolset [!=] Finished application 1. clear and simple way to add texts
Searching Sanskrit . . . . . . . . . . . . Texts in SARIT . . . . . . . What might make or break SARIT? Series-, area-, text-editor Open source software development models 2. basic toolset for dealing with TEI encoded texts . . . . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . . . . . . Toolset [!=] Finished application 1. clear and simple way to add texts ▶ Various editorial systems possible, e.g.:
Searching Sanskrit . . . . . . . . . . . . . . . . . . . . . What might make or break SARIT? Open source software development models 2. basic toolset for dealing with TEI encoded texts Texts in SARIT . . . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . . . . . Toolset [!=] Finished application 1. clear and simple way to add texts ▶ Various editorial systems possible, e.g.: ▶ Series-, area-, text-editor
Searching Sanskrit . . . . . . . . . . . . . . . . . . . . . What might make or break SARIT? 2. basic toolset for dealing with TEI encoded texts Texts in SARIT . . . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . . . . . Toolset [!=] Finished application 1. clear and simple way to add texts ▶ Various editorial systems possible, e.g.: ▶ Series-, area-, text-editor ▶ Open source software development models
Searching Sanskrit . . . . . . . . . . . . . . . . . . . . . What might make or break SARIT? 2. basic toolset for dealing with TEI encoded texts Texts in SARIT . . . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . . . . . Toolset [!=] Finished application 1. clear and simple way to add texts ▶ Various editorial systems possible, e.g.: ▶ Series-, area-, text-editor ▶ Open source software development models
Searching Sanskrit . . . . . . . . . . . . . . . . . . . . . What might make or break SARIT? 2. basic toolset for dealing with TEI encoded texts Texts in SARIT . . . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . . . . . 1. clear and simple way to add texts ▶ Various editorial systems possible, e.g.: ▶ Series-, area-, text-editor ▶ Open source software development models ▶ Toolset [!=] Finished application
Searching Sanskrit . . . . . . . . . . . . . . . . . Tools Burghart 2016: Introduction: The various mechanisms ofgered by the TEI schema and Guidelines for the encoding of crit- ical editions sufger from one major shortcoming: the lack of user-friendly tools allowing philolo- gists and their readers to display and process TEI-encoded editions. After witnessing–and per- sonally experiencing–this frustration, I decided to develop an application especially dedicated to supporting philologists in their work, and helping . . Texts in SARIT . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . . . . . . . . . them to fully benefjt from their encoding work.
Searching Sanskrit . . Texts in SARIT . . . . . . . . . . . . . . . . . . . Grepping? . . . . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . . . . (when (string-match ”limited.*utility” ”limited inutility”) ”A match!”) A match! (when (not (string-match ”limited.*utility” ”the utility is limited for searching XML documents”)) ֒ → ”No match!”) No match!
Searching Sanskrit . . . . . . . . . . . . . . . . . . . . . . General indexing? . . Texts in SARIT . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . . . . Figure: Recoll search for “( liṅga OR hetu ) AND *numān*”
( https://github.com/sarit/sarit-data ) ( https://github.com/sarit/sarit-pm ), which is what 2. A dedicated XML database ( http://exist-db.org/ ) Searching Sanskrit . . . . . . . . . . . . . Texts in SARIT . . . . . SARIT’s framework 3. Two applications that ‘speak’ to the database: 3.1 Loader/manager of SARIT etext library 3.2 Interface to the texts . . . . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . . . . . . . currently allows you to read and search the texts. 1. The SARIT texts ( https://github.com/sarit/sarit-data )
( https://github.com/sarit/sarit-data ) ( https://github.com/sarit/sarit-pm ), which is what Searching Sanskrit . . . . . . . . . . . . . Texts in SARIT . . . . . SARIT’s framework 3. Two applications that ‘speak’ to the database: 3.1 Loader/manager of SARIT etext library 3.2 Interface to the texts . . . . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . currently allows you to read and search the texts. . . . . . . 1. The SARIT texts ( https://github.com/sarit/sarit-data ) 2. A dedicated XML database ( http://exist-db.org/ )
( https://github.com/sarit/sarit-data ) ( https://github.com/sarit/sarit-pm ), which is what Searching Sanskrit . . . . . . . . . . . . . Texts in SARIT . . . . . SARIT’s framework 3. Two applications that ‘speak’ to the database: 3.1 Loader/manager of SARIT etext library 3.2 Interface to the texts . . . . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . . currently allows you to read and search the texts. . . . . . . 1. The SARIT texts ( https://github.com/sarit/sarit-data ) 2. A dedicated XML database ( http://exist-db.org/ )
( https://github.com/sarit/sarit-pm ), which is what Searching Sanskrit . . . . . . . . . . . . . . . . . . . . SARIT’s framework 3. Two applications that ‘speak’ to the database: 3.1 Loader/manager of SARIT etext library 3.2 Interface to the texts Texts in SARIT . . . Patrick McAllister Introduction SARIT’s full text search Indexing SARIT’s texts Conclusions . . . . . . . . . . . currently allows you to read and search the texts. . . . . . . 1. The SARIT texts ( https://github.com/sarit/sarit-data ) 2. A dedicated XML database ( http://exist-db.org/ ) ( https://github.com/sarit/sarit-data )
Recommend
More recommend