The TEI Workflow: from design to dissemination
Lou Burnard Consulting

1. Organization 2. Conservation 3. Dissemination 4. Data Modelling 5. Implementation

Stages in managing a digital project
1. organisation: in which we get the funding, decide what we're trying to do, and recruit someone to do it
2. conservation: in which we think about how we will ensure our work is not lost when the money runs out
3. dissemination: in which we consider how to make our work available to people
4. data modelling: in which we argue about the structure and essence of our materials
5. implementation: in which we actually build something
The TEI has most to contribute to aspects 4 and 5.

An (imaginary) case study
The Virgolos Archive holds one of the largest and most varied imaginary collections of historic picture postcards in the world. Its founder, recently-deceased eccentric Belgian millionaire Marcel Virgolos, left substantial funding in his will for the digitization and dissemination of the archive. Proposals are now invited ...

1. Organization
You will need to draw up a persuasive project plan...
- Does your institution provide a research support officer?
- What will be done when and by whom?
- What deliverables are expected at each stage of the project?
- What dependencies are there between the different stages of the project plan?
- What will you do if targets are not achieved?
- A Gantt chart might help
You will need to select suppliers...
- Digitization in house or by a vendor?
- Get advice from professionals: not all digitization is the same
- E.g. (in UK) http://www.jiscdigitalmedia.ac.uk/digitisation
- TEI Consortium members may get special rates; see http://www.tei-c.org/AccessTEI/
- How will each stage be validated?

Organizing Virgolos: shopping list
- Make a representative sample from the archive
- Get quotes from prospective digitization suppliers
- Experiment with transcribers
- Calculate digitization/transcription workflow
- Test workflow
- Revise workflow till it works...
- Award contracts

2. Conservation
This may seem obvious, but...
- Digital media are fragile and must be properly maintained
- This applies (obviously) to their physical storage: is the Cloud going to last forever?
- Think also about the format of your data: can it be migrated without effort?
- Take steps to preserve your data: it is far more valuable than the interface you provide to it
- Again there are specialist national and international agencies out there ready to help you: e.g. the Digital Preservation Coalition... but start with your institutional librarian!

Conserving Virgolos
- Our digital data is all in standard formats (TIFF, PNG, TEI-XML...)
- Our TEI-XML is formally documented in an ODD
- We control dissemination via our own server
- We are negotiating a deposit arrangement with a national or institutional library

3. Dissemination
What will people want to do with our digital archive?
1. Browse it on the web
2. Extract (parts of) it as an e-book
3. Analyse its linguistic content
4. Search by sender, recipient, location, date, time... topics represented visually, or discussed
5. Visualise patterns in distribution of data or content
6. Analyse its data content

And what kind of people will they be?
- are there any legal and IPR issues to resolve?
- is this a project for the general public or for specialists only?
- what can we do to address concerns about linguistic or cultural sensitivities?
- how about accessibility issues?

The Virgolos Digital Archive: a public resource
Our audience is potentially large and diverse:
- cultural heritage and tourism
- geographers and social historians
- historical linguists
- bibliographers and librarians
- the general public
We will make our digital versions available under a Creative Commons licence
We will set up an outreach programme
- social media for promotional activity
- academic publications in specialist journals
- attractive and easy to use website
And we will welcome other agencies trying to integrate our resources with theirs
Some good role models: Gallica, Old Bailey

Access methods
- Print: Produce high quality printed readable version of selected items
- Display: Render TEI XML in a web browser
- Index: Provide human-readable entry points into the text
- Expose the data: Provide machine-readable entry points into the text
- Transform: Turn the TEI XML into (eg) HTML5 for an eBook
And how about making the TEI XML source available too?

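The "Transform" access method can be illustrated with a minimal sketch in Python, using only the standard library. The sample document, its title and its message text are invented for illustration, and tei_to_html is a hypothetical helper name; a real project would more likely rely on XSLT stylesheets, so treat this as one possible approach rather than the project's actual pipeline.

```python
import xml.etree.ElementTree as ET
from html import escape

# The TEI namespace, bound to a prefix for use in search paths.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# Invented sample: not a record from any real archive.
SAMPLE_TEI = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt><title>Postcard 0001 (invented sample)</title></titleStmt>
      <publicationStmt><p>Unpublished sample</p></publicationStmt>
      <sourceDesc><p>Invented for illustration</p></sourceDesc>
    </fileDesc>
  </teiHeader>
  <text>
    <body>
      <div type="message">
        <p>Weather fine, wish you were here.</p>
        <p>Fish &amp; chips tonight.</p>
      </div>
    </body>
  </text>
</TEI>"""


def tei_to_html(tei_source: str) -> str:
    """Turn the title and body paragraphs of a TEI document into an HTML5 page."""
    root = ET.fromstring(tei_source)
    title = root.findtext(
        ".//tei:titleStmt/tei:title", default="Untitled", namespaces=TEI_NS
    )
    lines = []
    # Only paragraphs inside <body> are rendered; header paragraphs are skipped.
    for paragraph in root.findall(".//tei:body//tei:p", TEI_NS):
        # itertext() flattens any inline markup inside the paragraph.
        text = "".join(paragraph.itertext()).strip()
        lines.append("    <p>" + escape(text) + "</p>")
    return "\n".join(
        [
            "<!DOCTYPE html>",
            '<html lang="en">',
            '  <head><meta charset="utf-8"><title>' + escape(title) + "</title></head>",
            "  <body>",
            "    <h1>" + escape(title) + "</h1>",
            *lines,
            "  </body>",
            "</html>",
        ]
    )


html_page = tei_to_html(SAMPLE_TEI)
print(html_page)
```

The same parsed tree could feed the other access methods in the list, for example by writing an index of titles instead of a full page.
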
4. Data Modelling
"Conceptual analysis is the work of philosophers, lawyers, lexicographers, systems analysts and database administrators." [Sowa, 1984]
- Information (concept) modelling is a necessary preliminary of data modelling
- Several formal methods have been developed to assist in the task of data modelling
- Their application should always be informed by domain-specific knowledge
- In other words, the job should not be left to the information scientists!

A traditional data analysis
Seeks to identify...
- the "objects of interest"
- their attributes or properties
- relationships amongst those properties
- processes and anticipated processing of those objects

For example...

Is the TEI data model suitable for our postcards?
TEI out of the box is designed to work with traditionally organised books and manuscripts. But suppose we want to work on a slightly different kind of object... a postcard collection, or a monumental inscription?
How do we make a TEI schema to handle hundreds or thousands of things like this:

A postcard (front)

A postcard (back)

Another postcard
Not all cards are organized the same way...

Which are the most significant components of these texts?
- the picture
- the postmark
- the printed part
- the message(s) written on them
- the addressee(s)
- subject matter of the picture
- information about the publishing, printing, circulation of the card or other metadata...

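One way to make the components listed above concrete is to write them down as a small data model before committing to any markup. The sketch below is an assumption about how such a model might look, not the project's actual design: the class names, field names, identifiers and sample values are all invented for illustration.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Message:
    """A handwritten message, optionally directed at a named addressee."""
    text: str
    addressee: Optional[str] = None


@dataclass
class Postcard:
    """The "object of interest", with its attributes and related messages."""
    identifier: str
    picture_subject: str
    printed_text: str = ""
    postmark: Optional[str] = None  # not every card has been posted
    messages: list[Message] = field(default_factory=list)
    publication_info: dict[str, str] = field(default_factory=dict)

    def addressees(self) -> list[str]:
        """Return the distinct addressees named on this card, sorted."""
        return sorted({m.addressee for m in self.messages if m.addressee})


def cards_addressed_to(collection: list[Postcard], name: str) -> list[Postcard]:
    """An anticipated process: find every card sent to a given person."""
    return [card for card in collection if name in card.addressees()]


# Invented sample records, for illustration only.
cards = [
    Postcard(
        identifier="vgl-0001",
        picture_subject="seafront promenade",
        printed_text="Greetings from the coast",
        postmark="illegible",
        messages=[Message("Arrived safely.", addressee="Mme A. Example")],
        publication_info={"printer": "unknown"},
    ),
    Postcard(identifier="vgl-0002", picture_subject="cathedral"),
]

matches = cards_addressed_to(cards, "Mme A. Example")
print([card.identifier for card in matches])
```

Optional fields and empty defaults reflect the observation that not all cards are organized the same way: a card may lack a postmark, a message, or any publication details.
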