Creating a Digital Microfilm Library Dan R. Olsen Jr Computer - PowerPoint PPT Presentation

Creating a Digital Microfilm Library Dan R. Olsen Jr Computer Science Dept Brigham Young University

The meaning of family history

Getting in touch with your ancestors

Family history out of the library and into the home

How big is the problem? • 2.5 million films and growing • 1000 images per film = 2.5 billion images • 600 KB per image = 1,500,000 Gigabytes 1,500,000 Gigabytes 25,000 laptop hard disks :-)

What will it cost to store it? • 1,500,000 GB • $30 per GB - real servers not PCs • Total library cost – Today - $45,000,000 – 5 years - $4,500,000 $450,000 – 10 years - $450,000

Producing the Image Library • Scanning rate - 100 frames per minute (optimistic) • Images to scan - 2.5 Billion • Scanner time per year - 2000 hours • To complete the library –208 scanner years 208 scanner years

Cost of production • 20 scanners = $1,000,000 –10 years to finish – replacement costs $1,000,000 • Worker costs = 10x$100,000 = $1,000,000 • 10 year plan – GB per year - 150,000 $3,000,000 – Cost - $3,000,000

Creating the Digital Microfilm Library • 10 years • $8,000,000

Delivering images to the home • Image size = 500K • Dialup data rate = 5K Bytes / sec – (on a good day) • Time per image = 1.6 minutes • You cannot scan digital images the way you scan through microfilm • The library must be indexed at the image level

Extracting data from images • Extraction 12/hour – Assumes one record per image • hours to extract the entire library – 208 million hours – cost at $5/hour = $1 billion • 20,000 extractors - 100 hours per year – 104 years to complete • 2 million extractors to complete in 10 years

To build the library in 10 years • $5,000,000 to store • $3,000,000 to scan • must be indexed • $1,000,000,000 to extract using current approach • Need a new indexing plan

Extract for index • Ordered collections • by name • by date – Parish records – Death records – Main archive group sheets • Unordered collections – Wills – Deeds – Other records where image order is not helpful

Ordered collections • Extract top date or name from each image – 100/hour – 12 years to complete - 20,000 volunteers • Sample extract every 10 images then interpolate – 1000/hour – 1.2 years to complete - 20,000 volunteers

Unordered collections • Extract only essential name, date and place info • Let the image carry most of the data – Eliminate interpretation errors in extraction • Extractors map extracted data to image fragments • Auto extraction methods - OCR and Handwriting – Use extraction as a training set for new algorithms

What library should we build?

Lessons from the past • Microfilm library – Make the raw data available in a uniform way • WWW/GEDCOM – Make the library open – Base collection • Tools on top but not in place • Support both people and software • Guaranteed archiving –Digital Stone Digital Stone • Guaranteed naming

The vision • Out of the library into the home • Scan it all in 10 years – $8,000,000 • Index not extract – High level index (beat microfilm) – Deeper indices on important collections • Open library architecture – Raw data and raw indexes publicly available • Library of last resort - Digital Stone – Guaranteed archive, guaranteed naming

Family history out of the library and into the home

Creating a Digital Microfilm Library Dan R. Olsen Jr Computer - PowerPoint PPT Presentation

Creating a Digital Microfilm Library Dan R. Olsen Jr Computer Science Dept Brigham Young University The meaning of family history Getting in touch with your ancestors Family history out of the library and into the home How big is the

Digital Microfilm Frame Detection Christopher Nelson Heath Nielson & Shane Hathaway The

FamilySearch Scanning (Scanstone) An Automated Exposure Method For Scanning Microfilm Heath

Results of Just-In-Time Browsing for Digital Microfilm Douglas J. Kennard and William A.

Library Department FY 2021 Library Department FY 2021 Library Organization Chart Springfield

Presentation 7.3b: Multiple linear regression Murray Logan 09 Aug 2016 library (GGally) library

Details of the Dense Gas Containment microfilm membranes supported on tungsten wires/mesh with

AAPoly Library Orientation Library Contacts Phone : 61 3 8610 4132 Email : library@aapoly.edu.au

Digital Library Ken Hermens Digital Library: Part of a Balanced Assessment System What is the

The Homeschooling - Library Connection Diane Pamel- Library Director Southworth Library and

Eric Lashley Library Director, Georgetown Public Library (TX) Patrick Lloyd, LMSW Community

Module 4: Creating Data Types and Tables Overview Creating Data Types Creating Tables

King Fahd University of Petroleum & Minerals Deanship of Library Affairs KFUPM Library

2. Digital Data CHAPTER HIGHLIGHTS Elements of digital media. Digital codes. Di it l d

Library RMR Project Renovate, Modernize, Reorganize Library Serves Patrons of Every Age This

PopUp Library @ Senior Center Whats a PopUp Library? Library services somewhere that is

What do you do with the temporarily placed programs? The problem is more widespread than just

WELCOME to ACT Give Back College PREP NIGHT! AGENDA: ACT PREP OPTIONS HOW TO REGISTER FOR

Why Admissions Tests Matter PSAT SCORE INTERPRETATION 1 12/22/16 Test Taking Timeline PSAT

Q3 2014 Earnings Webcast Presentation October 23, 2014 Safe Harbor Statement Note: All

Company Presentation September 2018 | | Legal Disclaimer This presentation contains

Caring for an Aging Parent Be Prepared Caring For An Aging Parent Caring for an Aging Parent

Mentoring Through Qualitative Discussion Training for Child W elfare Supervisors FL Departm ent

Protecting GRUs Customers With Fuel Diversity, Renewable Energy, And Power Purchase Contract

I-65/I 65/I-70 N 70 Nor orth th Split Split Pr Project oject Public Open House October 10,

Sambuz

Useful Links

Newsletter

Mail Us

Creating a Digital Microfilm Library Dan R. Olsen Jr Computer - PowerPoint PPT Presentation

Creating a Digital Microfilm Library Dan R. Olsen Jr Computer Science Dept Brigham Young University The meaning of family history Getting in touch with your ancestors Family history out of the library and into the home How big is the

Digital Microfilm Frame Detection Christopher Nelson Heath Nielson &amp; Shane Hathaway The

FamilySearch Scanning (Scanstone) An Automated Exposure Method For Scanning Microfilm Heath

Results of Just-In-Time Browsing for Digital Microfilm Douglas J. Kennard and William A.

Library Department FY 2021 Library Department FY 2021 Library Organization Chart Springfield

Presentation 7.3b: Multiple linear regression Murray Logan 09 Aug 2016 library (GGally) library

Details of the Dense Gas Containment microfilm membranes supported on tungsten wires/mesh with

AAPoly Library Orientation Library Contacts Phone : 61 3 8610 4132 Email : library@aapoly.edu.au

Digital Library Ken Hermens Digital Library: Part of a Balanced Assessment System What is the

The Homeschooling - Library Connection Diane Pamel- Library Director Southworth Library and

Eric Lashley Library Director, Georgetown Public Library (TX) Patrick Lloyd, LMSW Community

Module 4: Creating Data Types and Tables Overview Creating Data Types Creating Tables

King Fahd University of Petroleum &amp; Minerals Deanship of Library Affairs KFUPM Library

2. Digital Data CHAPTER HIGHLIGHTS Elements of digital media. Digital codes. Di it l d

Library RMR Project Renovate, Modernize, Reorganize Library Serves Patrons of Every Age This

PopUp Library @ Senior Center Whats a PopUp Library? Library services somewhere that is

What do you do with the temporarily placed programs? The problem is more widespread than just

WELCOME to ACT Give Back College PREP NIGHT! AGENDA: ACT PREP OPTIONS HOW TO REGISTER FOR

Why Admissions Tests Matter PSAT SCORE INTERPRETATION 1 12/22/16 Test Taking Timeline PSAT

Q3 2014 Earnings Webcast Presentation October 23, 2014 Safe Harbor Statement Note: All

Company Presentation September 2018 | | Legal Disclaimer This presentation contains

Caring for an Aging Parent Be Prepared Caring For An Aging Parent Caring for an Aging Parent

Mentoring Through Qualitative Discussion Training for Child W elfare Supervisors FL Departm ent

Protecting GRUs Customers With Fuel Diversity, Renewable Energy, And Power Purchase Contract

I-65/I 65/I-70 N 70 Nor orth th Split Split Pr Project oject Public Open House October 10,

Sambuz

Useful Links

Newsletter

Mail Us

Digital Microfilm Frame Detection Christopher Nelson Heath Nielson & Shane Hathaway The

King Fahd University of Petroleum & Minerals Deanship of Library Affairs KFUPM Library