Using a Hidden Markov Model in Semi-Automatic Indexing of Historical Handwritten Records
Thomas Packer, Oliver Nina, Ilya Raykhel
Computer Science, Brigham Young University
The Challenge: Indexing Handwriting
• Millions of historical documents.
• Many hours of manual indexing.
• Years to complete using hundreds of thousands of volunteers.
• Previous transcriptions not fully leveraged.
FamilySearch Indexing Tool
A Solution: On-Line Machine Learning
• Holistic handwritten word recognition using a Hidden Markov Model (HMM), based on Lavrenko et al. (2004).
• The HMM selects the word sequence that maximizes the joint probability of:
  o a word-feature probability model, and
  o a word-transition probability model.
• The word-feature model predicts a word from its visual features.
• The word-transition model predicts a word from its neighboring word. (See the formula sketch below.)
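As a sketch of this formulation (the standard HMM factorization used in holistic word recognition following Lavrenko et al., 2004; the notation here is ours, not from the slides), the decoder chooses

\[
\hat{w}_{1:n} \;=\; \arg\max_{w_1,\dots,w_n} \prod_{i=1}^{n} P(f_i \mid w_i)\, P(w_i \mid w_{i-1}),
\]

where $P(f_i \mid w_i)$ is the word-feature (observation) model, $P(w_i \mid w_{i-1})$ is the word-transition model, and $P(w_1 \mid w_0)$ is taken as a start distribution.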
The Process
• Census Images + Transcriptions → Labeled Examples
• Word Rectangles → Feature Vectors
• Training Examples → Learner → Model
• Model + Test Examples → Classifier → Results
Census Images
• 3 US Census images
• Same census taker
• Preprocessing: Kittler's algorithm to threshold (binarize) the images (see the sketch below)
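The slide names Kittler's thresholding without details; below is a minimal NumPy sketch of the Kittler-Illingworth minimum-error criterion (the function name and array conventions are ours, not from the slides):

```python
import numpy as np

def kittler_threshold(gray):
    """Minimum-error thresholding (Kittler & Illingworth, 1986).

    A minimal sketch, assuming `gray` is a 2-D uint8 grayscale array;
    the paper's exact preprocessing pipeline is not given on the slide.
    """
    hist = np.bincount(gray.ravel(), minlength=256).astype(float)
    p = hist / hist.sum()                      # normalized histogram
    levels = np.arange(256, dtype=float)

    best_t, best_j = 0, np.inf
    for t in range(1, 255):
        p1, p2 = p[:t].sum(), p[t:].sum()
        if p1 <= 0 or p2 <= 0:
            continue
        m1 = (levels[:t] * p[:t]).sum() / p1   # class means
        m2 = (levels[t:] * p[t:]).sum() / p2
        v1 = (((levels[:t] - m1) ** 2) * p[:t]).sum() / p1  # class variances
        v2 = (((levels[t:] - m2) ** 2) * p[t:]).sum() / p2
        if v1 <= 0 or v2 <= 0:
            continue
        # Minimum-error criterion J(t); smaller is better.
        j = 1 + 2 * (p1 * 0.5 * np.log(v1) + p2 * 0.5 * np.log(v2)) \
              - 2 * (p1 * np.log(p1) + p2 * np.log(p2))
        if j < best_j:
            best_j, best_t = j, t
    return best_t

# Usage: binary = (gray >= kittler_threshold(gray)).astype(np.uint8)
```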
Extracted Fields
• Manually copied bounding rectangles
• 3 columns (number of distinct values in parentheses):
  1. Relationship to Head (14)
  2. Sex (2)
  3. Marital Status (4)
• 123 rows total
• N-fold cross-validation with N = 24 (about 5 rows per test fold; see the sketch below)
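A minimal sketch of that split: 123 labeled rows divided into 24 folds, so each test fold holds roughly 5 rows (shuffling and fold sizing are our assumptions; the slide does not specify them):

```python
import numpy as np

def cross_validation_folds(n_rows=123, n_folds=24, seed=0):
    """Yield (train, test) row-index lists for each of the 24 folds."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(n_rows)
    for fold in np.array_split(order, n_folds):
        test = set(fold.tolist())
        train = [i for i in range(n_rows) if i not in test]
        yield train, sorted(test)

for train_idx, test_idx in cross_validation_folds():
    pass  # train on `train_idx` rows, evaluate on `test_idx` rows
```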
Examples to Feature Vectors
25 numeric features extracted per word image:
o Scalar features: height (h), width (w), aspect ratio (w / h), area (w * h)
o Profile features: projection profile, upper/lower word profiles, each compressed to its 7 lowest scalar values from the DFT (4 scalars + 3 profiles × 7 = 25; see the sketch below)
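Under our reading of that feature list (each of the three column-wise profiles summarized by the magnitudes of its 7 lowest-frequency DFT coefficients), a minimal extraction sketch:

```python
import numpy as np

def word_features(binary):
    """Turn a binarized word image (ink = 1) into a 25-feature vector.

    A sketch under our assumptions: 4 scalar features plus 3 profiles
    (projection, upper, lower), each reduced to its 7 lowest-frequency
    DFT magnitudes. Assumes word width >= 12 px so 7 coefficients exist.
    """
    h, w = binary.shape
    scalars = [h, w, w / h, w * h]

    projection = binary.sum(axis=0).astype(float)  # ink pixels per column
    ink_rows = [np.nonzero(binary[:, c])[0] for c in range(w)]
    # Topmost / bottommost ink row per column; empty columns fall back
    # to the image border (a choice of ours, not from the slides).
    upper = np.array([r[0] if r.size else h for r in ink_rows], float)
    lower = np.array([r[-1] if r.size else 0 for r in ink_rows], float)

    def dft7(profile):
        # 7 lowest-frequency DFT magnitudes as a fixed-length summary.
        return np.abs(np.fft.rfft(profile))[:7]

    return np.concatenate([scalars, dft7(projection), dft7(upper), dft7(lower)])
```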
HMM and Transition Probability Model
• Probability model:
  o Hidden Markov Model
  o State transition probabilities (see the sketch below)
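A minimal sketch of estimating the transition probabilities from previously transcribed rows; counting bigrams between adjacent words and add-alpha smoothing are our assumptions, as the slides do not state the estimation scheme:

```python
from collections import Counter, defaultdict

def transition_probabilities(transcribed_rows, alpha=1.0):
    """Estimate P(word_i | word_{i-1}) from transcribed word sequences."""
    vocab = {w for row in transcribed_rows for w in row}
    counts = defaultdict(Counter)
    for row in transcribed_rows:
        for prev, cur in zip(row, row[1:]):
            counts[prev][cur] += 1
    probs = {}
    for prev in vocab:
        # Add-alpha smoothing keeps every transition strictly positive.
        total = sum(counts[prev].values()) + alpha * len(vocab)
        probs[prev] = {cur: (counts[prev][cur] + alpha) / total
                       for cur in vocab}
    return probs
```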
Observation Probability Model
o Multivariate normal distribution over the feature vector $f$ of a word class $w$:
  $P(f \mid w) = \dfrac{1}{(2\pi)^{d/2}\,|\Sigma_w|^{1/2}} \exp\!\left(-\tfrac{1}{2}(f-\mu_w)^\top \Sigma_w^{-1}(f-\mu_w)\right)$
  with mean $\mu_w$, covariance $\Sigma_w$, and $d = 25$ features.
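Putting the two models together, a minimal decoding sketch: per-word Gaussian parameters fitted from the labeled feature vectors, then standard Viterbi decoding over a row. The ridge regularization and the uniform start distribution are our assumptions, not from the slides:

```python
import math
import numpy as np

def fit_gaussians(features_by_word, ridge=1e-3):
    """Fit one multivariate normal per word class (mean, inverse cov, log|cov|)."""
    params = {}
    for word, X in features_by_word.items():
        X = np.asarray(X, float)
        mu = X.mean(axis=0)
        # Ridge term keeps the covariance invertible with few examples.
        cov = np.cov(X, rowvar=False, bias=True) + ridge * np.eye(X.shape[1])
        params[word] = (mu, np.linalg.inv(cov), np.linalg.slogdet(cov)[1])
    return params

def log_density(f, mu, cov_inv, logdet):
    """Log of the multivariate normal density given above."""
    d = f - mu
    return -0.5 * (len(f) * np.log(2 * np.pi) + logdet + d @ cov_inv @ d)

def viterbi(feature_seq, words, gauss, trans):
    """Most likely word sequence for one row; trans[prev][cur] = P(cur | prev)."""
    # First position: observation score only (uniform start assumed).
    score = {w: log_density(feature_seq[0], *gauss[w]) for w in words}
    back = []
    for f in feature_seq[1:]:
        new, ptr = {}, {}
        for w in words:
            prev = max(words, key=lambda p: score[p] + math.log(trans[p][w]))
            new[w] = score[prev] + math.log(trans[prev][w]) \
                     + log_density(f, *gauss[w])
            ptr[w] = prev
        score, back = new, back + [ptr]
    last = max(words, key=score.get)
    path = [last]
    for ptr in reversed(back):  # follow back-pointers to recover the path
        path.append(ptr[path[-1]])
    return list(reversed(path))
```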
Accuracies with and without HMM
Accuracies for Separate Columns with and without HMM
Accuracies of HMM for Varying Numbers of Training Examples
Accuracies of “Relationship to Head” for Varying Numbers of Examples
Conclusions and Future Work
• 10% correction rate for the chosen columns after one page.
• Future work:
  o Measure indexing time.
  o Update models in real time.
  o Columns with larger vocabularies.
  o More image preprocessing.
  o More visual features.
  o More dependencies among words (in different rows).
  o More training data.
Questions?