  1. Parsimonious HMMs for Offline Handwritten Chinese Text Recognition. Wenchao Wang, Jun Du and Zi-Rui Wang, University of Science and Technology of China. ICFHR 2018, Niagara Falls, USA, Aug. 5-8, 2018

  2. Background
  • Offline handwritten Chinese text recognition (OHCTR) is challenging:
    – No trajectory information in comparison to the online case
    – Large vocabulary of Chinese characters
    – Sequential recognition with the potential segmentation problem
  • Approaches:
    – Oversegmentation approaches: character oversegmentation / classification
    – Segmentation-free approaches:
      – GMM-HMM: Gaussian mixture model – hidden Markov model
      – MDLSTM-RNN: multidimensional LSTM-RNN + CTC
      – DNN-HMM: deep neural network – hidden Markov model

  3. Review of HMM Approach for OHCTR
  • A left-to-right HMM is adopted to represent each Chinese character.
  • The character HMMs are concatenated to model the text line.
  [Figure: the observation sequence of sliding windows over the text-line image is aligned with the sequence of concatenated character HMMs (example characters: 映 反 得 到).]

  4. Review of DNN-HMM Approach for OHCTR
  • Recognition follows the Bayesian framework, with HMM-based character modeling and an output distribution per state.
  • A DNN is used to calculate the state posterior probability.
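
  A minimal sketch of the standard hybrid DNN-HMM formulation these labels refer to (the exact notation on the slide is not recoverable, so the symbols below are assumptions):

  ```latex
  % Bayesian decoding: the character sequence C maximizing the posterior given
  % the sliding-window observation sequence X = x_1, ..., x_T.
  \hat{C} = \arg\max_{C} P(C \mid X) = \arg\max_{C} \; p(X \mid C)\, P(C)
  % Character modeling: p(X | C) is computed by the concatenated character HMMs,
  % summing over state sequences S with transition probabilities a_{s_{t-1} s_t}.
  p(X \mid C) = \sum_{S} \prod_{t=1}^{T} a_{s_{t-1} s_t}\, p(x_t \mid s_t)
  % Output distribution: the DNN provides the state posterior P(s_t | x_t),
  % converted to a scaled likelihood with the state prior P(s_t).
  p(x_t \mid s_t) \propto \frac{P(s_t \mid x_t)}{P(s_t)}
  ```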

  5. Motivation
  • High demand on memory and computation from the DNN output layer
  • Model redundancy due to similarities among different characters
  • Parsimonious HMMs (PHMMs) are proposed to address these two problems
  • A decision-tree-based two-step approach generates the tied-state pool
  [Figure: the 5-state HMMs for characters 冻, 缴 and 练 draw their states from a shared tied-state pool.]

  6. Binary Decision Tree for State Tying
  • The parent set O_1 has a distribution P_1(x); the total log-likelihood of all observations in O_1 under P_1(x) is
    L(O_1) = \sum_{x \in O_1} \log P_1(x)
  • One question splits the parent set into two child sets O_2 and O_3, with O_1 = O_2 \cup O_3.
  • The child set O_2 has a distribution P_2(x); the total log-likelihood of all observations in O_2 under P_2(x) is
    L(O_2) = \sum_{x \in O_2} \log P_2(x)
  • The child set O_3 has a distribution P_3(x); the total log-likelihood of all observations in O_3 under P_3(x) is
    L(O_3) = \sum_{x \in O_3} \log P_3(x)
  • The total increase in set-conditioned log-likelihood of the observations due to the partitioning is
    \Delta L = L(O_2) + L(O_3) - L(O_1)
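
  As a concrete illustration of the split criterion, the sketch below computes L(O) and the gain ΔL in Python, assuming each set is modeled by a single diagonal-covariance Gaussian fit to its own observations; that modeling choice and all function names are illustrative assumptions, not taken from the paper.

  ```python
  import numpy as np

  def set_log_likelihood(obs):
      """Total log-likelihood of a set of observations under a diagonal Gaussian
      fit to the set itself (maximum-likelihood mean and variance)."""
      obs = np.asarray(obs, dtype=float)            # shape: (n_frames, dim)
      mean = obs.mean(axis=0)
      var = obs.var(axis=0) + 1e-6                  # variance floor to avoid log(0)
      d = obs.shape[1]
      log_norm = -0.5 * (d * np.log(2 * np.pi) + np.log(var).sum())
      quad = -0.5 * (((obs - mean) ** 2) / var).sum(axis=1)
      return (log_norm + quad).sum()

  def split_gain(parent_obs, yes_obs, no_obs):
      """Delta L = L(O2) + L(O3) - L(O1) for one candidate partition of the parent set."""
      return (set_log_likelihood(yes_obs) + set_log_likelihood(no_obs)
              - set_log_likelihood(parent_obs))
  ```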

  7. Step 1: Clustering Characters with Decision Tree
  • All states with the same HMM position are initially grouped together at the root node.
  • Each node is then recursively partitioned with the question set so as to maximize the increase in expected log-likelihood.
  • All states in the leaves of the decision tree are tied together.
  [Figure: a tree fragment for tying the first state of each HMM; internal nodes ask questions such as "Is the character in {愧 怀 怳 忧 快 忱 恍 恢 悦 惋 惯}?" and each leaf node holds one group of tied states.]
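
  A greedy top-down sketch of this step, reusing set_log_likelihood and split_gain from the sketch above; the representation of states as (character, frames) pairs and the character-set questions are illustrative assumptions:

  ```python
  import numpy as np

  def best_question(states, questions):
      """Return (gain, question) for the split that most increases the log-likelihood.
      `states` is a list of (character, frames) pairs; a question is a set of characters."""
      parent = np.concatenate([f for _, f in states])
      best = (-np.inf, None)
      for q in questions:
          yes = [f for ch, f in states if ch in q]
          no = [f for ch, f in states if ch not in q]
          if not yes or not no:
              continue                              # degenerate split, skip
          gain = split_gain(parent, np.concatenate(yes), np.concatenate(no))
          best = max(best, (gain, q), key=lambda t: t[0])
      return best

  def grow_tree(states, questions, min_gain=0.0):
      """Recursively split until no question improves the log-likelihood enough;
      returns the leaves, i.e. the groups of states to be tied together."""
      gain, q = best_question(states, questions)
      if q is None or gain <= min_gain:
          return [states]                           # this node becomes one tied state
      yes = [s for s in states if s[0] in q]
      no = [s for s in states if s[0] not in q]
      return grow_tree(yes, questions, min_gain) + grow_tree(no, questions, min_gain)
  ```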

  8. Step 2: Bottom-up Re-clustering
  • In the second step, the clusters in the leaf nodes obtained in the first step are re-clustered by a bottom-up procedure using sequential greedy optimization.
  • The expected log-likelihood decrease caused by merging every pair of clusters is calculated.
  • A minimum priority queue is maintained so that the two clusters with the minimum log-likelihood decrease are merged into a new cluster.
  [Figure: the decision-tree leaf nodes 1..n feed the procedure: (1) calculate the objective-function decrease for merging each pair of leaf nodes and push the pairs into the queue; (2) while #clusters > N, merge the pair with the minimum decrease into a new cluster; (3) generate the tied-state pool.]
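
  A sketch of the re-clustering loop with a minimum priority queue, again reusing set_log_likelihood from above; the merge cost based on the same single-Gaussian assumption, and all names, are illustrative:

  ```python
  import heapq
  import numpy as np

  def merge_cost(a, b):
      """Log-likelihood decrease when clusters a and b (frame arrays) are merged."""
      merged = np.concatenate([a, b])
      return set_log_likelihood(a) + set_log_likelihood(b) - set_log_likelihood(merged)

  def bottom_up_recluster(leaves, n_target):
      """Greedily merge the leaf clusters from step 1 down to n_target tied states,
      always taking the pair with the smallest log-likelihood decrease."""
      clusters = {i: np.concatenate([f for _, f in leaf]) for i, leaf in enumerate(leaves)}
      heap = [(merge_cost(clusters[i], clusters[j]), i, j)
              for i in clusters for j in clusters if i < j]
      heapq.heapify(heap)
      next_id = len(clusters)
      while len(clusters) > n_target and heap:
          cost, i, j = heapq.heappop(heap)
          if i not in clusters or j not in clusters:
              continue                              # stale pair: one side already merged
          merged = np.concatenate([clusters.pop(i), clusters.pop(j)])
          for k in clusters:                        # push costs against the new cluster
              heapq.heappush(heap, (merge_cost(merged, clusters[k]), next_id, k))
          clusters[next_id] = merged
          next_id += 1
      return list(clusters.values())                # the tied-state pool
  ```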

  9. Training Procedure for Parsimonious HMMs
  1. Train the conventional GMM-HMM system.
  2. Calculate the first-order and second-order statistics based on the state-level forced alignment.
  3. Run the two-step algorithm: first step, build the state-tying tree; second step, re-cluster the tied states based on the first step.
  4. Train the parsimonious GMM-HMMs based on the tied states.
  5. Train the parsimonious DNN-HMMs based on the tied states.

  10. Experiments
  • Training set: CASIA-HWDB database, including HWDB1.0, HWDB1.1 and HWDB2.0-HWDB2.2
  • Test set: ICDAR-2013 competition set
  • Vocabulary: 3980 character classes
  • GMM-HMM system
    – Each character modeled by a left-to-right HMM with a 40-component GMM per state
    – Gradient-based features followed by PCA to obtain a 50-dimensional vector
  • DNN-HMM system: 350-2048-2048-2048-2048-2048-2048-(3980*N)
  • DNN-PHMM system: 350-2048-2048-2048-2048-2048-2048-M
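
  A minimal PyTorch-style sketch of the two listed feed-forward topologies; the activation, the number of states per character N, and the tied-state pool size M are placeholders, not values taken from the paper:

  ```python
  import torch.nn as nn

  def make_dnn(input_dim, hidden_dim, num_hidden, output_dim):
      """Feed-forward acoustic model: input -> num_hidden hidden layers -> state outputs."""
      layers, prev = [], input_dim
      for _ in range(num_hidden):
          layers += [nn.Linear(prev, hidden_dim), nn.Sigmoid()]   # activation is an assumption
          prev = hidden_dim
      layers += [nn.Linear(prev, output_dim)]    # state posteriors via softmax in the loss
      return nn.Sequential(*layers)

  N = 5                                          # hypothetical states per character HMM
  M = 12000                                      # hypothetical tied-state pool size
  dnn_hmm = make_dnn(350, 2048, 6, 3980 * N)     # 350-2048x6-(3980*N)
  dnn_phmm = make_dnn(350, 2048, 6, M)           # 350-2048x6-M
  ```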

  11. HMM vs. PHMM
  • Performance saturates as the number of states per character increases.
  • PHMM outperforms HMM with the same setting of the tied-state number.
  • The best PHMM is much more parsimonious than the best HMM.
  • This demonstrates the reasonability of the proposed state-tying algorithm.

  12. HMM vs. PHMM
  • The model becomes much more compact by setting the number of tied states per character below 1.
  • DNN-PHMM (Ns = 0.5, 9.52%) outperforms DNN-HMM (Ns = 1, 11.09%).

  13. Memory and Computation Costs
  DNN-PHMM with the (1024, 4) setting achieved a CER comparable to DNN-HMM with the (2048, 6) setting, while reducing the model size by 75% and the run-time latency by 72%.
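
  To see where the saving comes from, the sketch below counts fully connected parameters for the two settings; the output-layer sizes (N states per character, M tied states) are hypothetical placeholders, so the printed reduction only illustrates how strongly the output layer dominates the model size:

  ```python
  def dnn_params(input_dim, hidden_dim, num_hidden, output_dim):
      """Weight + bias count of a fully connected net: input -> hidden x num_hidden -> output."""
      params = input_dim * hidden_dim + hidden_dim                        # first hidden layer
      params += (num_hidden - 1) * (hidden_dim * hidden_dim + hidden_dim) # remaining hidden layers
      params += hidden_dim * output_dim + output_dim                      # output layer
      return params

  N = 5        # hypothetical states per character HMM
  M = 12000    # hypothetical tied-state pool size for the PHMM
  hmm_size = dnn_params(350, 2048, 6, 3980 * N)    # DNN-HMM, (2048, 6) setting
  phmm_size = dnn_params(350, 1024, 4, M)          # DNN-PHMM, (1024, 4) setting
  print(f"DNN-HMM  params: {hmm_size:,}")
  print(f"DNN-PHMM params: {phmm_size:,}")
  print(f"reduction: {1 - phmm_size / hmm_size:.0%}")
  ```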

  14. State Tying Result Analysis
  Similar characters that were tied together, with their shared radical part and the structure in which the radical appears:
  • 口 (left-right structure): 喷 喻 嗅 嗡 吃 咆 哦 哨 嘈 嘲 噬 嚼
  • 宀 (top-bottom structure): 客 害 容 密 寇 蜜 穷 穿 突 窃 窍 窑
  • 口 (surround structure): 圃 圆 囚 囤 困 围 固
  • 匚 (left-surround structure): 巨 匝 匠 匡 匣 匪 匹 医 匿 臣
  • 辶 (bottom-left-surround structure): 诞 巡 边 逊 辽 达 谜 迁 迂 过 近 这
  • 门 (top-surround structure): 澜 阐 阑 鬲 闸 闻 闽 润
  • | (cross structure): 串 吊 甲 牢 帛 早 平
  • 气 (top-right-surround structure): 氛 氢 氦 氨
  Chinese characters with the same or similar radicals were easily tied by the proposed algorithm. This is why the proposed DNN-PHMM can still maintain high recognition performance despite its quite compact design.

  15. Thanks!
