Home Research Publications

 

 


Recognition of on-line cursive script


Some of the main problems associated with recognition of on-line cursive script are how to deal with extremely large variability of letter shapes and how to deal with the segmentation problem.  The variability in letter shapes is partially a result of different writing styles but even for the same writer the same letter is often written differently depending on the position of the letter within a word. 

To get a sense of the variability of letter shapes check out the following examples from our database (obtained from David Rumelhart).  If you were not able to read all the words, here is the list (ordered from left to right, top to down):  animal, foolish, river, account, art, and sleep.

One of the reasons we started this project is because of the segmentation problem.  Although extremely important in many areas, including computer vision and sequence analysis (from DNA analysis to speech recognition), the segmentation problem remains largely unsolved.  Why is that?  Probably because it is one of those "chicken-and-egg" problems. It is very hard to get a handle of those problems. In order to segment a word it helps to know the letters but in order to recognize the letters it helps to know the word.  Recognition of on-line script seemed like a nice starting point since patterns are one-dimensional but at the same time sufficiently complex and challenging.  In the following, I will briefly describe two models that we have developed for recognition of cursive script.  The main advantage of these two models over the  state-of-the-art model for cursive script recognition, the Hidden Markov Model (HMM), is that they can handle duration modeling without an increase in computational complexity.


 

Home Research Publications