Journal of Northeastern University ›› 2004, Vol. 25 ›› Issue (11): 1061-1064.DOI: -

• OriginalPaper •

Manchu character recognition post-processing based on Bayes rules and substitution set confusion matrix

Li, Jing-Jiao (1); Zhao, Ji (1)   

  1. (1) Sch. of Info. Sci. and Eng., Northeastern Univ., Shenyang 110004, China; (2) Anshan Univ. of Sci. and Technol., Anshan 114002, China
  • Received:2013-06-24 Revised:2013-06-24 Online:2004-11-15 Published:2013-06-24
  • Contact: Zhao, J.

Abstract: The recognition information on single Manchu characters output by the recognition system is combined with phrase-level information to build a statistical information database of Manchu phrases and underdetermined word sets. Bayes rules are then used to combine the prior probability of the underdetermined Manchu word sets with the posterior probability of phrases. The resulting data structure improves the recognition rate efficiently; it is rational and easy to implement, and is especially suited to detecting and correcting words that the single character recognition (SCR) system rejected or recognized incorrectly. Experiments show that post-processing performance depends not only on the language model but also on an accurate estimate of the posterior probability. In addition, the higher the recognition rate of the SCR system, the stronger the corrective ability of the post-processing.
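As a rough illustration of the general idea only, the sketch below rescores candidate phrases with Bayes' rule, multiplying a phrase prior by per-character confusion probabilities. It is not the authors' implementation: the function name `correct_phrase`, the dictionaries `confusion` and `phrase_prior`, all probability values, and the Latin-letter placeholder "characters" are assumptions made for this example, and the abstract does not specify how the paper's own quantities are estimated.

```python
def correct_phrase(recognized, confusion, phrase_prior):
    """Pick the most probable phrase given the SCR output (hypothetical sketch).

    recognized   : string of characters output by the recognizer
    confusion    : confusion[true_char][output_char] = P(output_char | true_char)
    phrase_prior : phrase -> prior probability taken from a phrase database
    Returns (best_phrase, posterior), or (None, {}) if no phrase is compatible.
    """
    scores = {}
    for phrase, prior in phrase_prior.items():
        if len(phrase) != len(recognized):
            continue
        # Likelihood of the observed output if `phrase` were the true text.
        likelihood = 1.0
        for true_char, output_char in zip(phrase, recognized):
            likelihood *= confusion.get(true_char, {}).get(output_char, 0.0)
        if likelihood > 0.0:
            scores[phrase] = prior * likelihood  # unnormalized Bayes numerator

    total = sum(scores.values())
    if total == 0.0:
        return None, {}
    posterior = {phrase: score / total for phrase, score in scores.items()}
    best = max(posterior, key=posterior.get)
    return best, posterior


# Toy data (assumed values): the recognizer often outputs "o" for a true "a".
confusion = {
    "a": {"a": 0.7, "o": 0.3},
    "o": {"o": 0.8, "a": 0.2},
    "c": {"c": 1.0},
    "t": {"t": 1.0},
}
phrase_prior = {"cat": 0.95, "cot": 0.05}

best, posterior = correct_phrase("cot", confusion, phrase_prior)
```

In this toy setup the recognizer output "cot" is corrected to "cat", because the strong phrase prior outweighs the lower confusion probability. This mirrors the abstract's observation that the result depends both on the language model and on how accurately the probabilities are estimated.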
