东北大学学报(自然科学版) ›› 2007, Vol. 28 ›› Issue (3): 316-319.DOI: -

• 论著 • 上一篇    下一篇

文本区域字符颜色极性判断方法

孙红星;赵楠楠;王蓉;徐心和;   

  1. 东北大学信息科学与工程学院;辽宁科技大学电子与信息工程学院;中国人民公安大学安全防范系;东北大学信息科学与工程学院 辽宁沈阳110004;辽宁鞍山114044;北京100038;辽宁沈阳110004
  • 收稿日期:2013-06-24 修回日期:2013-06-24 出版日期:2007-03-15 发布日期:2013-06-24
  • 通讯作者: Sun, H.-X.
  • 作者简介:-
  • 基金资助:
    国家自然科学基金资助项目(60475036)

Method to recognize character's color polarity of text region

Sun, Hong-Xing (1); Zhao, Nan-Nan (2); Wang, Rong (3); Xu, Xin-He (1)   

  1. (1) School of Information Science and Engineering, Northeastern University, Shenyang 110004, China; (2) School of Electronic and Information Engineering, Liaoning University of Science and Technology, Anshan 114044, China; (3) College of Information Security and Engineering, Chinese People's Public Security University, Beijing 100038, China
  • Received:2013-06-24 Revised:2013-06-24 Online:2007-03-15 Published:2013-06-24
  • Contact: Sun, H.-X.
  • About author:-
  • Supported by:
    -

摘要: 文本区域的字符存在着不同的颜色极性.为了能够正确地把文本区域的灰度图像转换成OCR识别软件可以识别的二值图像,提出了一种判断文本区域字符颜色极性的方法.首先计算文本区域的灰度-梯度共生矩阵,并根据目标函数快速地找到分割的灰度和梯度最佳阈值;然后在此基础上提取特征向量,送入神经网络进行分类;最后根据颜色极性判断的结果,分割出字符.实验结果表明,提出的方法在复杂度不同的背景下,正确地识别出了不同类别的字符颜色极性.

关键词: 文本提取, 字符, 颜色极性, 灰度-梯度共生矩阵, 神经网络

Abstract: Characters in a text region may have different color polarities. To convert correctly the image with grayscale in an accepted text region into the OCR-ready binary image, a method is proposed to classify then recognize the color polarity of characters in a text region. The gray-gradient co-occurrence matrix of the text region is calculated, and the optimum thresholds of segmented grayscale and gradient are found quickly according to the objective function. Then, the feature vector is extracted from the gray-gradient co-occurrence matrix and fed into neural network to classify the color polarity. All the characters in the text region are finally segmented according to the classification of color polarities. Experimental results showed that the proposed method can recognize correctly different color polarities of characters in the background with different complexities.

中图分类号: