查看论文信息

中文题名：	自然场景文本区域定位
姓名：	李昭早
学号：	0308120706
保密级别：	公开
论文语种：	chi
学科代码：	0810
学科名称：	信息与通信工程
学生类型：	硕士
学位：	工学硕士
学校：	西安电子科技大学
院系：	通信工程学院
专业：	通信与信息系统
第一导师姓名：	卢朝阳
第一导师单位：	西安电子科技大学
第二导师姓名：	高西全
完成日期：	2006-03-08
答辩日期：	2006-03-08
外文题名：	Text Location in Nature Image
中文关键词：	文本定位 ; RGB聚类 ; HSV颜色模型 ; 边缘检测 ; 裂缝检测 ; 仪表识别
中文摘要：	︿随着人们日常生活水平的提高，人们持着手机、数码相机拍摄自然场景中的文字图像，它们即可自动把图像中的文字转换成可编辑的文本资料。自然场景中的文字识别(OCR)已经成为人们日常生活中的一种迫切需求。对于复杂自然场景，文本区域定位已成为OCR必不可少的前提环节。本文结合科研项目，重点对文字的颜色信息和形状信息进行了深入的探讨，提出了一些实效的文本定位算法：(1)基于RGB阈值化聚类的文本区域定位算法。该算法首先对彩色图像的RGB分量分别进行局部阈值化，然后通过颜色聚类形成若干层子图像，再在各子图像中进行刷子刷图、连通域分析等处理，最后把各子图像综合。(2)基于人眼感知HSV空间聚类法。利用HSV空间模型，根据人眼对颜色的感知原理把自然图像分为红橙黄绿青蓝紫七彩色子图和纯黑白的子图，然后再进行后期处理。在改进的颜色子图中把图像分割成了人们常用的红黄绿蓝灰色图像。(3)基于HSV与Sober边缘检测混合模型的文本定位算法。该算法既考虑文字的颜色信息，又考虑文字的形状边缘信息，充分利用HSV图像聚类和Sober边缘检测的优点，把两者结合起来形成混合模型，从而达到更高识别率的结果。实验结果表明，本文所提出的几种文本定位算法具有新颖之处，这些方法不但可以相对准确有效地定位出相应的文本区域，而且能够比较准确的框定文本区域的大致结构，具有一定的理论价值和较高的实用价值。另外，文章介绍了图像处理中的一些实用算法。对桥梁视频裂缝检测与监控进行了深入的研究；同时，对基于图像处理的仪表识别系统的组成以及识别过程也进行了详细的介绍。﹀
外文摘要：	︿ With the development of the people’s life, the handle telephone and digital camera can get the nature image with word and change the image word into the edited data word. So the recognition of word in nature image has become the impending demand. To the complex nature image, the text location is the precondition of word recognition. Based on the scientific research project, we emphases on word’s color information and shape information, and bring forward several useful arithmetic of text location: (1) The arithmetic of text location on RGB threshold clustering. In this arithmetic, we locally binarizate the RGB weight of color image, form several children images by color clustering, brush the image and analysis the interconnected domain in the children images. At last we synthesize all children images to get the location of text. (2) HSV clustering arithmetic. Based on the vision theory on color, we use the HSV model to divide the nature image into red、orange、yellow、green、cyan、blue、purple and black、white、gray children images, and then dong the latter work. In the improved dividing method, the image is divided into red、yellow、green、blue and gray images. (3) The arithmetic on Mixed model of HSV and Sober edge detection. It considers not only the color information, but also the edge and shape of word. And it combines these to mixed model to get the best result. The result of experiment shows that the arithmetic presented in this paper is novel and it can well and truly locate the text, and more it can fix the rough frame of word. This arithmetic is valuable in theory and application. In addition, some applicable arithmetic is introduced latter in this paper. The detection and surveillance to the bridge’s crack is deeply researched, at the same time, the composing and the recognition process of analog measuring instruments system by image processing is introduced in detail. ﹀
中图分类号：	11
馆藏号：	11-4648
开放日期：	2015-09-13

附件下载