[1]龚学尧,赵逢禹.基于图像搜索与合成的图像语义理解[J].计算机技术与发展,2022,32(06):57-62.[doi:10. 3969 / j. issn. 1673-629X. 2022. 06. 010]
 GONG Xue-yao,ZHAO Feng-yu.Image Semantic Understanding Based on Image Search and Synthesis[J].,2022,32(06):57-62.[doi:10. 3969 / j. issn. 1673-629X. 2022. 06. 010]
点击复制

基于图像搜索与合成的图像语义理解()

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
32
期数:
2022年06期
页码:
57-62
栏目:
图形与图像
出版日期:
2022-06-10

文章信息/Info

Title:
Image Semantic Understanding Based on Image Search and Synthesis
文章编号:
1673-629X(2022)06-0057-06
作者:
龚学尧赵逢禹
上海理工大学 光电信息与计算机工程学院,上海 200093
Author(s):
GONG Xue-yaoZHAO Feng-yu
School of Optical-Electrical & Computer Engineering,University of Shanghai for Science and Technology,Shanghai 200093,China
关键词:
图像语义理解图像识别搜索引擎图相似计算语义合成
Keywords:
image semantic understandingimage recognitionsearch enginesgraph similarity algorithmsemantic synthesis
分类号:
TP391
DOI:
10. 3969 / j. issn. 1673-629X. 2022. 06. 010
摘要:
图像语义理解是计算机视觉的重要研究领域,对于人工智能的发展具有重大的现实意义,当前大部分的方法在生成多样化的图像语义方面还存在着不足,因此提出了一种基于图像搜索的图像语义合成方法。 该方法首先将图像输入搜索引擎以获得相似图像及其携带的描述;其次通过目标检测算法对图像中的目标进行识别以获得图像内目标词及目标框图像;然后计算图像相似度并比较描述文本中的目标词,利用相似度与共有目标词形成度量指标,提取搜索获得的描述文本中最符合原图像语义的文本作为基础文本;最后利用基础文本中缺失的目标词对应的文本与基础文本合成从而获得图像语义。 在 MSCOCO 数据集上的实验结果表明,该方法借助搜索引擎与语义合成可以有效地反映图像语义,相较于其他图像语义理解算法能够更准确地识别图像中的物体,输出更全面的图像语义;对于图像中的内容能够进行更加多样化的描述。
Abstract:
Image semantic understanding is an important research area of computer vision,which is of great practical significance to thedevelopment of artificial intelligence. Most of the current methods are insufficient in generating diversified image semantics,so an imagesemantic synthesis method based on image search is proposed. Images are first inputted into the search engine to obtain similar imagesand their text descriptions. Secondly,the target detection algorithm is used to recognize the target in the image,so as to obtain the targetimage and target words. Then the image similarity is calculated and the target words in the description text are compared. Similarity andcommon target words are used to form metrics. The text most consistent with the original image semantics from the description text is extracted as the basic text. Finally,the corresponding text of the missing target word is synthesized with the basic text to obtain the imagesemantics. The experimental results on the MSCOCO data set show that the proposed method can effectively reflect the image semantics.Compared with other image semantic understanding algorithms,it can identify the objects in the image more accurately and output morecomprehensive image semantics. The image semantic contents are more diversified.

相似文献/References:

[1]王明平 宋丽梅.基于计算机视觉的车架号采集系统[J].计算机技术与发展,2008,(04):239.
 WANG Ming-ping,SONG Li-mei.Vehicle Identify Number Acquisition System Based on Machine-Vision[J].,2008,(06):239.
[2]徐仕玲 赵敏 徐建波.野外早期火灾图像识别方法研究[J].计算机技术与发展,2008,(06):214.
 XU Shi-ling,ZHAO Min,XU Jian-bo.Research for Early Fire Image Recognition Technology in Outdoors[J].,2008,(06):214.
[3]张婷 吴元君 黄俊 吴建国.选票选举系统中选票图像的预处理方法研究[J].计算机技术与发展,2007,(04):225.
 ZHANG Ting,WU Yuan-jun,HUANG Jun,et al.Research of Image Pre- Processing in Vote Processing System[J].,2007,(06):225.
[4]吴忠 朱国龙 黄葛峰 吴建国.基于图像识别技术的手写数字识别方法[J].计算机技术与发展,2011,(12):48.
 WU Zhong,ZHU Guo-long,HUANG Ge-feng,et al.Handwritten Digit Recognition Based on Image Recognition System[J].,2011,(06):48.
[5]汪晨 张涛 林为民 邓松 时坚 李伟伟.图像识别综述及在电力信息安全中的应用研究[J].计算机技术与发展,2012,(04):161.
 WANG Chen,ZHANG Tao,LIN Wei-min,et al.Image Recognition Review and Application Research in Electric Power Information Security[J].,2012,(06):161.
[6]王珂雅 邱力军.一种新的脑部CT图像异常检测算法[J].计算机技术与发展,2012,(05):185.
 WANG Ke-ya,QIU Li-jun.A New Anomaly Detection Algorithm for Brain CT Image[J].,2012,(06):185.
[7]周诚诚 张代远[].利用图像识别技术过滤海量可疑钓鱼网站[J].计算机技术与发展,2012,(11):246.
 ZHOU Cheng-cheng,ZHANG Dai-yuan.Using Image Recognition Technology to Filter Mass Suspicious Phishing Sites[J].,2012,(06):246.
[8]毕 杨,宋 飞,王 轩.一种航空维修工具智能管理系统的设计与实现[J].计算机技术与发展,2020,30(07):199.[doi:10. 3969 / j. issn. 1673-629X. 2020. 07. 041]
 BI Yang,SONG Fei,WANG Xuan.Design and Implementation of an Intelligent Management System of Aviation Maintenance Tools[J].,2020,30(06):199.[doi:10. 3969 / j. issn. 1673-629X. 2020. 07. 041]
[9]严一鸣[],郭星[]. 基于计算机视觉的交互式电子沙盘系统研究[J].计算机技术与发展,2017,27(06):195.
 YAN Yi-ming[],GUO Xing[]. Investigation on Interactive Electronic Sand Table System with Computer Vision[J].,2017,27(06):195.
[10]高友文,周本君,胡晓飞.基于数据增强的卷积神经网络图像识别研究[J].计算机技术与发展,2018,28(08):62.[doi:10.3969/ j. issn.1673-629X.2018.08.013]
 GAO You-wen,ZHOU Ben-jun,HU Xiao-fei.Research on Image Recognition of Convolution Neural Network Based on Data Enhancement[J].,2018,28(06):62.[doi:10.3969/ j. issn.1673-629X.2018.08.013]

更新日期/Last Update: 2022-06-10