[1]易炟,陈东方,王晓峰.基于特征耦合注意力机制的小样本目标检测方法[J].计算机技术与发展,2025,(04):29-36.[doi:10.20165/j.cnki.ISSN1673-629X.2024.0383]
 YI Da,CHEN Dong-fang,WANG Xiao-feng.Few-shot Object Detection Method Based on Feature Coupling Attention Mechanism[J].,2025,(04):29-36.[doi:10.20165/j.cnki.ISSN1673-629X.2024.0383]
点击复制

基于特征耦合注意力机制的小样本目标检测方法()

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
期数:
2025年04期
页码:
29-36
栏目:
媒体计算
出版日期:
2025-04-10

文章信息/Info

Title:
Few-shot Object Detection Method Based on Feature Coupling Attention Mechanism
文章编号:
1673-629X(2025)04-0029-08
作者:
易炟12陈东方12王晓峰12
1. 武汉科技大学 计算机科学与技术学院,湖北 武汉 430065;
2. 智能信息处理与实时工业系统湖北省重点实验室,湖北 武汉 430065
Author(s):
YI Da12CHEN Dong-fang12WANG Xiao-feng12
1. School of Computer Science and Technology,Wuhan University of Science and Technology,Wuhan 430065,China;
2. Hubei Provincial Key Laboratory of Intelligent Information Processing and Real-time Industrial Systems,Wuhan 430065,China
关键词:
小样本学习目标检测元学习特征耦合特征融合任务解耦
Keywords:
few-shot learningobject detectionmeta-learningfeature couplingfeature fusiontask decoupling
分类号:
TP391.4
DOI:
10.20165/j.cnki.ISSN1673-629X.2024.0383
摘要:
在当前基于元学习的小样本目标检测方法中,查询图像和支持图像的特征提取过程往往在进行最终特征融合之前是独立进行的,缺乏有效的信息交互,这导致查询特征的代表性信息不足,尤其在样本极度有限的情况下更为显著。 为解决这一问题,提出了一种基于特征耦合注意力机制的小样本目标检测方法(FC-FSOD)。 FC-FSOD 以 Faster-RCNN 作为基础网络架构,并在此基础上进行了创新。 首先,设计了特征增强耦合模块,该模块增强了支持特征,使其成为更具代表性的支持原型;其次,通过注意力机制将原型与查询特征进行耦合,赋予查询特征以特定于支持特征的感知信息;再次,综合考虑查询图像的 ROI 特征与类级原型之间的差异性和相似性,设计了一种更为鲁棒的特征融合策略;最后,对分类和回归任务进行了解耦处理,消除了全局平均池化对回归预测的负面影响。 在 PASCAL VOC 和 MS COCO 这两个公开数据集上的实验结果表明,该方法在多种小样本场景中的检测精度均有明显提升。
Abstract:
In the current meta-learning-based few-shot object detection methods,the feature extraction processes of query image and support images are often performed independently before the final feature fusion,and there is a lack of effective information interaction,which leads to insufficient representative information of the query features,and it is more significant especially in the case of extremely limited samples. To solve this problem,a few-shot object detection method based on the feature coupling attention mechanism ( FC-FSOD) is proposed. FC-FSOD takes Faster-RCNN as the basic network architecture and makes innovations based on it. Firstly,a feature enhanced coupling module is designed,which enhances the support features to become more representative support prototypes.Secondly,the prototype is coupled with the query feature through the attention mechanism,and the query feature is endowed with the per-ceptual information specific to the supporting feature. Thirdly, a more robust feature fusion strategy is designed by considering the differences and similarities between the ROI features of the query image and the class-level prototypes. Finally,decoupling of the classi-fication and regression tasks removes the negative influence of global average pooling on regression prediction. Experimental results on two publicly available datasets,PASCAL VOC and MS COCO,show a significant improvement in the detection accuracy of the proposed method in a variety of few-shot scenarios.

相似文献/References:

[1]刘晓明 李毓蕙 高燕 郑华强.基于目标区域清晰显示的H.264编码策略[J].计算机技术与发展,2010,(06):29.
 LIU Xiao-ming,LI Yu-hui,GAO Yan,et al.A Coding Strategy of H.264 Based on High-definition Display of Target Region[J].,2010,(04):29.
[2]刘翔 吴谨 祝愿博 康晓晶.基于视频序列的目标检测与跟踪技术研究[J].计算机技术与发展,2009,(11):179.
 LIU Xiang,WU Jin,ZHU Yuan-bo,et al.A Study of Object Detecting and Tracking Based on Video Sequences[J].,2009,(04):179.
[3]曙光 张超 蔡则苏.基于改进的混合高斯模型的目标检测方法[J].计算机技术与发展,2012,(07):60.
 SHU Guang,ZHANG Chao,CAI Ze-su.Target Detection Method Based on Improved Gaussian Mixture Model[J].,2012,(04):60.
[4]刘洁,李目,周少武.一种混沌混合粒子群优化RBF神经网络算法[J].计算机技术与发展,2013,(08):181.
 LIU Jie[],LI Mu[],ZHOU Shao-wu[].An Algorithm of Chaotic Hybrid Particle Swarm Optimization Based on RBF Neural Network[J].,2013,(04):181.
[5]蒋翠清,孙富亮,吴艿芯. 基于相对欧氏距离的背景差值法视频目标检测[J].计算机技术与发展,2015,25(01):37.
 JIANG Cui-qing,SUN Fu-liang,WU Nai-xin. Video Object Detection of Background Subtraction Method Based on Relative Euclidean Distance[J].,2015,25(04):37.
[6]卢官明,衣美佳. 步态识别关键技术研究[J].计算机技术与发展,2015,25(07):100.
 LU Guan-ming,YI Mei-jia. Research on Critical Techniques in Gait Recognition[J].,2015,25(04):100.
[7]高翔,朱婷婷,刘洋. 多摄像头系统的目标检测与跟踪方法研究[J].计算机技术与发展,2015,25(07):221.
 GAO Xiang,ZHU Ting-ting,LIU Yang. Research of Target Detection and Tracking Method for Multi-camera System[J].,2015,25(04):221.
[8]章文洁[][],黄旻[],张桂峰[]. 滤光片多光谱成像中运动目标场景误配准修正[J].计算机技术与发展,2016,26(01):18.
 ZHANG Wen-jie[][],HUANG Min[],ZHANG Gui-feng[]. Misregistration Correction for Moving Object Scene in Filter-type Multispectral Imaging[J].,2016,26(04):18.
[9]施泽浩,赵启军.基于全卷积网络的目标检测算法[J].计算机技术与发展,2018,28(05):55.[doi:10.3969/j.issn.1673-629X.2018.05.013]
 SHI Ze-hao,ZHAO Qi-jun.Object Detection Algorithm Based on Fully Convolutional Neural Network[J].,2018,28(04):55.[doi:10.3969/j.issn.1673-629X.2018.05.013]
[10]张夏清,茅耀斌. 一种改进的ViBe背景提取算法[J].计算机技术与发展,2016,26(07):36.
 ZHANG Xia-qing,MAO Yao-bin. An Improved ViBe Background Generation Method[J].,2016,26(04):36.

更新日期/Last Update: 2025-04-10