基于动态自适应通道注意力特征融合的小目标检测

Small object detection based on dynamic adaptive channel attention feature fusion

  • 摘要: 针对小目标检测中卷积操作导致检测特征缺失和不同尺度语义隔阂的问题,提出一种基于动态自适应通道注意力特征融合的小目标检测方法。1)提出一种多尺度三角动态颈(Tri-Neck)网络结构,用于融合多尺度特征语义隔阂及弥补小目标特征缺失的问题。2)提出一种分组批量动态自适应通道注意力模块,增强弱语义小目标特征同时抑制无用信息,且在动态自适应通道注意力模块中设计新的激活函数和交并比损失函数,提升通道注意力表征能力。3)采用ResNet50作为骨干网络依次连接特征金字塔网络和Tri-Neck网络。实验结果表明,该方法在Pascal Voc 2007、Pascal Voc 2012上比YOLOv8算法mAP分别提升5.3%和6.2%,在MS COCO 2017数据集上AP和APS分别提升1.6%和2%,在SODA-D数据集上比YOLOv8算法AP提升0.9%。

     

    Abstract: In order to solve the feature missing and different scales features semantics gaps problem caused by convolution operations in small object detection, a small object detection method based on dynamic adaptive channel attention feature fusion is proposed in this paper. Firstly, a Tri-Neck network structure is introduced to address the semantic gaps and feature deficiency in small object detection across multiple scales. Secondly, a dynamic adaptive channel attention module is proposed to enhance weak semantic features of small objects while suppressing irrelevant information. Additionally, new activation functions and intersection-over-union loss functions are designed within the dynamic adaptive channel attention module to improve channel attention representation capability. Finally, the ResNet50 backbone network is utilized, connecting the feature pyramid network and the Tri-Neck network sequentially. Experimental results on the Pascal VOC 2007 and Pascal VOC 2012 datasets demonstrate performance improvements of 5.3% and 6.2% respectively, while on the MS COCO 2017 dataset, the proposed algorithm shows enhancements in overall performance and small object detection performance by 1.6% and 2% respectively, and on the SODA-D dataset, our proposed algorithm demonstrates superior performance compared to the suboptimal algorithm AP, resulting in a 0.9% improvement in overall accuracy.

     

/

返回文章
返回