基于注意力模型和Soft-NMS的输电线路小目标检测方法

赵云龙; 田生祥; 李岩; 罗龙; 齐鹏文

doi:10.12178/1001-0548.2022290

基于注意力模型和Soft-NMS的输电线路小目标检测方法

doi: 10.12178/1001-0548.2022290

国网青海省电力公司检修公司　西宁　810000

基金项目: 国家自然科学基金青年项目(61903155)；国网青海省电力公司双创孵化培育基金(106000004505)

详细信息

作者简介:
赵云龙(1988 – )，男，高级工程师，主要从事图像处理方面的研究

通讯作者: 赵云龙，E-mail: rensheng202201@163.com

Attention Model and Soft-NMS-Based Transmission Line Small Target Detection Method

State Grid Qinghai Electric Power Company Overhauling Company 　Xining　810000

摘要: 在输电线路的缺陷检测中，鸟巢以及塑料、碎布等挂空悬浮物多为小目标。其所占像素少，容易被背景干扰，检测精度有待提高。设计了一种全新的两阶段目标检测算法，用于改善对输电线路中鸟巢以及挂空悬浮物的检测效果。为了提高小目标检测的性能，在特征提取模块中融入注意力机制，以学习更为丰富的上下文信息。此外，在检测模块中，设计了基于更为柔和非极大值抑制算法的后处理方法，以减少小目标的丢失。与常用的两阶段目标检测算法相比，该方法在两个类别的平均准确率上分别提高了约4.7%和5.9%，有着更高的实际应用价值。
- 注意力机制 /
- Soft-NMS /
- 小目标检测 /
- 输电线路
Abstract: In the defect detection of transmission lines, the bird's nest, plastic, rags and other suspended solids are mostly small targets. They have few pixels in the images and are easy to be disturbed by the background, which make the detection accuracy needs to be improved. In this paper, a new two-stage object detection algorithm is designed to improve the detection effect of bird nests and suspended solids in transmission lines. In order to improve the detection performance of small targets, the attention mechanism is integrated into the feature extraction network to learn more rich context information. In addition, in the detection network, a post-processing method based on softer non maximum suppression algorithm is designed to reduce the loss of small targets. Compared with the commonly used two-stage object detection algorithms, the proposed method improves the average accuracy of the two categories by about 4.7% and 5.9%, respectively, and has greater value in practical applications.
- attention mechanism /
- soft-NMS /
- small target detection /
- transmission line

图 1 网络整体架构

下载: 全尺寸图片幻灯片

图 2 加入特征金字塔网络的Resnet50-CBAM图解

下载: 全尺寸图片幻灯片

图 3 ResNet50-CBAM中残差块的结构图

下载: 全尺寸图片幻灯片

图 4 卷积块注意力模块结构图

下载: 全尺寸图片幻灯片

图 5 数据集中的小目标样例

下载: 全尺寸图片幻灯片

图 6 测试集中真值框的尺寸分布

下载: 全尺寸图片幻灯片

图 7 数据增强图像示例

下载: 全尺寸图片幻灯片

图 8 模型训练流程

下载: 全尺寸图片幻灯片

图 9 训练过程中损失与AP的变化

下载: 全尺寸图片幻灯片

图 10 鸟巢检测效果

下载: 全尺寸图片幻灯片

图 11 挂空悬浮物检测效果

下载: 全尺寸图片幻灯片

图 12 提出方法的热力图可视化

下载: 全尺寸图片幻灯片

表 1 检测算法AP对比

算法	主干网络	AP（鸟巢）/%	AP（悬浮物）/%
Faster R-CNN	ResNet50	84.2	65.9
Faster R-CNN	ResNet101	85.4	68.7
Cascade R-CNN^[30]	ResNet50	85.8	67.6
YOLOv3^[31]	DarkNet-53	83.6	66.3
YOLOv4^[32]	CSPDarknet53	84.7	66.5
提出的方法	ResNet-CBAM	88.9	71.8

下载: 导出CSV

表 2 小目标召回率对比

算法	主干网络	召回率（鸟巢）/%	召回率（悬浮物）/%
Faster R-CNN	ResNet50	87.3	72.0
本文方法	ResNet-CBAM	92.6	79.7

下载: 导出CSV

[1]	SHU Y, CHEN W. Research and application of UHV power transmission in China[J]. High Voltage, 2018, 3(1): 1-13. doi: 10.1049/hve.2018.0003
[2]	PANG Y, WANG T, ANWER R M, et al. Efficient featurized image pyramid network for single shot detector[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. [S.l.]: IEEE, 2019: 7336-7344.
[3]	YANG X, LIU Q, YAN J, et al. R3det: Refined single-stage detector with feature refinement for rotating object[EB/OL]. [2022-05-07]. https://arxiv.org/abs/1908.05612v6.
[4]	HU J, SHEN L, SUN G. Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 7132-7141.
[5]	WOO S, PARK J, LEE J Y, et al. Cbam: Convolutional block attention module[C]//Proceedings of the European Conference on Computer Vision (ECCV). Berlin: Springer, 2018: 3-19.
[6]	HU J, SHEN L, ALBANIE S, et al. Gather-excite: Exploiting feature context in convolutional neural networks[J]. Advances in Neural Information Processing Systems, 2018, 31: 9401-9411.
[7]	ZhAO Z, XU G, QI Y, et al. Multi-patch deep features for power line insulator status classification from aerial images[C]//2016 International Joint Conference on Neural Networks (IJCNN). Vancouver: IEEE, 2016: 3187-3194.
[8]	GAO Q, WU Y, LI Q. A faulted insulator identification algorithm of sparse difference-based deep belief network[J]. Electronic Measuring Instrument, 2016, 53(1): 19-25.
[9]	LIU Y, YONG J, LIU L, et al. The method of insulator recognition based on deep learning[C]//2016 4th International Conference on Applied Robotics for the Power Industry (CARPI). [S.l.]: IEEE, 2016: 1-5.
[10]	WANG W, TIAN B, LIU Y, et al. Study on the electrical devices detection in UAV images based on region based convolutional neural networks[J]. Journal of Geo-information Science, 2017, 19(2): 256-263.
[11]	MA L, XU C, ZUO G, et al. Detection method of insulator based on faster R-CNN[C]//2017 IEEE 7th Annual International Conference on CYBER Technology in Automation, Control, and Intelligent Systems (CYBER). Honolulu: IEEE, 2017: 1410-1414.
[12]	QING C, BIN Y, RUN Y, et al. Insulator detection and recognition of explosion fault based on convolutional neural networks[J]. Journal of Electronic Measurement and Instrument, 2017, 31(6): 942-953.
[13]	REN S, HE K, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149. doi: 10.1109/TPAMI.2016.2577031
[14]	ZHOU Z, YUAN G, FENG W, et al. Target recognition and evaluation of typical transmission line equipment based on deep learning[C]//Proceedings of Purple Mountain Forum 2019-International Forum on Smart Grid Protection and Control. Singapore: Springer, 2020: 701-709.
[15]	GAO F, WANG J, KONG Z, et al. Recognition of insulator explosion based on deep learning[C]//2017 14th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP). Chengdu: IEEE, 2017: 79-82.
[16]	BAI R X, CAO H, YU Y J, et al. Insulator fault recognition based on spatial pyramid pooling networks with transfer learning (match 2018)[C]//2018 3rd International Conference on Advanced Robotics and Mechatronics (ICARM). Beijing: IEEE, 2018: 824-828.
[17]	LIU X, JIANG H, CHEN J, et al. Insulator detection in aerial images based on faster regions with convolutional neural network[C]//2018 IEEE 14th International Conference on Control and Automation (ICCA). [S.l.]: IEEE, 2018: 1082-1086.
[18]	WANG Y, WANG J, GAO F, et al. Detection and recognition for fault insulator based on deep learning[C]//2018 11th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI). Beijing: IEEE, 2018: 1-6.
[19]	LING Z, QIU R C, JIN Z, et al. An accurate and real-time self-blast glass insulator location method based on faster R-CNN and U-net with aerial images[EB/OL]. [2022-6-11]. https://arxiv.org/pdf/1801.05143.pdf.
[20]	WANG B, WU R, ZHENG Z, et al. Study on the method of transmission line foreign body detection based on deep learning[C]//2017 IEEE Conference on Energy Internet and Energy System Integration (EI2). [S.l.]: IEEE, 2017: 1-5.
[21]	TIAN B, LI D, WANG W, et al. Transmission line image defect diagnosis preprocessed parallel method based on deep learning[C]//2018 3rd International Conference on Mechanical, Control and Computer Engineering (ICMCCE). [S.l.]: IEEE, 2018: 299-303.
[22]	OHTA H, SATO Y, MORI T, et al. Image acquisition of power line transmission towers using uav and deep learning technique for insulators localization and recognition[C]//2019 23rd International Conference on System Theory, Control and Computing (ICSTCC). [S.l.]: IEEE, 2019: 125-130.
[23]	CHEN H, HE Z, SHI B, et al. Research on recognition method of electrical components based on YOLO V3[J]. IEEE Access, 2019, 7: 157818-157829. doi: 10.1109/ACCESS.2019.2950053
[24]	LIU S, WANG B, GAO K, et al. Object detection method for aerial inspection image based on region-based fully convolutional network[J]. Automation of Electric Power Systems, 2019, 43(13): 162-168.
[25]	李弘宸, 杨忠, 姜遇红, 等. 基于优化ResNet的输电线路航拍图像分类方法[J]. 应用科技, 2021, 48(2): 64-68. doi: 10.11991/yykj.202007009 LI H C, YANG Z, JIANG Y H, et al. Aerial image classification method of transmission lines based on optimized ResNet[J]. Applied Science and Technology, 2021, 48(2): 64-68. doi: 10.11991/yykj.202007009
[26]	IOFFE S, SZEGEDY C. Batch normalization: Accelerating deep network training by reducing internal covariate shift[C]//International Conference on Machine Learning. [S.l.]: PMLR, 2015: 448-456.
[27]	APICELLA A, DONNARUMMA F, ISGRO F, et al. A survey on modern trainable activation functions[J]. Neural Networks, 2021, 138: 14-32. doi: 10.1016/j.neunet.2021.01.026
[28]	BODLA N, SINGH B, CHELLAPPA R, et al. Soft-NMS--improving object detection with one line of code[C]//Proceedings of the IEEE International Conference on Computer Vision. [S. l.]: IEEE, 2017: 5561-5569.
[29]	DENG J, DONG W, SOCHER R, et al. Imagenet: A large-scale hierarchical image database[C]//2009 IEEE Conference on Computer Vision and Pattern Recognition. [S. l.]: IEEE, 2009: 248-255.
[30]	CAI Z, VASCONCELOS N. Cascade R-CNN: Delving into high quality object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. [S. l.]: IEEE, 2018: 6154-6162.
[31]	REDMON J, FARHADI A. Yolov3: An incremental improvement[EB/OL]. [2022-7-11]. https://arxiv.org/pdf/1804.02767.pdf.
[32]	BOCHKOVSKIY A, WANG C Y, LIAO H Y M. Yolov4: Optimal speed and accuracy of object detection[EB/OL]. [2022-7-15]. https://arxiv.org/abs/2004.10934

[1]	要媛媛, 付潇, 杨东瑛, 王洁宁, 郑文. 结合全局信息增强的医学领域命名实体识别研究 . 电子科技大学学报, 2024, 53(): 1-10. doi: 10.12178/1001-0548.2023064
[2]	刘志刚, 张国辉, 高月, 刘苗苗. 多因素引导的行人重识别数据增广方法研究 . 电子科技大学学报, 2024, 53(2): 235-242. doi: 10.12178/1001-0548.2023056
[3]	崔少国, 独潇, 张宜浩. 基于兴趣注意力网络的会话推荐算法 . 电子科技大学学报, 2024, 53(1): 67-75. doi: 10.12178/1001-0548.2022307
[4]	李阳, 李春璇, 徐灿飞, 方立梅. 基于残差注意力机制的肺结节数据增强方法 . 电子科技大学学报, 2023, 52(6): 880-886. doi: 10.12178/1001-0548.2022363
[5]	莫太平, 黄巧人, 陈德鸿, 伍锡如, 张向文. 改进可逆缩放网络的图像超分辨率重建 . 电子科技大学学报, 2023, 52(5): 739-746. doi: 10.12178/1001-0548.2022261
[6]	黄颖, 许剑, 周子祺, 陈树沛, 周帆, 曹晟. 高效长序列水位预测模型的研究与实现 . 电子科技大学学报, 2023, 52(4): 595-601. doi: 10.12178/1001-0548.2022133
[7]	齐鹏文, 李渊, 李岩, 罗龙, 赵云龙. 基于注意力模型的输电线路隐患检测 . 电子科技大学学报, 2023, 52(2): 240-246. doi: 10.12178/1001-0548.2022063
[8]	张凤荔, 王雪婷, 王瑞锦, 汤启友, 韩英军. 融合动态图表示和自注意力机制的级联预测模型 . 电子科技大学学报, 2022, 51(1): 83-90. doi: 10.12178/1001-0548.2021100
[9]	毛文清, 徐雅斌. 基于深度图卷积网络的社交机器人识别方法 . 电子科技大学学报, 2022, 51(4): 615-622, 629. doi: 10.12178/1001-0548.2021280
[10]	周丰丰, 牛甲昱. 细胞穿膜肽识别问题的多特征融合卷积网络预测算法 . 电子科技大学学报, 2022, 51(4): 494-499. doi: 10.12178/1001-0548.2021391
[11]	王磊, 孙志成, 王磊, 陈端兵, 蒋家玮. 基于DRSN-CW和LSTM的轴承故障诊断 . 电子科技大学学报, 2022, 51(6): 921-927. doi: 10.12178/1001-0548.2021385
[12]	葛唯益, 程思伟, 王羽, 徐建. 基于双向门控循环神经网络的事件论元抽取方法 . 电子科技大学学报, 2022, 51(1): 100-107. doi: 10.12178/1001-0548.2021153
[13]	代翔. 基于事件模式及类型的事件检测模型 . 电子科技大学学报, 2022, 51(4): 592-599. doi: 10.12178/1001-0548.2021377
[14]	郭磊, 王邱龙, 薛伟, 郭济. 基于改进YOLOv5的小目标检测算法 . 电子科技大学学报, 2022, 51(2): 251-258. doi: 10.12178/1001-0548.2021235
[15]	郭磊, 王邱龙, 薛伟, 郭济. 基于注意力机制的光线昏暗条件下口罩佩戴检测 . 电子科技大学学报, 2022, 51(1): 123-129. doi: 10.12178/1001-0548.2021222
[16]	赵磊, 高联丽, 宋井宽. 面向视觉对话的自适应视觉记忆网络 . 电子科技大学学报, 2021, 50(5): 749-753. doi: 10.12178/1001-0548.2021057
[17]	头旦才让, 仁青东主, 尼玛扎西, 于永斌, 邓权芯. 基于改进字节对编码的汉藏机器翻译研究 . 电子科技大学学报, 2021, 50(2): 249-255, 293. doi: 10.12178/1001-0548.2020218
[18]	李学明, 岳贡, 陈光伟. 基于多模态注意力机制的图像理解描述新方法 . 电子科技大学学报, 2020, 49(6): 867-874. doi: 10.12178/1001-0548.2019228
[19]	邓钰, 雷航, 李晓瑜, 林奕欧. 用于目标情感分类的多跳注意力深度模型 . 电子科技大学学报, 2019, 48(5): 759-766. doi: 10.3969/j.issn.1001-0548.2019.05.016
[20]	王淼, 刘伟东, 李一鹏, 武艺, 李源源, 张忠瑞, 陈明. 匹配输电线路巡检需求的无人机选型研究 . 电子科技大学学报, 2018, 47(1): 60-65. doi: 10.3969/j.issn.1001-0548.2018.01.009

点击查看大图

图(12) / 表(2)

计量

文章访问数: 4007
HTML全文浏览量: 1258
PDF下载量: 24
被引次数: 0

全文HTML

随着特高压的逐渐普及，电网规模越来越大^[1]。为了保障输电安全和质量，电力线路巡检越来越被世界各国重视。由于输电线架设高且多架设于崎岖地形，人工检查一方面需要花费大量的时间和精力，另一方面不可避免地存在安全风险。无人机（unmanned aerial vehicle, UAV）为输电线路提供了一种新的巡检方式，即从多个角度拍摄密集的输电线路巡检图像，建立事后人工查看照片的工作模式。“拍照和查看”的工作流程带来了巨大的工作量，因此开发一种智能缺陷检测方法来自动检查传输缺陷照片至关重要。

用于定位和分类目标物体的目标检测算法有望实现传输缺陷照片的自动检查。在深度学习发展前，目标检测依赖于基于图像处理的手工设计算法，如光流法、帧差法等。然而，传统方法普遍抗干扰、抗噪声能力较差，且较难适应光照变化。当前，基于深度学习的两阶段目标检测算法在精度上有着较好的表现。

由于架空线路上的鸟巢以及塑料袋、碎布等挂空悬浮物目标多数为小目标，其包含像素少，且容易被背景干扰，导致检测较为困难。此类小目标物体的检测有着长远的研究价值，一直广受关注。一些研究证明组合不同特征层可以提升小目标检测的效果。文献[2]利用特征金字塔融合机制构建图像金字塔，并将金字塔4个不同特征层学习到的特征与SSD （single shot multibox detector）框架的原始特征进行融合用于获取全局信息。此外，他们将上一层与本层的特征进行合并以获取到局部信息。文献[3]除了组合卷积层和反卷积层的特征图外，还在“编码器−解码器”架构中使用了反卷积层，进一步提高了检测效果。除了组合不同特征层，获取上下文信息也被证明可用于提高小目标检测的效果。其中，基于注意力机制的方法通过为提取的特征分配不同权重以实现更好的上下文关联效果。挤压和激励网络（squeeze-and-excitation networks, SENet）^[4]找到输入特征中每个通道的重新校准权重，并使用这些权重来提升重要通道。卷积块注意模块（convolutional block attention module, CBAM）^[5]使用通道注意和空间注意来校准特征并学习更好的表示，以便网络可以在下游任务上表现更好。汇集和激发模块（GE）^[6]在全局级别聚合特征，并使用汇集的信息增强局部特征的重要组成部分。

当前已有一些研究试图验证深度学习在输电线检测中的有效性。文献[7]提出了一种结合卷积神经网络和支持向量机的对绝缘子状态进行分类的方法。文献[8]介绍了深度隐含网络在绝缘子分类任务中的有效性。这两篇论文都展示了深度学习方法在传输线检测领域的潜力。一些研究人员尝试使用基于深度学习的目标检测方法来定位输电线路塔上的组件。文献[9]构建了一个具有两个前景类对象的数据集，并训练了一个六层的神经网络。文献[10-12]采用更快的基于感兴趣区域的卷积神经网络（faster R-CNN^[13]）算法。文献[14]提出了一种基于YOLO算法的绝缘体和减振器检测模型。这些研究仅应用基于深度学习的对象检测方法来定位塔上的组件，但未能检测到传输线照片中的缺陷。

在深度学习与输电线路的结合中，越来越多的研究集中在缺陷检测^[15-16]上。采用的方法主要分为两阶段方法（如Faster R-CNN^[17-18]）以及一阶段方法（如YOLO或SSD）。文献[15]采用基于区域候选的卷积神经网络模型，实现绝缘子的端到端智能检测，并可以检测出故障爆炸的坐标。文献[19]通过基于 Faster R-CNN以及U-net来检测破损绝缘子的位置。文献[20]采用3种方法对输电线路异物实际数据集进行测试验证，讨论了不同参数对识别结果的影响。文献[21]进行了预测架构的级联设计和并行目标检测结果的逻辑推理，该机制可以更快地提高算法在实际项目下的应用效果。文献[22]基于YOLO搭建检测系统，使用无人机自动检查电力传输塔和绝缘体。文献[23]引入了超分辨率卷积神经网络来增强绝缘子或阻尼器故障的训练数据。文献[24]利用了在线硬样挖掘和样本平衡等的优点。这些研究为输电线路航拍图像缺陷检测提供了初步的解决方案。

本文提出了一种基于注意力获取和Soft-NMS的传输线路中鸟巢与挂空悬浮物的检测算法。对比实验结果表明，提出的方法可以有效地提高输电线路鸟巢与挂空悬浮物的检测效果。本文的主要贡献为：

1）设计了一种基于注意力机制和ResNet的主干网络，允许提取的特征进行自适应权重调整，更充分地进行上下文信息的学习，有利于鸟巢和挂空悬浮物这类小型目标的检测；

2）将Faster R-CNN中的后处理程序换为Soft-NMS。用于减少同类别检测框重叠时的漏检情况，防止小目标的丢失。