Improved YOLOX SAR Near-Shore Area Ship Detection Method

LIU Lin; XIAO Jiarong; WANG Xiaobei; ZHANG Desheng; YU Zhongjun

doi:10.12178/1001-0548.2022039

Volume 52 Issue 1

Jan. 2023

Article Contents

Article Navigation > Journal of University of Electronic Science and Technology of China > 2023 > 52(1): 44-53

LIU Lin, XIAO Jiarong, WANG Xiaobei, ZHANG Desheng, YU Zhongjun. Improved YOLOX SAR Near-Shore Area Ship Detection Method[J]. Journal of University of Electronic Science and Technology of China, 2023, 52(1): 44-53. doi: 10.12178/1001-0548.2022039

Citation:

LIU Lin, XIAO Jiarong, WANG Xiaobei, ZHANG Desheng, YU Zhongjun. Improved YOLOX SAR Near-Shore Area Ship Detection Method[J]. Journal of University of Electronic Science and Technology of China, 2023, 52(1): 44-53. doi: 10.12178/1001-0548.2022039

Improved YOLOX SAR Near-Shore Area Ship Detection Method

doi: 10.12178/1001-0548.2022039

LIU Lin^{1, 2, 3},
XIAO Jiarong^{2, 3
,
,},
WANG Xiaobei²,
ZHANG Desheng²,
YU Zhongjun^{2, 3}

1.
School of Information and Communication Engineering, University of Electronic Science and Technology of China　Chengdu　611731
2.
Aerospace Information Research Institute, Chinese Academy of Sciences　Haidian Beijing　100094
3.
School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences　Huairou Beijing　100049

Received Date: 2022-01-27
Rev Recd Date: 2022-06-11

Available Online: 2023-01-13

Publish Date: 2023-01-25

Abstract

To solve the problem of low accuracy and high false alarm rate of synthetic aperture radar (SAR) nearshore area vessel detection, a new SAR nearshore area vessel detection method based on improved attention mechanism and rotating frame is proposed. Firstly, the feature extraction capability of the network was enhanced by improving the coordinate attention mechanism and introducing it into the feature extraction network. Secondly, the angle classification head was added and the two-dimensional Gaussian distribution was introduced to calculate the KL divergence between the prediction distribution and the target distribution, so as to evaluate the loss value of the rotating frame and complete the angle information extraction of the target. Then, based on the anchor frameless (AF) mechanism of YOLOX algorithm, the model can be made lightweight and the positioning accuracy can be further improved by reducing the redundancy of candidate frames. Finally, the model was tested on the open dataset Offical - SSDD, and the inference verification was performed on the embedded platform (NVIDIA Jetson AGX Xavier). The calculation parameter of the algorithm model is only 1.14M, and the average detection accuracy of the algorithm model is 18.77%, higher than that of the YOLOX model in the nearshore condition, and the overall detection accuracy reaches 94.2%. The verification results show that the algorithm is suitable for dense ship target detection in any direction in complex scenes and can meet the requirements of real-time processing.
- improved coordinate-attention,
- inshore region,
- rotation anchor,
- SAR,
- ship detection

References

[1]	孙显, 王智睿, 孙元睿, 等. AIR-SARShip-1.0: 高分辨率SAR舰船检测数据集[J]. 雷达学报, 2019, 8(6): 852-862. doi: 10.12000/JR19097 SUN X, WANG Z R, SUN Y R, et al. AIR-SARShip-1.0: High-Resolution SAR ship detection dataset[J]. Journal of Radars, 2019, 8(6): 852-862. doi: 10.12000/JR19097
[2]	REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. [S.l.]: IEEE, 2016: 779-788.
[3]	LIU W, ANGUELOV D, ERHAN D, et al. SSD: Single shot MultiBox detector[C]//European Conference on Computer Vision. Cham: Springer, 2016: 21-37.
[4]	徐英, 谷雨, 彭冬亮, 等. 面向合成孔径雷达图像任意方向舰船检测的改进YOLOv3模型[J]. 兵工学报, 2021, 42(8): 1698-1707. XU Y, GU Y, PENG D L, et al. Improved YOLOv3 model for ship detection of synthetic aperture radar images in any direction[J]Acta Armamentarii, 2021, 42(8): 1698-1707.
[5]	杨杰. 基于深度学习的SAR舰船目标检测方法研究[D]. 秦皇岛: 燕山大学, 2021. YANG J. Research on SAR ship target detection method based on deep learning[D]. Qinhuangdao: Yanshan University, 2021.
[6]	REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149. doi: 10.1109/TPAMI.2016.2577031
[7]	周雪珂, 刘畅, 周滨. 多尺度特征融合与特征通道关系校准的SAR图像船舶检测[J]. 雷达学报, 2021, 10(4): 531-543. doi: 10.12000/JR21021 ZHOU X K, LIU C, ZHOU B. Multi-Scale feature fusion and feature channel relationship calibration for SAR image ship detection[J]. Journal of Radars, 2021, 10(4): 531-543. doi: 10.12000/JR21021
[8]	GE Z, LIU S T, WANG F, et al. YOLOX: Exceeding YOLO series in 2021[EB/OL]. (2021-08-06). https://arxiv.org/abs/2107.08430.
[9]	HU J, SHEN L, ALBANIE S, et al. Squeeze-and-Excitation Networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. [S.l.]: IEEE, 2018: 7132-7141.
[10]	WOO S, PARK J, LEE J Y, et al. CBAM: Convolutional Block Attention Module[C]//Proceedings of the European Conference on Computer Vision (ECCV). Munich: [s.n.], 2018: 3-19.
[11]	HOU Q B, ZHOU D Q, FENG J S. Coordinate attention for efficient mobile network design[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. [S.l.]: IEEE, 2021: 13713-13722.
[12]	WANG Q L, WU B G, ZHU P F, et al. ECA-Net: Efficient channel attention for deep convolutional neural networks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. [S.l.]: IEEE, 2020: 11531-11539.
[13]	PARK J, WOO S, LEE J Y, et al. BAM: Bottleneck attention module[EB/OL]. [2022-01-23]. https://arxiv.org/abs/1807.06514.
[14]	YANG X, YANG X J, YANG J R, et al. Learning high-precision bounding box for rotated object detection via kullback-leibler divergence[J]. Advances in Neural Information Processing Systems, 2021, 34: 18381-18394.
[15]	ZHANG T W, ZHANG X L, LI J, et al. Sar ship detection dataset (ssdd): Official release and comprehensive data analysis[J]. Remote Sensing, 2021, 13(18): 3690. doi: 10.3390/rs13183690
[16]	YANG X, YAN J C. Arbitrary-Oriented object detection with circular smooth label[C]//European Conference on Computer Vision. Cham: Springer, 2020: 677-694.
[17]	阮晨, 郭浩, 安居白. 复杂背景下SAR近岸舰船检测[J]. 中国图象图形学报, 2021, 26(5): 1058-1066. RUAN C, GUO H, AN J B. SAR nearshore ship detection under complex background[J]. Journal of Image and Graphics, 2021, 26(5): 1058-1066.
[18]	REDMON J, FARHADI A. YOLOv3: An incremental improvement[EB/OL]. [2022-01-20]. https://arxiv.org/abs/1804.02767.
[19]	ZHU X K, LYU S C, XU W, et al. TPH-YOLOv5: Improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. [S.l.]: IEEE, 2021: 2778-2788.

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(14) / Tables(2)

Get Citation

PDF

XML

Article Metrics

Article views(4449) PDF downloads(95) Cited by()

Proportional views

HTML

合成孔径雷达(synthetic aperture radar, SAR)具有全天时、全天候对地观测能力，针对广阔的海洋区域可以大范围地获取海面SAR图像，有效探测海面船只，对我国海洋事业起着至关重要的作用。但是SAR图像在视觉上是灰度图像，目标纹理特征不明显，导致基于SAR图像的目标信息提取困难。同时，由于SAR相干成像的特点，近岸区域中存在的各类金属建筑会产生与船只特征相近的强散射点，仅靠人工进行目标检测难度很高，效率较低，容易产生误检现象，且船只目标具有不定朝向、密集排列特点，易产生漏检现象，因此近岸区域下的SAR图像船只检测是一个具有挑战性的问题。

传统的SAR图像船只目标检测方法多采用恒虚警率法，其依赖于人工建模，通过设计海杂波模型提取船只特征，无法有效区分船只与岛屿、近岸建筑等虚警目标，因此在背景复杂的近岸情形下效果不佳^[1]。随着深度学习技术的不断发展，其在SAR图像目标检测方面的应用具有良好的发展前景，目前基于深度学习方法进行SAR船只检测大体分为两个技术路线，前者对以YOLO^[2]和SSD^[3]为代表的单阶段目标检测算法进行适应性改进，以实现实时高效船只检测。如文献[4]针对船只在近岸场景下方向多变问题，对YOLOv3算法进行改进，结合水平框和旋转框的预测结果实现船只目标方位角估计，提升了YOLOv3算法在复杂场景下的船只检测效果。文献[5]针对单阶段目标检测算法在复杂场景下船只检测性能不佳问题，在YOLOv3上引入了通道注意力机制，通过加强各网络层通道间的特征融合，提高模型在近岸场景下的检测性能。后者以Faster R-CNN^[6]为代表的双阶段目标检测算法进行适应性改进，以实现高精度船只目标检测，如文献[7]通过多尺度特征融合与特征通道校准方法，提高Faster-RCNN算法在复杂场景下的船只目标检测性能。

单阶段目标检测算法由于缺少了双阶段算法的精细处理，在面对SAR图像中尺度大小不一、视觉特征不明显的船只目标时表现不佳。另外，单阶段目标检测算法YOLOv3作为Anchor Based算法虽然在一定程度上缓解了双阶段目标检测算法，如在Faster R-CNN中选择性搜索带来的候选框计算量爆炸的问题，但其每个网格中不同尺寸锚框的生成依赖于人工参数设置，目标的定位精度严重依赖于手动调参。同时为兼顾目标检测实时处理平台资源功耗受限等情形，需要避免主流算法模型参数量大、网络推理耗时长的问题。

针对以上问题，本文提出了一种基于改进YOLOX的SAR近岸区域密集船只高精度检测方法，有效地解决SAR近岸情形下算法模型准确率低、虚警率高且参数冗余问题，主要创新点如下：

1) 改进坐标注意力机制，进一步提升对空间域信息的学习，并引入至特征提取网络，提升网络对船只目标信息的聚焦，实现目标信息的有效提取；

2) 采用旋转目标框进行目标检测，增加角度分类头学习目标的角度信息，并通过将旋转目标框转化为二维高斯分布的方法，计算预测分布与目标分布的KL散度作为旋转框损失值训练网络，提升网络在近岸区域下的检测精度；

3) 基于YOLOX的Anchor Free机制，结合上述两点设计，有效减少由于角度分类头导致的冗余候选框激增问题，避免冗余框带来的无效计算。

3. 结束语

本文提出了基于改进坐标注意力机制与旋转目标框的单阶段检测网络方法。在构建特征提取网络时加入改进坐标注意力机制准确提取感兴趣区域，同时选用旋转目标框获得船只方位角信息，提高检测精度。经offical-ssdd数据集实验验证，改进的YOLOX检测算法的检测精度在近岸情形下较YOLOX模型提升了18.77%，与主流YOLOv5检测算法相比，本文模型计算参数量减少了84.2%，推理耗时缩减至1/3，总体检测精度提升了4.1%，满足实时处理需求，验证了本方法的有效性。本文方法虽较基于水平框检测方法能得到更精准的目标框，但对SAR图像近岸场景中的大船只目标中心定位仍有待进一步提升。后续仍需进一步扩充数据集，优化网络结构，减小网络对近岸船只的定位偏移量。

Reference (19)

[1]	孙显, 王智睿, 孙元睿, 等. AIR-SARShip-1.0: 高分辨率SAR舰船检测数据集[J]. 雷达学报, 2019, 8(6): 852-862.	SUN X, WANG Z R, SUN Y R, et al. AIR-SARShip-1.0: High-Resolution SAR ship detection dataset[J]. Journal of Radars, 2019, 8(6): 852-862.
[2]	REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. [S.l.]: IEEE, 2016: 779-788.
[3]	LIU W, ANGUELOV D, ERHAN D, et al. SSD: Single shot MultiBox detector[C]//European Conference on Computer Vision. Cham: Springer, 2016: 21-37.
[4]	徐英, 谷雨, 彭冬亮, 等. 面向合成孔径雷达图像任意方向舰船检测的改进YOLOv3模型[J]. 兵工学报, 2021, 42(8): 1698-1707.	XU Y, GU Y, PENG D L, et al. Improved YOLOv3 model for ship detection of synthetic aperture radar images in any direction[J]Acta Armamentarii, 2021, 42(8): 1698-1707.
[5]	杨杰. 基于深度学习的SAR舰船目标检测方法研究[D]. 秦皇岛: 燕山大学, 2021.	YANG J. Research on SAR ship target detection method based on deep learning[D]. Qinhuangdao: Yanshan University, 2021.
[6]	REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149.
[7]	周雪珂, 刘畅, 周滨. 多尺度特征融合与特征通道关系校准的SAR图像船舶检测[J]. 雷达学报, 2021, 10(4): 531-543.	ZHOU X K, LIU C, ZHOU B. Multi-Scale feature fusion and feature channel relationship calibration for SAR image ship detection[J]. Journal of Radars, 2021, 10(4): 531-543.
[8]	GE Z, LIU S T, WANG F, et al. YOLOX: Exceeding YOLO series in 2021[EB/OL]. (2021-08-06). https://arxiv.org/abs/2107.08430.
[9]	HU J, SHEN L, ALBANIE S, et al. Squeeze-and-Excitation Networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. [S.l.]: IEEE, 2018: 7132-7141.
[10]	WOO S, PARK J, LEE J Y, et al. CBAM: Convolutional Block Attention Module[C]//Proceedings of the European Conference on Computer Vision (ECCV). Munich: [s.n.], 2018: 3-19.
[11]	HOU Q B, ZHOU D Q, FENG J S. Coordinate attention for efficient mobile network design[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. [S.l.]: IEEE, 2021: 13713-13722.
[12]	WANG Q L, WU B G, ZHU P F, et al. ECA-Net: Efficient channel attention for deep convolutional neural networks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. [S.l.]: IEEE, 2020: 11531-11539.
[13]	PARK J, WOO S, LEE J Y, et al. BAM: Bottleneck attention module[EB/OL]. [2022-01-23]. https://arxiv.org/abs/1807.06514.
[14]	YANG X, YANG X J, YANG J R, et al. Learning high-precision bounding box for rotated object detection via kullback-leibler divergence[J]. Advances in Neural Information Processing Systems, 2021, 34: 18381-18394.
[15]	ZHANG T W, ZHANG X L, LI J, et al. Sar ship detection dataset (ssdd): Official release and comprehensive data analysis[J]. Remote Sensing, 2021, 13(18): 3690.
[16]	YANG X, YAN J C. Arbitrary-Oriented object detection with circular smooth label[C]//European Conference on Computer Vision. Cham: Springer, 2020: 677-694.
[17]	阮晨, 郭浩, 安居白. 复杂背景下SAR近岸舰船检测[J]. 中国图象图形学报, 2021, 26(5): 1058-1066.	RUAN C, GUO H, AN J B. SAR nearshore ship detection under complex background[J]. Journal of Image and Graphics, 2021, 26(5): 1058-1066.
[18]	REDMON J, FARHADI A. YOLOv3: An incremental improvement[EB/OL]. [2022-01-20]. https://arxiv.org/abs/1804.02767.
[19]	ZHU X K, LYU S C, XU W, et al. TPH-YOLOv5: Improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. [S.l.]: IEEE, 2021: 2778-2788.

算法	近岸区域			远海区域			总计
算法	P/%	R/%	Ap50/%	P/%	R/%	Ap50/%	Ap50/%
YOLOX	0.6686	0.9127	0.6593	0.9582	0.9813	0.9792	0.8788
CA_YOLOX	0.8023	0.9262	0.7984	0.9920	0.9893	0.9907	0.9306
BCA_YOLOX	0.8547	0.8033	0.8262	0.9813	0.9601	0.9790	0.9351
R_YOLOX	0.7442	0.7485	0.7333	0.9413	0.9439	0.9374	0.8728
本文算法	0.8488	0.9799	0.8470	0.9893	0.9867	0.9870	0.9426

算法	参数量×10⁶	Ap50/%	网络推理耗时/ms
YOLOX	0.91	0.879	8
YOLOV3	36.3	0.871	28
YOLOV5	7.2	0.901	30
Faster RCNN	272.7	0.890	119
本文算法	1.14	0.942	10

Improved YOLOX SAR Near-Shore Area Ship Detection Method

doi: 10.12178/1001-0548.2022039

Abstract

References

Proportional views

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Related

Proportional views