基于改进YOLOv7的矿井人员检测算法

邵小强; 李鑫; 杨永德; 原泽文; 杨涛

doi:10.12178/1001-0548.2023163

基于改进YOLOv7的矿井人员检测算法

doi: 10.12178/1001-0548.2023163

邵小强^{1, 2},
李鑫^{1, 2, ,},
杨永德^{1, 2},
原泽文^{1, 2},
杨涛^{1, 2}

1.
西安科技大学电气与控制工程学院，西安 710054
2.
西安市电气设备状态检测与供电安全重点实验室，西安 710054

基金项目: 国家自然科学基金（52174198）

详细信息

作者简介:
邵小强，博士，副教授，主要从事深度学习、目标检测方面的研究

通讯作者: 通信作者Email：1187751601@qq.com

中图分类号: TD76

Mine personnel detection algorithm based on improved YOLOv7

SHAO Xiaoqiang^{1, 2},
LI Xin^{1, 2
, ,},
YANG Yongde^{1, 2},
YUAN Zewen^{1, 2},
YANG Tao^{1, 2}

1.
College of Electrical and Control Engineering, Xi’an University of Science and Technology, Xi’an 710054, China
2.
Xi’an Key Laboratory of Electrical Equipment Condition Monitoring and Power Supply Security. Xi’an 710054, China

摘要: 矿井人员的实时检测是建设智慧矿山必不可少的内容，通过视频监测井下人员，从而实现危险区域预警及联动控制，对于矿井安全生产具有重要意义。现阶段可见光图像识别技术针对井下昏暗环境中人员的辨识还有待完善。针对井下光照不均、煤尘干扰严重导致监控视频存在噪声多、图像模糊等问题，提出一种改进YOLOv7的矿井人员检测算法。该算法在YOLOv7的基础上进行改进，首先针对ELAN模块直接拼接形成通道隔离的问题，提出基于通道重组与特征关注的复杂场景检测方式：使用ShuffleNetV2作为主干网络，减少参数量，提升算法推理速度，促进通道间的信息流动；将Swin Transformer注意力机制引入shuffle_block中，提升图像中人员的关注度，抑制复杂环境对人员检测的干扰，同时Transformer优异的全局感受野有利于遮挡目标的检测；其次针对特征融合结果未侧重预期目标且模型缺乏针对性策略提升小目标检测性能，在颈部多尺度融合网络添加ACmix模块，兼顾全局特征和局部特征，提升了算法对小目标的检测能力；最后引入Efficient IOU Loss提升算法收敛速度的同时减小目标框及先验框高度和宽度的差值，实现更加精准的定位。通过公开行人数据集及自建矿井人员检测数据集验证表明：该算法较YOLOv7模型相比检测精度提升了3.1%，达到89.4%；召回率提升了3.8%，达到86.4%；速度提升了15.8%，达到68.8FPS；满足矿井人员实时检测的工作要求。
- 矿井人员检测 /
- YOLOv7 /
- Swin-Transformer /
- ACmix /
- Efficient IOU Loss
Abstract: Real-time detection of mine personnel is an essential part of the construction of intelligent mine. It is of great significance to realize early warning and linkage control of dangerous areas by monitoring underground personnel through video, which is of great significance for mine safety production. At present, the visible light image recognition technology needs to be improved for the identification of personnel in the dim environment of underground coal mine. Aiming at the problems of more noise and image blur in the monitoring video caused by uneven illumination and serious coal dust interference in the underground, this paper proposes an improved YOLOv7 mine personnel detection algorithm. Firstly, aiming at the problem of channel isolation caused by direct splicing of ELAN modules, a complex scene detection method based on channel reorganization and feature attention is proposed. The ShuffleNetV2 is used as the backbone network to reduce the number of parameters, improve the reasoning speed of the algorithm, and promote the information flow between channels. At the same time, the Swin Transformer attention mechanism is introduced into shuffle_block to improve the attention of people in the image and suppress the interference of complex environment on personnel detection. At the same time, the excellent global receptive field of Transformer is conducive to the detection of occlusion targets. Secondly, in view of the fact that the feature fusion results did not focus on the expected target and the model lacked targeted strategies to improve the detection performance of small targets, the ACmix module was added to the neck multi-scale fusion network to take into account both global features and local features, which improved the detection ability of the algorithm for small targets. Finally, Efficient IOU Loss is introduced to improve the convergence speed of the algorithm and reduce the difference between the height and width of the target frame and the prior frame to achieve more accurate positioning. Through the verification of public pedestrian data sets and self-built mine personnel detection data sets, it is shown that the detection accuracy of the proposed algorithm is 3.1% higher than that of the YOLOv7 model, reaching 89.4%. The recall rate increased by 3.8% to 86.4%. A 15.8% speedup of 68.8FPS Meet the mine personnel real-time detection work requirements.
- mine personnel detection /
- YOLOv7 /
- Swein-Transformer /
- ACmix /
- Efficient IOU Loss

图 1 STR_shuffleNet基本单元

下载: 全尺寸图片幻灯片

图 2 ACmix 结构

下载: 全尺寸图片幻灯片

图 3 本文目标检测算法框架

下载: 全尺寸图片幻灯片

图 4 准确率曲线

下载: 全尺寸图片幻灯片

图 5 召回率曲线

下载: 全尺寸图片幻灯片

图 6 损失值曲线

下载: 全尺寸图片幻灯片

图 7 主流算法检测结果

下载: 全尺寸图片幻灯片

表 1 改进ShuffleNetV2结构

层数	输出大小	核大小	步长	重复	通道数
Image	224$ \times $224				3
Conv1	112$ \times $112	3$ \times $3	2	1	24
DW conv	56$ \times $56	3$ \times $3	2	1	24
Stage2	28$ \times $28		2	1	116
Stage2	28$ \times $28		1	3	116
Stage3	14$ \times $14		2	1	232
Stage3	14$ \times $14		1	7	232
Stage4	7$ \times $7		2	1	464
Stage4	7$ \times $7		1	3	464
Conv5	7$ \times $7	1$ \times $1	1	1	1024
Global pooling	1$ \times $1	7$ \times $7

下载: 导出CSV

表 2 Swin-Transformer模型参数

	Output size	Swin-Transformer
Stage1	4$ \times $(56$ \times $56)	Concat 4$ \times $4, 96-d, LN
Stage1	4$ \times $(56$ \times $56)	$ \left[ {\begin{array}{*{20}{c}} {win.sz.}&{7 \times 7} \\ {\dim 96}&{head3} \end{array}} \right] \times 2 $
Stage2	8$ \times $(28$ \times $28)	Concat 2$ \times $2, 192-d, LN
Stage2	8$ \times $(28$ \times $28)	$\left[ {\begin{array}{*{20}{c}} {win.sz.}&{7 \times 7} \\ {\dim {\text{192}}}&{head{\text{6}}} \end{array}} \right] \times 2$
Stage3	16$ \times $(14$ \times $14)	Concat 2$ \times $2, 384-d, LN
Stage3	16$ \times $(14$ \times $14)	$\left[ {\begin{array}{*{20}{c}} {win.sz.}&{7 \times 7} \\ {\dim {\text{384}}}&{head{\text{12}}} \end{array}} \right] \times {\text{6}}$
Stage4	32$ \times $(7$ \times $7)	Concat 2$ \times $2, 768-d, LN
Stage4	32$ \times $(7$ \times $7)	$\left[ {\begin{array}{*{20}{c}} {win.sz.}&{7 \times 7} \\ {\dim {\text{768}}}&{head{\text{24}}} \end{array}} \right] \times {\text{2}}$

下载: 导出CSV

表 3 常见检测算法实验结果

Model	Parameter/M	FPS	$ M_{s} $	$ M_{r} $	$ M_{ \pm} $	$ M_{f} $	mAP@0.5
Faster CNN	68.2	41.6	0.842	0.809	0.101	0.133	0.833
SDD	82.1	32.9	0.850	0.855	0.214	0.125	0.864
YOLOv4	24.7	55.1	0.829	0.797	0.232	0.095	0.822
YOLOv5s	13.9	86.4	0.791	0.754	0.355	0.114	0.799
YOLOv5m	19.3	67.3	0.819	0.821	0.217	0.094	0.811
YOLOv5l	28.7	56.4	0.833	0.865	0.157	0.078	0.827
YOLOv7	26.3	59.4	0.857	0.832	0.179	0.097	0.849

下载: 导出CSV

表 4 消融结果

Model	ShuffleNetV2	Swin-Transformer	ACmix	EIOU	Precision	Recall	mAP	FPS
1					0.867	0.832	0.849	59.4
2	√				0.841	0.812	0.824	79.9
3	√	√			0.859	0.829	0.833	70.3
4	√		√		0.853	0.845	0.831	69.9
5	√			√	0.842	0.843	0.826	81.1
6	√	√	√		0.889	0.853	0.877	65.9
7	√	√	√	√	0.894	0.864	0.882	68.8

下载: 导出CSV

表 5 公开数据集实验结果

数据集	性能指标	YOLOv7					Our
数据集	性能指标	大	中	小	总	大	中	小	总
Caltech Pedestrian Detection	Precision	0.884	0.882	0.865	0.877	0.908	0.903	0.901	0.904
	Recall	0.849	0.846	0.837	0.844	0.890	0.870	0.874	0.878
	mAP	0.873	0.875	0.847	0.865	0.884	0.882	0.880	0.882
INRIA Person Dataset	Precision	0.814	0.810	0.788	0.804	0.856	0.853	0.850	0.853
	Recall	0.850	0.844	0.829	0.841	0.864	0.865	0.860	0.863
	mAP	0.803	0.813	0.775	0.797	0.852	0.852	0.846	0.850
自建矿井人员检测数据集	Precision	0.879	0.869	0.853	0.867	0.895	0.894	0.893	0.894
	Recall	0.831	0.839	0.826	0.832	0.869	0.860	0.863	0.864
	mAP	0.863	0.853	0.831	0.849	0.883	0.883	0.880	0.882

下载: 导出CSV

[1]	ZHANG K, KANG L, CHEN X, et al. A review of intelli gent unmanned mining current situation and development trend[J]. Energies, 2022, 15(2): 513. doi: 10.3390/en15020513
[2]	单鹏飞, 李晨炜, 来兴平等. 模拟暗湿工况下煤矸混合体态势热敏图像精准辨识实验[J/OL]. 煤炭学报: 1-12 3-04-14]. SHAN Pengfei, LI Chenwei, LAI Xingping, et al. Experiment on Accurate identification of thermal image of coal-gangue mixture under a simulated dusky and wet condition[J/OL]. Journal of china Coal Society, 1-12[2023-04-14].
[3]	HE K M, GKIOXARI G, DOLLÁR P, et al. Mask R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision. New York: IEEE, 2017: 2980-2988.
[4]	ZHU H G. An efficient lane line detection method based on computer vision[J]. Journal of Physics:Conference Series, 2021, 1802(3): 032006. doi: 10.1088/1742-6596/1802/3/032006
[5]	LI X Y, WANG S, LIU B, et al. Improved YOLOv4 network using infrared images for personnel detection in coal mines[J]. Journal of Electronic Imaging, 2022, 31(1): 013017.
[6]	DU Y X, TONG M M, ZHOU L L, et al. Edge detection based on Retinex theory and wavelet multiscale product for mine images[J]. Applied Optics, 2016, 55(34): 9625-9637. doi: 10.1364/AO.55.009625
[7]	寇发荣, 肖伟, 何海洋等. 基于改进YOLOv5的煤矿井下目标检测研究[J]. 电子与信息学报, 2023, 45(07): 2642-2649. KOU Farong, XIAO Wei, HE Haiyang, et al. Research on tarage detection in underground coal mines based on improved YOLOv5[J]. Journal of Electronics & Information Technology, 2023, 45(07): 2642-2649.
[8]	邵小强, 李鑫, 杨涛等. 改进YOLOv5s和DeepSORT的井下人员检测及跟踪算法[J]. 煤炭科学技术, 2023, 51(10): 291-301. SHAO Xiaoqiang, LI Xin, YANG Tao, et al. Underground personnel detection and tracking based on improved YOLOv5s and DeepSORT[J/OL]. Coal Science and Technology, 2023, 51(10): 291-301.
[9]	李江昀, 赵义凯, 薛卓尔, 等. 深度神经网络模型压缩综述[J]. 工程科学学报, 2019, 41(10): 1229-1239. LI J Y, ZHAO Y K, XUE Z E, et al. A survey of model compression for deep neural networks[J]. Chinese Journal of Engineering, 2019, 41(10): 1229-1239.
[10]	WANG C Y, BOCHKOVSKIY A, LIAO H Y M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[EB/OL]. [2023-06-01]. http://arxiv.org/abs/2207.02696.
[11]	MA N N, ZHANG X Y, ZHENG H T, et al. ShuffleNet V2: practical guidelines for efficient CNN architecture design[C]//European Conference on Computer Vision. Cham: Springer, 2018: 122-138.
[12]	LIU Z, LIN Y T, CAO Y, et al. Swin transformer: Hierarchical vision transformer using shifted windows[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. New York: IEEE, 2021: 9992-10002.
[13]	PAN X R, GE C J, LU R, et al. On the integration of self-attention and convolution[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE, 2022: 805-815.
[14]	CHEN C Y, LIU M Y, TUZEL O, et al. R-CNN for small object detection[C]//Asian Conference on Computer Vision. Cham: Springer, 2017: 214-230.
[15]	ZHANG Y F, REN W Q, ZHANG Z, et al. Focal and efficient IOU loss for accurate bounding box regression[J]. Neurocomputing, 2022, 506(C): 146-157.
[16]	REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE, 2016: 779-788.
[17]	REDMON J, FARHADI A. YOLO9000: Better, faster, stronger[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE, 2017: 6517-6525.
[18]	REDMON J, FARHADI A. YOLOv3: An incremental improvement[EB/OL]. [2018-04-08] https://arxiv.org/abs/1804.02767
[19]	BOCHKOVSKIY A, WANG C Y, LIAO H Y M. YOLOv4: Optimal speed and accuracy of object detection[EB/OL]. [2023-06-01]. http://arxiv.org/abs/2004.10934.
[20]	SONG Q S, LI S B, BAI Q, et al. Object detection method for grasping robot based on improved YOLOv5[J]. Micromachines, 2021, 12(11): 1273. doi: 10.3390/mi12111273
[21]	DING X H, ZHANG X Y, MA N N, et al. RepVGG: Making VGG-style ConvNets great again[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE, 2021: 13728-13737.
[22]	张旭, 周云成, 刘忠颖, 等. 基于改进ShuffleNet V2模型的苹果叶部病害识别及应用[J]. 沈阳农业大学学报, 2022, 53(1): 110-118. ZHANG X, ZHOU Y C, LIU Z Y, et al. Identification and application of apple leaf diseases based on improved ShuffleNet V2 model[J]. Journal of Shenyang Agricultural University, 2022, 53(1): 110-118.
[23]	SANDLER M, HOWARD A, ZHU M L, et al. MobileNetV2: Inverted residuals and linear bottlenecks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE, 2018: 4510-4520.
[24]	XIA Z F, PAN X R, SONG S J, et al. Vision transformer with deformable attention[EB/OL]. [2023-06-01]. http://arxiv.org/abs/2201.00520.
[25]	NIU Z Y, ZHONG G Q, YU H. A review on the attention mechanism of deep learning[J]. Neurocomputing, 2021, 452: 48-62. doi: 10.1016/j.neucom.2021.03.091
[26]	WANG C Y, YEH I H, LIAO H Y M. You only learn one representation: Unified network for multiple tasks[EB/OL]. [2023-06-01] . http://arxiv.org/abs/2105.04206.
[27]	RISHAV, SCHUSTER R, BATTRAWY R, et al. ResFPN: Residual skip connections in multi-resolution feature pyramid networks for accurate dense pixel matching[C]//Proceedings of the 25th International Conference on Pattern Recognition. New York: IEEE, 2021: 180-187.
[28]	ZHENG Z H, WANG P, REN D W, et al. Enhancing geometric factors in model learning and inference for object detection and instance segmentation[J]. IEEE Transactions on Cybernetics, 2022, 52(8): 8574-8586. doi: 10.1109/TCYB.2021.3095305
[29]	ZHANG Y F, REN W Q, ZHANG Z, et al. Focal and efficient IOU loss for accurate bounding box regression[J]. Neurocomputing, 2022, 506(C): 146-157.
[30]	REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[C]//Proceedings of the IEEE Transactions on Pattern Analysis and Machine Intelligence. New York: IEEE, 2017: 1137-1149.
[31]	LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot MultiBox detector[C]//European Conference on Computer Vision. Cham: Springer, 2016: 21-37.

[1]	张桃红, 郭学强, 郑瀚, 罗继昌, 王韬, 焦力群, 唐安莹. Dual-Path Vision Transformer用于急性缺血性脑卒中辅助诊断 . 电子科技大学学报, doi: 10.12178/1001-0548.2023081
[2]	邵延华, 张兴平, 张晓强, 楚红雨, 吴亚东. 联合结构重参数和YOLOv5的航拍红外目标检测 . 电子科技大学学报, doi: 10.12178/1001-0548.2022070
[3]	郭磊, 王邱龙, 薛伟, 郭济. 基于改进YOLOv5的小目标检测算法 . 电子科技大学学报, doi: 10.12178/1001-0548.2021235
[4]	杨政, 原子霞, 贾祖瑶. 基于迁徙数据估计武汉感染新型冠状病毒的人员数量 . 电子科技大学学报, doi: 10.12178/1001-0548.2020030
[5]	张涛, 谢红, 黄少滨. 多Agent交互策略模型检测方法 . 电子科技大学学报, doi: 10.3969/j.issn.1001-0548.2016.05.016
[6]	任金胜, 贾海涛. 基于改进目标检测能量项的联合语义分割和目标检测 . 电子科技大学学报, doi: 10.3969/j.issn.1001-0548.2014.05.020
[7]	吴丹, 危胜军. 基于模型检测的策略冲突检测方法 . 电子科技大学学报, doi: 10.3969/j.issn.1001-0548.2013.05.019
[8]	彭云峰, 何模雄, 隆克平. 入侵检测灰色空间模型及应用 . 电子科技大学学报, doi: 10.3969/j.issn.1001-0548.2012.03.025
[9]	李最雄, 杨涛, 汪万福, 谌文武. 西藏壁画空鼓病害的探地雷达检测 . 电子科技大学学报, doi: 10.3969/j.issn.1001-0548.2010.06.013
[10]	王洪, 刘昌忠, 汪学刚, 吴宏刚. S模式前导脉冲检测方法 . 电子科技大学学报, doi: 10.3969/j.issn.1001-0548.2010.04.002
[11]	杨宏宇, 朱丹, 谢丰, 谢丽霞. 入侵异常检测研究综述 . 电子科技大学学报, doi: 10.3969/j.issn.1001-0548.2009.05.014
[12]	刘才铭, 张雁, 赵辉, 彭凌西, 曾金全. 多级免疫检测器集在分布式入侵检测中的应用 . 电子科技大学学报,
[13]	彭静. 数字水印算法检测标准的研究 . 电子科技大学学报,
[14]	陆庆, 周世杰, 秦志光, 吴春江. 对等网络流量检测技术 . 电子科技大学学报,
[15]	程永新, 许家珆, 陈科. 一种新型入侵检测模型及其检测器生成算法 . 电子科技大学学报,
[16]	薛书文, 雷雨, 陈习权, 祖小涛. 脉冲红外热成像无损检测的物理检测机理 . 电子科技大学学报,
[17]	王田. SMSC负荷状态检测方法研究 . 电子科技大学学报,
[18]	曹嘉毅, 陈中柘. 热丝法检测装置研究 . 电子科技大学学报,
[19]	赵志钦, 王建国, 黄顺吉. SAR图像的边沿检测方法研究 . 电子科技大学学报,
[20]	尹治本. 一种技术人员结构的评价模型及算法研究 . 电子科技大学学报,

点击查看大图

图(7) / 表(5)

计量

文章访问数: 268
HTML全文浏览量: 82
PDF下载量: 3
被引次数: 0

全文HTML

目前全球煤矿开采正由传统机械化开采向智能化开采过渡，文献[1]提出智能化煤矿系统架构，认为矿井目标检测是煤矿智能化高速通信及信息获取的基础，且应以人员检测为主。由于矿井使用人工光源照明且井下煤尘干扰严重，导致监控图像存在光照不均、细节模糊等问题^[2]。工作人员无法长时间有效对视频进行多场景监控，井下作业人员的实际位置、工作情况等无法及时反馈到控制室，因此井下作业存在很大的危险性。

当前目标检测算法分为传统目标检测与深度神经网络^[3]两大类，传统目标检测需要手工设计特征，使用滑动窗口的方式搜索图像，最终采用分类器进行分类。此类算法存在手工设计特征鲁棒性差，存在窗口冗余等问题，导致传统检测方法逐渐被深度神经网络所取代^[4]。文献[5]使用YOLOv4针对矿井红外图像进行人员检测，通过迁移训练提升模型的泛化性，但需使用超分辨率卷积网络对红外图像进行预处理，导致模型整体参数增加，使得井下设备无法提供足够的计算量。文献[6]提出一种基于Retinex理论和小波多尺度的边缘检测算法，从低照度矿井图像中获取边缘图像，向矿用巡检机器人提供环境信息，该算法具有良好的实时性，但易受外界环境干扰，鲁棒性较差。文献[7]使用YOLOv5对矿井目标进行检测，采用轻量化主干网络加速模型的推理速度，使模型保持一定精度的同时达到实时检测标准，但是对于遮挡目标检测效果不佳。文献[8]使用YOLOv5和Deepsort算法实现井下人员检测及跟踪，引入跟踪模型增强了模型的抗遮挡能力，并且部署于嵌入式平台实现了矿井人员计数。但是该模型ID转换仍然存在，需要进一步的改进。上述方法专注于改进特征提取网络，适应井下图像特点，得到高精度的图像；或者为了满足参数轻量化，保证实时性，使用轻量化主干网络进行替换，使得模型存在一定的问题，无法保证模型检测精度与速率之间的均衡^[9]。

针对上述问题，本文基于YOLOv7^[10]，提出一种可用于矿井人员实时检测的模型，首先采用ShuffleNetV2^[11]作为模型的主干网络，加强CPU端的推理速度，同时在shuffle_block模块中引入Swin Transformer^[12]注意力机制，增强感受野，提升模型的全局感知能力，优化模型在遮挡情况下的检测效果；其次在颈部多尺度融合网络添加ACmix^[13]模块，通过模块内部的卷积通道和自注意力通道捕捉更多的特征，提升模型对小目标的敏感度^[14]；最后引入Efficient IOU Loss^[15]加速算法训练过程的收敛速度。

3. 结束语

本文提出了一种改进YOLOv7的矿井人员检测算法，在YOLOv7的基础上，使用ShuffleNetV2轻量化主干，同时在shuffle_block中引入Swin Transformer注意力机制，保持一定精度的同时降低了模型计算量；在多尺度融合阶段中层和低层引入ACmix模块帮助网络关注小目标特征，提升模型对小目标的检测能力；引入EIOU损失减小目标框和先验框高度与宽度的差值，实现更加精准的定位。

利用自建矿井人员检测数据集对本文算法进行验证，结果表明，本文算法准确率达89.4%。检测速率达68.8FPS，满足井下人员实时检测的要求，为矿井安全生产提供了良好的保障，对于煤矿开采向智能化开采过渡具有重要意义。

参考文献 (31)

姓名
邮箱
手机号码
标题
留言内容
验证码

留言板

基于改进YOLOv7的矿井人员检测算法

doi: 10.12178/1001-0548.2023163

作者简介:
邵小强，博士，副教授，主要从事深度学习、目标检测方面的研究

通讯作者: 通信作者Email：1187751601@qq.com

Mine personnel detection algorithm based on improved YOLOv7

计量

基于改进YOLOv7的矿井人员检测算法

doi: 10.12178/1001-0548.2023163

1. 西安科技大学电气与控制工程学院，西安 710054

2. 西安市电气设备状态检测与供电安全重点实验室，西安 710054

作者简介:
邵小强，博士，副教授，主要从事深度学习、目标检测方面的研究

通讯作者: 通信作者Email：1187751601@qq.com

English Abstract

Mine personnel detection algorithm based on improved YOLOv7

1. College of Electrical and Control Engineering, Xi’an University of Science and Technology, Xi’an 710054, China

2. Xi’an Key Laboratory of Electrical Equipment Condition Monitoring and Power Supply Security. Xi’an 710054, China

全文HTML

1.1. YOLOv7检测模型

1.2. 基于通道重组与特征关注的复杂场景检测

1.3. 基于自注意力与卷积混合模块的小目标检测

1.4. Efficient IOU Loss

2.1. 实验环境搭建

2.2. 数据集样本

2.3. 评价指标

2.4. 目标检测实验结果与分析

目录

期刊在线

编辑办公

友情链接

留言板

基于改进YOLOv7的矿井人员检测算法

doi: 10.12178/1001-0548.2023163

作者简介: 邵小强，博士，副教授，主要从事深度学习、目标检测方面的研究

通讯作者: 通信作者Email：1187751601@qq.com

Mine personnel detection algorithm based on improved YOLOv7

计量

出版历程

基于改进YOLOv7的矿井人员检测算法

doi: 10.12178/1001-0548.2023163

1. 西安科技大学 电气与控制工程学院，西安 710054 2. 西安市电气设备状态检测与供电安全重点实验室，西安 710054

作者简介: 邵小强，博士，副教授，主要从事深度学习、目标检测方面的研究

通讯作者: 通信作者Email：1187751601@qq.com

English Abstract

Mine personnel detection algorithm based on improved YOLOv7

1. College of Electrical and Control Engineering, Xi’an University of Science and Technology, Xi’an 710054, China 2. Xi’an Key Laboratory of Electrical Equipment Condition Monitoring and Power Supply Security. Xi’an 710054, China

全文HTML

1.1. YOLOv7检测模型

1.2. 基于通道重组与特征关注的复杂场景检测

1.3. 基于自注意力与卷积混合模块的小目标检测

1.4. Efficient IOU Loss

2.1. 实验环境搭建

2.2. 数据集样本

2.3. 评价指标

2.4. 目标检测实验结果与分析

目录

期刊在线

编辑办公

友情链接

作者简介:
邵小强，博士，副教授，主要从事深度学习、目标检测方面的研究

1. 西安科技大学电气与控制工程学院，西安 710054

2. 西安市电气设备状态检测与供电安全重点实验室，西安 710054

作者简介:
邵小强，博士，副教授，主要从事深度学习、目标检测方面的研究

1. College of Electrical and Control Engineering, Xi’an University of Science and Technology, Xi’an 710054, China

2. Xi’an Key Laboratory of Electrical Equipment Condition Monitoring and Power Supply Security. Xi’an 710054, China