多因素引导的行人重识别数据增广方法研究

刘志刚; 张国辉; 高月; 刘苗苗

doi:10.12178/1001-0548.2023056

多因素引导的行人重识别数据增广方法研究

doi: 10.12178/1001-0548.2023056

刘志刚^{1, 2, ,},
张国辉¹,
高月¹,
刘苗苗¹

1.
东北石油大学计算机与信息技术学院，大庆 163318
2.
黑龙江省石油大数据与智能分析重点实验室，大庆 163318

基金项目: 国家自然科学基金（51774090, 42002138）；黑龙江省自然科学基金（LH2020F003）；河北省自然科学基金（D2023107002）；黑龙江省属本科高校团队创新基金（2022TSTD-03）；黑龙江省高等教育教学改革项目（SJGY20210109）

详细信息

作者简介:
刘志刚，博士，教授，主要从事机器学习、计算机视觉方面的研究

通讯作者: 刘志刚，E-mail：dqpilzg@163.com

中图分类号: TP39

Research on Pedestrian Re-Identification Data Augmentation Method Based on Multi-Factor Guidance

1.
School of Computer and Information Technology, Northeast Petroleum University, Daqing 163318, China
2.
Heilongjiang Petroleum Big Data and Intelligent Analysis Key Laboratory, Daqing 163318, China

摘要: 为解决行人重识别研究领域中行人标注图像获取困难的问题，提出一种多因素引导的行人数据增广方法。首先，在生成器网络中设计了一种局部多尺度引导机制，通过特征融合抑制生成图像的局部伪影；其次，提出了长距离相关性引导机制，通过外注意力引导生成图像的长距离依赖，提高生成行人图像的整体视感质量；最后，提出一种抗博弈判别网络，通过嵌入到生成对抗网络，从而构建一种三网络稳定博弈架构模型，增加生成对抗网络训练的稳定性。通过VIPeR、Market-1501、DukeMTMC-reID这3种不同规模数据集的仿真实验，结果表明该方法与目前主流方法相比，mAP与Rank-1精度上均有不同程度的提升，在小规模数据集上的提升较为显著。
- 行人重识别 /
- 生成对抗网络 /
- 数据增广 /
- 局部多尺度 /
- 注意力机制
Abstract: To solve the difficulty in obtaining annotated pedestrian images in the field of pedestrian re-identification research, a novel data augmentation method guided by multi-factor is proposed in this paper. Firstly, a local multi-scale guidance mechanism is designed in the generator network. It can suppress the local artifacts in generated images through feature fusion. Secondly, a long-distance correlation guidance mechanism is proposed to improve the overall visual quality of the generated pedestrian image by guiding the long-distance dependence of the generated image with external attention. Lastly, an adversarial discrimination network is designed and embed into original generative adversarial networks. The three network stability architecture model increases the stability of generative adversarial network training. The experiment are validated on the VIPeR, Market-1501 and DukeMTMC-reID benchmark datasets. The results demonstrate our method outperforms the state-of-the-art with the mAP and rank-1 scores, especially in small-scale datasets.
- person re-identification /
- generative adversial network /
- data augmentation /
- local multi-scale /
- attention mechanism

图 1 MG-DAM模型结构图

下载: 全尺寸图片幻灯片

图 2 LMG与LCG引导机制

下载: 全尺寸图片幻灯片

图 3 模型实验效果对比图

下载: 全尺寸图片幻灯片

图 4 模型多样性样本匹配图

下载: 全尺寸图片幻灯片

表 1 与其他主流方法对比实验数据表 %

方法	VIPeR		Market-1501		DukeMTMC-reID
方法	Rank-1	mAP	Rank-1	mAP	Rank-1	mAP
DeformGAN^[29]	—	—	80.6	61.3	—	—
PTGAN^[30]	62.1	49.5	87.7	75.9	71.6	46.6
AD-Cluster^[16]	—	—	90.3	80.9	75.6	48.9
FFGAN^[17]	—	—	89.4	77.6	76.2	52.0
UnityGAN^[18]	65.3	48.6	91.3	78.3	74.3	49.7
PG2^[19]	64.3	45.2	89.4	75.7	72.2	50.6
VariGAN^[20]	66.2	47.7	89.3	78.8	72.2	51.3
DGNet^[31]	—	—	91.7	84.0	77.2	52.3
ClonedPerson^[32]	—	—	84.5	59.9	—	—
本文	67.8	51.6	91.4	83.2	78.6	52.7

下载: 导出CSV

表 2 局部多尺度特征引导机制LMG验证数据表 %

实验组号	方法	VIPeR		Market-1501		DukeMTMC-reID
实验组号	方法	Rank-1	mAP	Rank-1	mAP	Rank-1	mAP
第一组	GAN^[1]	41.6	—	57.6	49.5	53.6	34.2
	WGAN^[30]	45.5	30.3	68.5	49.9	61.3	37.2
	CycleGAN^[15]	48.3	31.2	70.6	52.1	62.1	38.1
	DualGAN^[33]	44.6	30.0	72.3	51.2	60.0	37.1
第二组	GAN + LMG	47.6	—	57.6	57.6	61.5	36.1
	WGAN + LMG	53.1	35.4	72.7	58.6	64.3	40.3
	CycleGAN + LMG	53.8	36.2	74.5	59.0	65.8	40.6
	DualGAN + LMG	50.2	34.8	71.3	58.6	62.7	40.0

下载: 导出CSV

表 3 长距离相关性引导机制LCG验证数据表 %

实验组号	方法	VIPeR		Market-1501		DukeMTMC-Re-ID
实验组号	方法	Rank-1	mAP	Rank-1	mAP	Rank-1	mAP
第一组	GAN^[1] + SA	44.6	—	—	—	59.3	36.3
	WGAN^[30]+ SA	49.6	31.6	78.6	63.7	63.1	39.0
	CycleGAN^[15] + SA	50.1	32.3	79.3	68.5	63.5	40.0
	DualGAN^[33] + SA	48.7	30.2	79.6	69.3	61.0	39.6
第二组	DAT^[25]	—	—	85.7	74.6	73.2	73.6
第二组	GLFA^[26]	—	—	86.6	75.8	74.2	74.6
第三组	GAN + LCG	50.3	—	72.6	59.4	62.7	38.3
	WGAN + LCG	56.2	38.2	80.6	70.8	66.3	41.7
	CycleGAN^[15] + LCG	56.6	39.5	83.9	73.9	67.3	43.8
	DualGAN + LCG	53.7	37.0	82.4	73.3	65.8	42.5

下载: 导出CSV

[1]	GOODFELLOW I, POUGET-ABADIE J, MIRZA M, et al. Generative adversarial nets[J]. Advances in Neural Information Processing Systems, 2014(2): 2672-2680.
[2]	LI Y J, CHEN Y C, LIN Y Y, et al. , Recover and identify: A generative dual model for cross-resolution person re-identification[C]//Proceedings of the IEEE International Conference on Computer Vision (ICCV). Seoul: IEEE, 2019: 8089-8098.
[3]	CHEN Y B, ZHU X T, GONG S G. Instance-guided context rendering for cross-domain person re-identification[C]//Proceedings of the IEEE International Conference on Computer Vision (ICCV). Seoul: IEEE, 2019: 232-242.
[4]	LIANG W Q, WANG G C, LAI J H. Asymmetric cross-domain transfer learning of person re-identification based on the many-to-many generative adversarial network[J]. Acta Automatica Sinica, 2022, 48(1): 103-120.
[5]	WANG Z X, WANG Z, ZHENG Y Q, et al. Learning to reduce dual-level discrepancy for infrared-visible person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Long Beach: IEEE, 2019: 618-626.
[6]	ZHENG Z D, YANG X D, YU Z D, et al. Joint discriminative and generative learning for person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Long Beach: IEEE, 2019: 2133-2142.
[7]	ZHA Z J, LIU J W, CHEN D, et al. Adversarial attribute-text embedding for person search with natural language query[J]. IEEE Transactions on Multimedia, 2020, 22(7): 1836-1846. doi: 10.1109/TMM.2020.2972168
[8]	ZHENG Z D, ZHENG L, YANG Y. Unlabeled samples generated by GAN improve the person re-identification baseline in vitro[C]//Proceedings of the IEEE International Conference on Computer Vision (ICCV). Venice: IEEE, 2017: 3774-3782.
[9]	RADFORD A, METZ L, CHINTALA S, et al. Unsupervised representation learning with deep convolutional generative adversarial networks[EB/OL]. [2020-12-22]. https://arxiv.org/pdf/1511.06434.pdf.
[10]	WEI L H, ZHANG S L, GAO W, et al. Person transfer GAN to bridge domain gap for person re-identifification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Salt Lake City: IEEE, 2018: 79-88.
[11]	DENG W J, ZHENG L, YE Q X, et al. Image-image domain adaptation with preserved self-similarity and domain-dissimilarity for person re-identifification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Salt Lake City: IEEE, 2018: 994-1003.
[12]	HUANG Y, XU J, WU Q, et al. Multi-pseudo regularized label for generated data in person re-identification[J]. IEEE Transactions on Image Processing, 2018, 28(3): 1391-1403.
[13]	QIAN X L, FU Y W, XIANG T, et al. Pose-normalized image generation for person re-identifification[EB/OL]. [2021-06-22]. https://arxiv.org/pdf/1712.02225.pdf.
[14]	ZHONG Z, ZHENG L, ZHENG Z D, et al. Camera style adaptation for person re-identifification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Salt Lake City: IEEE, 2018: 5157-5166.
[15]	ZHU J Y, PARK T, ISOLA P, et al. Unpaired image-to-image translation using cycle-consistent adversarial networks[C]//Proceedings of the IEEE International Conference on Computer Vision (ICCV). Venice: IEEE, 2017: 2242-2251.
[16]	ZHAI Y P, LU S J, YE Q X, et al. AD-Cluster: Augmented discriminative clustering for domain adaptive person re-identification[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle: IEEE, 2020: 9018-9027.
[17]	ZHANG X W, LYU M Q, LI H. Cross-domain person re-identification based on partial semantic feature invariance[J]. Journal of Beijing University of Aeronautics and Astronautics, 2020, 46(9): 1682-1690.
[18]	LIU C, CHANG X J, SHEN Y D. Unity style transfer for person re-identification[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle: IEEE, 2020: 6886-6895.
[19]	MA L, JIA X, SUN Q, et al. Pose guided person image generation[C]//Proceedings of the Conference on Neural Information Processing Systems (NIPS). Long Beach: NIPS, 2017: 405-415.
[20]	ZHAO B, WU X, CHENG Z Q, et al. Multi-view image generation from a single-view[EB/OL]. [2022-07-15]. https://arxiv.org/pdf/1704.04886.pdf.
[21]	WANG Y, LIANG X, LIAO S. Cloning outfits from real-world images to 3D characters for generalizable person re-identification[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. New Orleans: IEEE, 2022: 4900-4909.
[22]	CHEN T, LIU Y, ZHANG Y, et al. Semi-supervised single-stage controllable GANs for conditional fine-grained image generation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Montreal: IEEE, 2021: 9244-9253.
[23]	CHONG M J, CHU W S, KUMAR A. Retrieve in style: Unsupervised facial feature transfer and retrieval[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Montreal: IEEE, 2021: 3867-3876.
[24]	卢涛, 陈冲, 许若波, 等. 基于边缘增强生成对抗网络的人脸超分辨率重建[J]. 华中科技大学学报:自然科学版, 2020, 48(1): 87-92. LU T, CHEN C, XU R B, et al. Face hallucination based on edge enhanced generative adversarial network[J]. Journal of Huazhong University of Science and Technology, 2020, 48(1): 87-92.
[25]	KWON G, YE J C. Diagonal attention and style-based gan for content-style disentanglement in image generation and translation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Montreal: IEEE, 2021: 13960-13969.
[26]	REN Y, YU X, CHEN J, et al. Deep image spatial transformation for person image generation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle: IEEE, 2020: 7687-7696.
[27]	GRAY D, HAI T. Viewpoint invariant pedestrian recognition with an ensemble of localized features[C]//Proceedings of European Conference on Computer Vision (ECCV). [S.l.]: IEEE, 2008: 3408-3416.
[28]	ZHENG L, SHEN L, LU T, et al. Scalable person re-identification: A benchmark[C]//Proceedings of the IEEE International Conference on Computer Vision (ICCV). Santiago: IEEE, 2015: 1116-1124.
[29]	ALIAKSANDR S, ENVER S, STEPHANE L, et al. Deformable GANs for pose-based human image generation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Salt Lake City: IEEE, 2018: 3408-3416.
[30]	GULRAJANI I, AHMED F, ARJOVSKY M, et al. Improved training of wasserstein gans[EB/OL]. [2022-09-15]. https://arxiv.org/pdf/1704.00028.pdf.
[31]	ZHENG Z, YANG X, YU Z, et al. Joint discriminative and generative learning for person re-identification[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Long Beach: IEEE, 2020: 2133-2142.
[32]	WEI L H, ZHANG S L, GAO W, et al. Person transfer GAN to bridge domain gap for person re-identification[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 79-88.
[33]	YI Z, ZHANG H, TAN P, et al. DualGAN: Unsupervised dual learning for image-to-image translation[J]. IEEE Computer Society, 2017: 2849-2857.

[1]	要媛媛, 付潇, 杨东瑛, 王洁宁, 郑文. 结合全局信息增强的医学领域命名实体识别研究 . 电子科技大学学报, 2024, 53(): 1-10. doi: 10.12178/1001-0548.2023064
[2]	王永, 王淞立, 邓江洲. 基于生成对抗网络的评分可信推荐模型 . 电子科技大学学报, 2024, 53(2): 1-8. doi: 10.12178/1001-0548.2023116
[3]	崔少国, 独潇, 张宜浩. 基于兴趣注意力网络的会话推荐算法 . 电子科技大学学报, 2024, 53(1): 67-75. doi: 10.12178/1001-0548.2022307
[4]	莫太平, 黄巧人, 陈德鸿, 伍锡如, 张向文. 改进可逆缩放网络的图像超分辨率重建 . 电子科技大学学报, 2023, 52(5): 739-746. doi: 10.12178/1001-0548.2022261
[5]	赵云龙, 田生祥, 李岩, 罗龙, 齐鹏文. 基于注意力模型和Soft-NMS的输电线路小目标检测方法 . 电子科技大学学报, 2023, 52(6): 906-914. doi: 10.12178/1001-0548.2022290
[6]	周丰丰, 孙燕杰, 范雨思. 基于miRNA组学的数据增强算法 . 电子科技大学学报, 2023, 52(2): 182-187. doi: 10.12178/1001-0548.2023002
[7]	陈丽, 许思扬, 刘芳, 冯奇, 刘承享, 徐福琛, 田淼, 刘光辉. 基于生成对抗网络的OFDM信号生成 . 电子科技大学学报, 2023, 52(6): 841-850. doi: 10.12178/1001-0548.2022253
[8]	代翔. 基于事件模式及类型的事件检测模型 . 电子科技大学学报, 2022, 51(4): 592-599. doi: 10.12178/1001-0548.2021377
[9]	吴子锐, 杨之蒙, 蒲晓蓉, 徐杰, 曹晟, 任亚洲. 面向特征生成的无监督域适应算法 . 电子科技大学学报, 2022, 51(4): 580-585, 607. doi: 10.12178/1001-0548.2021314
[10]	葛唯益, 程思伟, 王羽, 徐建. 基于双向门控循环神经网络的事件论元抽取方法 . 电子科技大学学报, 2022, 51(1): 100-107. doi: 10.12178/1001-0548.2021153
[11]	郭磊, 王邱龙, 薛伟, 郭济. 基于注意力机制的光线昏暗条件下口罩佩戴检测 . 电子科技大学学报, 2022, 51(1): 123-129. doi: 10.12178/1001-0548.2021222
[12]	毛文清, 徐雅斌. 基于深度图卷积网络的社交机器人识别方法 . 电子科技大学学报, 2022, 51(4): 615-622, 629. doi: 10.12178/1001-0548.2021280
[13]	周丰丰, 牛甲昱. 细胞穿膜肽识别问题的多特征融合卷积网络预测算法 . 电子科技大学学报, 2022, 51(4): 494-499. doi: 10.12178/1001-0548.2021391
[14]	头旦才让, 仁青东主, 尼玛扎西, 于永斌, 邓权芯. 基于改进字节对编码的汉藏机器翻译研究 . 电子科技大学学报, 2021, 50(2): 249-255, 293. doi: 10.12178/1001-0548.2020218
[15]	赵磊, 高联丽, 宋井宽. 面向视觉对话的自适应视觉记忆网络 . 电子科技大学学报, 2021, 50(5): 749-753. doi: 10.12178/1001-0548.2021057
[16]	李响, 严毅, 刘明辉, 刘明. 基于多条件对抗和梯度优化的生成对抗网络 . 电子科技大学学报, 2021, 50(5): 754-760. doi: 10.12178/1001-0548.2020415
[17]	李学明, 岳贡, 陈光伟. 基于多模态注意力机制的图像理解描述新方法 . 电子科技大学学报, 2020, 49(6): 867-874. doi: 10.12178/1001-0548.2019228
[18]	何磊, 李玉霞, 彭博, 吴焕萍. 基于生成对抗网络的无人机图像道路提取 . 电子科技大学学报, 2019, 48(4): 580-585. doi: 10.3969/j.issn.1001-0548.2019.04.016
[19]	陈俊周, 王娟, 龚勋. 基于级联生成对抗网络的人脸图像修复 . 电子科技大学学报, 2019, 48(6): 910-917. doi: 10.3969/j.issn.1001-0548.2019.06.016
[20]	邓钰, 雷航, 李晓瑜, 林奕欧. 用于目标情感分类的多跳注意力深度模型 . 电子科技大学学报, 2019, 48(5): 759-766. doi: 10.3969/j.issn.1001-0548.2019.05.016

点击查看大图

图(4) / 表(3)

计量

文章访问数: 2056
HTML全文浏览量: 498
PDF下载量: 10
被引次数: 0

全文HTML

行人重识别（Person Re-Identification, Re-ID）是在多个摄像头中进行特定目标行人图像的检索问题。近年来，得益于大规模标注数据集和卷积神经网络的拟合能力，Re-ID取得了较大进展。然而，由于行人数据涉及隐私、标注代价等因素，高质量的行人标注图像数据的获取极为困难，目前有监督学习的Re-ID性能很大程度上受制于此。

随着生成对抗网络（Generative Adversarial Networks, GAN）^[1]快速发展，基于GAN的行人数据增广^[2-7]方法引起了重视。文献[8]首次提出将GAN用于Re-ID的研究，通过对DCGAN^[9]生成行人图像，实现对标注数据集的扩充。文献[10-14]使用改进的CycleGAN^[15]进行域之间的行人风格迁移。文献[16]提出AD-Cluster模型，根据源域与目标域图像的语义一致性，增强跨域Re-ID模型的特征表达能力。文献[17]利用FFGAN实现基于増广判别聚类的数据迁移。文献[18]使用UnityGAN学习不同摄像机之间的背景风格差异，生成基于这些差异的平均风格图像，提升Re-ID模型的泛化能力。此外，基于GAN的行人数据增广方法可解决行人姿态与外观发生变化而引起的精度降低问题。文献[19]提出一种基于姿态引导的生成对抗网络（Pose Guided Person Generation Network, PG2），根据给定的行人图像和目标姿态合成任意姿态。文献[20]将变分推理和GAN相结合，提出一种生成人物衣服的模型VariGAN。文献[21]提出ClonedPerson方法，将真实世界的人物图像中的服装克隆到虚拟的三维人物中，进而实现数据增广。

这些基于GAN的数据增广方法，在一定程度上提升了Re-ID模型的性能、降低了手工标注样本的代价。但是，它们更多关注图像风格变换，而忽略生成图像质量给Re-ID模型带来的噪声影响，生成的行人图像有时会面临局部细粒度特征表达较弱、图像整体视感质量降低的问题。虽然此类图像可以在一定程度上提升模型的鲁棒性，但过多低质量图像会在Re-ID模型提取到的特征中融入过多噪声，干扰模型训练的稳定性。

本文经过研究分析，传统GAN生成图像质量较差的原因主要包括：1）受到卷积核尺寸的影响，生成行人图像的局部信息表达、长距离相关性依赖均受到较大限制，进而导致生成图像局部伪影严重、图像整体视感质量欠佳；2）网络训练方法难以保证稳定性，直接影响生成行人的图像质量。在此基础上，本文提出一种基于多因素引导行人图像增广方法（Multi-factor Guidance Data Augmentation Method, MG-DAM）。首先，设计了一种多尺度引导机制（Local Multi-scale Guidance, LMG），通过在生成器网络中使用局部多尺度引导机制，对行人图像进行特征提取与特征融合，从而抑制生成图像的局部伪影，增强生成图像的细粒度特征表达能力；其次，提出一种长距离相关性引导机制（Long-distance Correlation Guidance, LCG），突破卷积核映射关系的限制，增加生成行人图像的长距离依赖；为提升网络训练的稳定性，设计对抗博弈判别网络（Adversarial Discrimination Network, AD），进而更改网络整体训练方式。最后，通过仿真实验证明本文所提算法的有效性。

3. 结束语

本文提出了一种基于多因素引导的行人数据增广方法，通过局部多尺度引导机制LMG增强生成行人图像的局部特征表达能力，抑制生成图像局部伪影的产生；通过长距离相关性引导机制LCG突破了卷积核映射关系的限制，提升生成行人图像的整体视感质量；通过对抗博弈判别网络AD的使用，改变了GAN模型的对抗机制，提升了网络训练的稳定性。在数据集VIPeR、Market-1501和DukeMTMC-reID上的实验结果表明，本文所提方法可以提升Re-ID模型的性能，并且在一定程度上优于多数较为流行的GAN模型，为有效提升高质量的行人图像生成与数据增广提供了研究思路。

参考文献 (33)

姓名
邮箱
手机号码
标题
留言内容
验证码

留言板

多因素引导的行人重识别数据增广方法研究

doi: 10.12178/1001-0548.2023056

作者简介:
刘志刚，博士，教授，主要从事机器学习、计算机视觉方面的研究

通讯作者: 刘志刚，E-mail：dqpilzg@163.com

Research on Pedestrian Re-Identification Data Augmentation Method Based on Multi-Factor Guidance

计量

多因素引导的行人重识别数据增广方法研究

doi: 10.12178/1001-0548.2023056

1. 东北石油大学计算机与信息技术学院，大庆 163318

2. 黑龙江省石油大数据与智能分析重点实验室，大庆 163318

作者简介:
刘志刚，博士，教授，主要从事机器学习、计算机视觉方面的研究

通讯作者: 刘志刚，E-mail：dqpilzg@163.com

English Abstract

Research on Pedestrian Re-Identification Data Augmentation Method Based on Multi-Factor Guidance

1. School of Computer and Information Technology, Northeast Petroleum University, Daqing 163318, China

2. Heilongjiang Petroleum Big Data and Intelligent Analysis Key Laboratory, Daqing 163318, China

全文HTML

1.1. 局部多尺度引导机制LMG

1.2. 长距离相关性引导机制LCG

1.3. 对抗博弈判别网络AD

2.1. MG-DAM与其他主流方法的比较

2.2. LMG有效性验证

2.3. LCG的有效性验证

2.4. AD的有效性验证

目录

期刊在线

编辑办公

友情链接

留言板

多因素引导的行人重识别数据增广方法研究

doi: 10.12178/1001-0548.2023056

作者简介: 刘志刚，博士，教授，主要从事机器学习、计算机视觉方面的研究

通讯作者: 刘志刚，E-mail：dqpilzg@163.com

Research on Pedestrian Re-Identification Data Augmentation Method Based on Multi-Factor Guidance

计量

出版历程

多因素引导的行人重识别数据增广方法研究

doi: 10.12178/1001-0548.2023056

1. 东北石油大学 计算机与信息技术学院，大庆 163318 2. 黑龙江省石油大数据与智能分析重点实验室，大庆 163318

作者简介: 刘志刚，博士，教授，主要从事机器学习、计算机视觉方面的研究

通讯作者: 刘志刚，E-mail：dqpilzg@163.com

English Abstract

Research on Pedestrian Re-Identification Data Augmentation Method Based on Multi-Factor Guidance

1. School of Computer and Information Technology, Northeast Petroleum University, Daqing 163318, China 2. Heilongjiang Petroleum Big Data and Intelligent Analysis Key Laboratory, Daqing 163318, China

全文HTML

1.1. 局部多尺度引导机制LMG

1.2. 长距离相关性引导机制LCG

1.3. 对抗博弈判别网络AD

2.1. MG-DAM与其他主流方法的比较

2.2. LMG有效性验证

2.3. LCG的有效性验证

2.4. AD的有效性验证

目录

期刊在线

编辑办公

友情链接

作者简介:
刘志刚，博士，教授，主要从事机器学习、计算机视觉方面的研究

1. 东北石油大学计算机与信息技术学院，大庆 163318

2. 黑龙江省石油大数据与智能分析重点实验室，大庆 163318

作者简介:
刘志刚，博士，教授，主要从事机器学习、计算机视觉方面的研究

1. School of Computer and Information Technology, Northeast Petroleum University, Daqing 163318, China

2. Heilongjiang Petroleum Big Data and Intelligent Analysis Key Laboratory, Daqing 163318, China