基于纹理和颜色感知距离的对抗样本生成算法

徐明; 蒋奔驰

doi:10.12178/1001-0548.2021058

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名

邮箱

手机号码

标题

留言内容

验证码

基于纹理和颜色感知距离的对抗样本生成算法

杭州电子科技大学网络空间安全学院　杭州　310016

基金项目: 国家自然科学基金(61702150，61803135)

详细信息

作者简介:
徐明(1970-)，男，教授，博导，主要从事网络安全、数字取证及多媒体安全等方面的研究. E-mail：549614989@qq.com

中图分类号: TP182

摘要: 理想的对抗样本不仅要成功欺骗机器学习分类器，同时还应不易被人类视觉感知到差异。传统的算法仅采用L_p范数衡量对抗样本扰动的大小，往往导致视距差异与感官不匹配等问题。该文提出了一种基于纹理和颜色感知距离的对抗样本生成算法(Aho-λ)，其基本原理是尽可能地将扰动嵌入原始图像的高纹理区域，且基于颜色感知距离构建损失函数，从而降低原始图像和对抗样本之间的视距差异，最后利用自适应参数调节算法加快训练的收敛速度。在相近的L_p范数和可迁移性情形下，与DDN和C&W算法相比，该算法生成的对抗样本颜色感知距离更低，而且能以更少的迭代次数更快地生成对抗样本。

关键词:

Abstract: Ideal adversarial examples should not only successfully deceive the machine learning classifier, but also should not easily be perceived by human vision. In the traditional algorithms, only the norm is adopted as a measurement index of the perturbation size of adversarial examples, which usually leads to the difference in the visibility range. In this paper, a method for adversarial examples generation based on the texture and perceptual color distance is developed. The main idea is to embed the perturbation into a high texture area of an image and optimize the perceptual color distance, so as to reduce the difference in the visibility range between the original image and adversarial example. Moreover, an automatic hyperparameter optimization method is employed to accelerate the convergence of backpropagation. Experimental evaluation shows that the proposed algorithm can obtain the smallest L₂ norm and perceptual color distance than other algorithms. Meanwhile, a smaller number of iterations was required to obtain adversarial examples

Key words:

修改比率/%

成功率/%

扰动距离

${\overline {{L_2}} }$

${\overline {{L_\infty }} }$

${\overline {{C_2}} }$

78.3

0.51

25.90

123.31

80.4

0.48

24.73

120.76

85.3

0.52

20.23

102.65

92.3

0.69

19.66

75.78

95.1

0.74

19.48

68.42

96.2

0.82

18.65

90.75

100

0.79

16.78

56.54

100

1.03

17.03

67.52

100

1.11

17.25

82.48

100

1.30

16.89

85.60

方法

迭代次数

成功率/%

扰动距离

${\overline {{L_2}} }$

${\overline {{L_\infty }} }$

${\overline {{C_2}} }$

I-FGSM

–

100.0

2.51

1.59

317.96

C&W

1000

100.0

1.09

8.20

132.86

DDN

100

100.0

1.00

7.84

136.11

300

100.0

0.88

7.58

120.12

1000

100.0

0.82

7.62

111.65

PerC-AL

100

100.0

1.30

11.98

69.49

300

100.0

1.17

13.97

61.21

1000

100.0

1.13

17.04

57.10

Aho-λ

100

100.0

1.21

13.89

70.75

300

100.0

0.81

16.78

56.55

1000

100.0

0.81

16.98

56.52

方法

GoogLeNet

Vgg-16

ResNet-152

k=20

k=40

k=20

k=40

k=20

k=40

I-FGSM

3.4

5.3

6.5

11.9

7.5

9.9

C&W

1.8

2.8

3.9

5.9

4.5

5.1

DDN

1.0

2.0

4.5

6.7

4.3

5.1

PerC-C&W

2.2

3.9

4.3

8.1

5.5

6.5

PerC-AL

1.6

3.4

5.1

7.9

5.3

7.3

Aho-λ

1.2

2.3

4.0

7.0

4.5

5.6

基于纹理和颜色感知距离的对抗样本生成算法

杭州电子科技大学网络空间安全学院　杭州　310016

基金项目: 国家自然科学基金(61702150，61803135)

作者简介:
徐明(1970-)，男，教授，博导，主要从事网络安全、数字取证及多媒体安全等方面的研究. E-mail：549614989@qq.com

收稿日期: 2021-03-01

修回日期: 2021-05-06

网络出版日期: 2021-07-23

刊出日期: 2021-06-28

中图分类号: TP182

关键词:

全文HTML

近年来，深度学习在各个领域被广泛应用，其安全性备受关注，特别是对抗样本^[1]带来了诸多潜在威胁。对抗样本是通过对原始图像添加刻意构造的微小扰动后，使特定的深度学习分类器以高置信度产生一个错误的分类输出。理想的对抗样本不仅能够欺骗机器学习分类器，且其差异应不易被人类视觉感知。

在目前的对抗样本生成算法中，为了保证添加扰动后图像篡改痕迹的不可见性，通常研究人员采用比较公认的标准，即在RGB颜色空间内满足一定的L_p范数约束，用L_p范数衡量对抗样本中扰动的大小。如C&W^[2]、FGSM^[3]及变种(I-FGSM^[4]、RFGSM^[5])、Deepfool^[6]和JSMA^[7]。但范数距离与人类感官差异存在较大的偏差^[8]，采用范数约束优化生成的对抗样本不可避免地会在图像平滑区域出现肉眼可见的异常纹理。

此外在基于迭代优化的对抗样本生成算法中，如C&W^[2]、DDN^[9]等算法的损失函数是由多个损失函数累加，通常引入超参数来表示每个损失之间的加权系数。损失函数中的超参数在图像风格转移^[10]、图像超分辨率^[11]及GAN等网络模型中都会涉及，通常采用遍历或者随机搜索的方式反复尝试，最终才能确定合适的超参数。

为了解决对抗样本平滑区域易出现异常纹理和超参数确认困难的问题，本文提出了一种超参数自适应调节算法(Aho-λ)。该算法基于图像纹理和颜色感知距离，有效降低了对抗样本的视觉差异。训练过程中结合损失函数中超参数与攻击成功率和扰动距离之间的线性关系^{[2, 9]}，进行动态调节超参数，有效避免了超参数的反复尝试，降低对抗样本扰动的同时也减少了算法的迭代次数。

4. 结束语

本文结合纹理度筛选与颜色感知距离CIEDE2000作为图像损失函数，设计了一种能够自适应调节超参数的算法Aho-λ，生成的图像具有更小的颜色感知距离和更快的收敛。在JPEG和Big Depth压缩下具有良好的鲁棒性，且对抗样本在多种网络模型下具备一定的迁移能力。

使用CIEDE2000标准作为人类感知距离，在一定程度上降低了对抗样本在视觉上的可见度，但在图像平滑区域依然存在一定的可感知性，未来希望找到一种更符合人类感官的新标准引入训练损失中；同时也希望找到一种能够定量区分图像修改区域的方法，进一步完善纹理度筛选。

参考文献 (30)

[1]	SZEGEDY C, ZAREMBA W, SUTSKEVER I. Intriguing properties of neural networks[C]//The 2nd International Conference on Learning Representations. [S.l.]: 2014, 4: 3861-3864.
[2]	CARLINI N, WAGNER D. Towards evaluating the robustness of neural networks[C]//Symposium on Security and Privacy. San Jose: IEEE, 2017, 5: 39-57.
[3]	GOODFELLOW I, SHLENS J, SZEGEDY C. Explaining and harnessing adversarial examples[C]//The 3rd International Conference on Learning Representations. San Diego: [s.n.], 2015, 5: 1353-1362.
[4]	KURAKIN A, GOODFELLOW I, BENGIO S. Adversarial examples in the physical world[C]//The 5th International Conference on Learning Representations. Toulon: [s.n.], 2017, 4: 1238-1249.
[5]	TRAMER F, KURAKI A, PAPERNOT N, et al. Ensemble adversarial training: attacks and defenses[C]//The 6th International Conference on Learning Representations. Vancouver: IEEE, 2018, 5: 131-138.
[6]	MOOSAVI-DEZFOOLI S, FAWZI A, FROSSARD P. Deepfool: A simple and accurate method to fool deep neural networks[C]//Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016, 6: 2574-2582.
[7]	PAPERNOT N, MCDANIEL P, JHA S. The limitations of deep learning in adversarial settings[C]//European Symposium on Security and Privacy. [S.l.]: IEEE, 2016, 3: 372-387.
[8]	SHARIF M, BAUER L, REITER M. On the suitability of L_p-norms for creating and preventing adversarial examples[C]//2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops. [S.l.]: IEEE, 2018, 6: 1605-1613.
[9]	RONY J, HAFEMANN L, OLIVEIRA L, et al. Decoupling direction and norm for efficient gradient-based L₂ adversarial attacks and defenses[C]//Conference on Computer Vision and Pattern Recognition. Long Beach: IEEE, 2019, 6: 4322-4330.
[10]	GATYS L, ECKER A, BETHGE M. A neural algorithm of artistic style[EB/OL]. [2015-09-02]. https://arxiv.org/abs/1508.06576.
[11]	YOON Y, JEON H, YOO D, et al. Learning a deep convolutional network for light-field image super-resolution[C]//International Conference on Computer Vision Workshop. [S.l.]: IEEE, 2015, 12: 57-65.
[12]	SU Jia-wei, VARGAS D, SAKURAI K. One pixel attack for fooling deep neural networks[J]. Trans Evol Comput, 2019, 23(5): 828-841. doi: 10.1109/TEVC.2019.2890858
[13]	WANG Z, BOVIK A, SHEIKH H, et al. Image quality assessment: from error visibility to structural similarity[J]. Trans Image Process, 2004, 13(4): 600-612. doi: 10.1109/TIP.2003.819861
[14]	ROZSA A, RUDD E, BOULT T. Adversarial diversity and hard positive generation[C]//Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016, 6: 410-417.
[15]	KANBAK C, MOOSAVI-DEZFOOLI S, FROSSARD P. Geometric robustness of deep networks: Analysis and improvement[C]//Conference on Computer Vision and Pattern Recognition. San Diego: IEEE, 2018, 6: 4441-4449.
[16]	ENGSTROM L, TSIPRAS D, SCHMIDT L, et al. Exploring the landscape of spatial robustness[EB/OL]. [2019-09-16]. https://arxiv.org/abs/1712.02779.
[17]	HENG Wen, ZHOU Shu-chang, JIANG Ting-ting. Harmonic adversarial attack method[EB/OL]. [2018-08-08]. https://arxiv.org/abs/1807.10590.
[18]	LI Yi-jun, LIU Ming-yu, YANG Ming-hsuan, et al. A closed-form solution to photorealistic image stylization[J]. Springer Journal of Computer Vision, 2018, 5: 468-483.
[19]	PUY G, PEREZ P. A flexible convolutional solver with application to photorealistic style transfer[EB/OL]. [2018-06-13]. https://arxiv.org/abs/1806.05285.
[20]	ZHANG Han-wei, AVRITHIS Y, FURON T, et al. Smooth adversarial examples[J]. EURASIP Journal on Information Security, 2020(1): 15-24. doi: 10.1186/s13635-020-00112-z
[21]	BHATTAD A, CHONG Min-jin, LIANG Kai-zhao. Unrestricted adversarial examples via semantic manipulation[EB/OL]. [2019-03-20]. https://arxiv.org/abs/1904.06347.
[22]	LUO Bo, LIU Yan-nan, WEI Ling-xiao, et al. Towards imperceptible and robust adversarial example attacks against neural networks[C]//The 30th Innovative Applications of Artificial Intelligence. Louisiana: IEEE, 2018, 2: 1652-1659.
[23]	CROCE F, HEIN M. Sparse and imperceivable adversarial attacks[C]//International Conference on Computer Vision. Seoul: IEEE, 2019, 10: 4728-4731.
[24]	KURAKIN A, GOODFELLOW I, BENGIO S, et al. Adversarial attacks and defences competition[EB/OL]. (2018-03-31). https://arxiv.org/abs/1804.00097.
[25]	ZHAO Zheng-yu, LIU Zhuo-ran, LARSON M. Towards large yet imperceptible adversarial image perturbations with perceptual color distance[C]//Conference on Computer Vision and Pattern Recognition. Seattle: IEEE, 2020, 6: 1036-1045.
[26]	RONNIER L, CUI Gui-hua, BRYAN R. The development of the CIE 2000 colour-difference formula: CIEDE2000[J]. Color Research, 2001, 26(5): 340-350. doi: 10.1002/col.1049
[27]	SZEGEDY C, VANHOUCKE V, IOFFE S. Rethinking the inception architecture for computer vision[C]//Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016, 6: 2818-2826.
[28]	GUO C, RANA M, CISSE M, et al. Countering adversarial images using input transformations[EB/OL]. [2018-01-25]. https://arxiv.org/abs/1711.00117.
[29]	SIMONYAN K, ZISSERMAN A. Very deep convolutional networks for large-scale image recognition[C]//The 3rd International Conference on Learning Representations. San Diego: IEEE, 2016, 6: 2818-2826.
[30]	HE Kai-ming, ZHANG Xiang-yu, SUN Jian. Deep residual learning for image recognition[C]//Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016, 6: 770-778.

[1]	吴子锐, 杨之蒙, 蒲晓蓉, 徐杰, 曹晟, 任亚洲. 面向特征生成的无监督域适应算法 . 电子科技大学学报, 2022, 51(4): 580-585, 607. doi: 10.12178/1001-0548.2021314
[2]	刘欣刚, 江浩杨, 苏鑫, 冯晶. 基于决策边界搜索的对抗样本生成算法 . 电子科技大学学报, 2022, 51(5): 721-727. doi: 10.12178/1001-0548.2021396
[3]	李明, 何子述. 基于输出信杂噪比的机载雷达训练样本选择算法 . 电子科技大学学报, 2021, 50(5): 676-681. doi: 10.12178/1001-0548.2021041
[4]	何坤, 郑秀清, 谢沁岑, 王丹. 基于水平集的自适应保边平滑分割 . 电子科技大学学报, 2017, 46(4): 579-584. doi: 10.3969/j.issn.1001-0548.2017.04.017
[5]	马义德, 袁金霞, 张红娟. 自适应彩色图像SCM去噪 . 电子科技大学学报, 2012, 41(5): 751-758.
[6]	郭志勇, 李广军, 李强. 用于提高ADC性能的自适应Dither结构 . 电子科技大学学报, 2011, 40(3): 375-378. doi: 10.3969/j.issn.1001-0548.2011.03.009
[7]	陈晓梅, 孟晓风, 王国华. 自适应阈值估计的MS模型仿真建模 . 电子科技大学学报, 2011, 40(1): 58-63. doi: 10.3969/j.issn.1001-0548.2011.01.011
[8]	朱斌, 曾孝平, 曾凡鑫, 吴华. PN码自适应门限捕获新方法 . 电子科技大学学报, 2010, 39(4): 490-494. doi: 10.3969/j.issn.1001-0548.2010.04.003
[9]	李国军, 曾孝平, 周晓娜, 曾理, 蒋勇. 微弱高频CW信号的自适应滤波 . 电子科技大学学报, 2010, 39(2): 227-231,250. doi: 10.3969/j.issn.1001-0548.2010.02.016
[10]	孔令讲, 罗美方. 改进的对角加载自适应脉冲压缩算法 . 电子科技大学学报, 2010, 39(6): 854-858. doi: 10.3969/j.issn.1001-0548.2010.06.011
[11]	王朝翔, 丁炜. 自适应的Ad hoc MAC协议 . 电子科技大学学报, 2009, 38(1): 47-50.
[12]	熊超, 王建英. 弱信号环境下的盲自适应波束形成 . 电子科技大学学报, 2007, 36(2): 182-185.
[13]	周泽坤, 王锐, 张波. 一种自适应斜坡补偿电路 . 电子科技大学学报, 2007, 36(1): 47-49.
[14]	孙艳争, 黄炜, 余波. 基于EMD的非线性信号自适应分析 . 电子科技大学学报, 2007, 36(1): 24-26.
[15]	佘堃, 蒲红梅, 郑方伟, 周明天. 自适应多目独立成分分析 . 电子科技大学学报, 2007, 36(1): 11-13.
[16]	甘刚, 陈运, 李飞. 网络对抗训练模拟系统的设计与实现 . 电子科技大学学报, 2007, 36(3): 604-607.
[17]	邓琼, 刘晓明, 邓斌. 基于MEMS的自适应钻地引信系统设计 . 电子科技大学学报, 2006, 35(6): 932-935.
[18]	陈玉, 王晟, 李乐民. Ad hoc中基于主动网的自适应FEC . 电子科技大学学报, 2006, 35(5): 733-735,773.
[19]	程红霞, 张玉兴, 吴援明. 改进的OBS长度自适应门限组装算法 . 电子科技大学学报, 2006, 35(3): 302-304,342.
[20]	崔奉云, 王军. 基于电容近感原理的无源探测信号分析 . 电子科技大学学报, 2002, 31(6): 562-565.

留言板