Unsupervised Feature-Level Domain Adaptation with Generative Adversarial Networks

WU Zirui; YANG Zhimeng; PU Xiaorong; XU Jie; CAO Sheng; REN Yazhou

doi:10.12178/1001-0548.2021314

Jul. 2022


WU Zirui, YANG Zhimeng, PU Xiaorong, XU Jie, CAO Sheng, REN Yazhou. Unsupervised Feature-Level Domain Adaptation with Generative Adversarial Networks[J]. Journal of University of Electronic Science and Technology of China, 2022, 51(4): 580-585, 607. doi: 10.12178/1001-0548.2021314


1. School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731
2. Institute of Electronic and Information Engineering, University of Electronic Science and Technology of China, Dongguan, Guangdong 523808

Received Date: 2021-10-26
Rev Recd Date: 2021-12-25

Available Online: 2022-07-11

Publish Date: 2022-07-09

Abstract

When classifying unlabeled high-dimensional images, commonly used deep neural networks have difficulty producing good results on unlabeled datasets. This paper proposes an unsupervised feature-level domain adaptation method based on generative adversarial networks (Feature-GAN), which learns a feature-level transformation from one domain to another in an unsupervised manner. It maps source-domain image features to target-domain image features while preserving the label information, and the generated labeled features can then be used to train a classifier adapted to the target-domain features. For complex image domain adaptation problems, the model avoids generating the images themselves and focuses on feature generation instead. The model is easy to train and highly stable. Experiments show that the proposed method can be applied to a wide range of complex image classification scenarios and that it outperforms traditional sample-generation-based unsupervised domain adaptation algorithms in accuracy, convergence speed, and stability.
Keywords: GAN, image classification, transfer learning, unsupervised domain adaptation

References

[1]	XU J, REN Y, LI G, et al. Deep embedded multi-view clustering with collaborative training[J]. Information Sciences, 2021, 573: 279-290. doi: 10.1016/j.ins.2020.12.073
[2]	REN Y Z, WANG N, LI M X, et al. Deep density-based image clustering[J]. Knowledge-Based Systems, 2020, 197: 105841.
[3]	曹占涛, 杨国武, 陈琴, 等. 基于修正标签分布的乳腺超声图像分类[J]. 电子科技大学学报, 2020, 49(4): 597-602. doi: 10.12178/1001-0548.2020001 CAO Z T, YANG G W, CHEN Q, et al. Breast ultrasound images classification based on correction label distribution[J]. Journal of University of Electronic Science and Technology of China, 2020, 49(4): 597-602. doi: 10.12178/1001-0548.2020001
[4]	储岳中, 汪佳庆, 张学锋, 等. 基于改进深度残差网络的图像分类算法[J]. 电子科技大学学报, 2021, 50(2): 243-248. doi: 10.12178/1001-0548.2020314 CHU Y Z, WANG J Q, ZHANG X F, et al. Image classification algorithm based on improved deep residual network[J]. Journal of University of Electronic Science and Technology of China, 2021, 50(2): 243-248. doi: 10.12178/1001-0548.2020314
[5]	JIA S, JIANG S, LI Z, et al. A survey: Deep learning for hyperspectral image classification with few labeled samples[J]. Neurocomputing, 2021, 448: 179-204. doi: 10.1016/j.neucom.2021.03.035
[6]	KRIZHEVSKY A, SUTSKEVER I, HINTON G E. Imagenet classification with deep convolutional neural networks[J]. Advances in Neural Information Processing Systems, 2012, 25: 1097-1105.
[7]	SZEGEDY C, VANHOUCKE V, IOFFE S, et al. Rethinking the inception architecture for computer vision[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition. Las Vegas, NV: IEEE, 2016: 2818-2826.
[8]	CEN F, ZHAO X, LI W, et al. Deep feature augmentation for occluded image classification[J]. Pattern Recognition, 2021, 111: 107737. doi: 10.1016/j.patcog.2020.107737
[9]	ZHAN X, XIE J, LIU Z, et al. Online deep clustering for unsupervised representation learning[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2020: 6688-6697.
[10]	MA L, SHUAI R, RAN X, et al. Combining DC-GAN with ResNet for blood cell image classification[J]. Medical & Biological Engineering & Computing, 2020, 58(6): 1251-1264.
[11]	RADFORD A, METZ L, CHINTALA S. Unsupervised representation learning with deep convolutional generative adversarial networks[EB/OL]. [2021-08-22]. https://arxiv.org/abs/1511.06434.
[12]	HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2016: 770-778.
[13]	高淑萍, 赵清源, 齐小刚, 等. 改进MobileNet的图像分类方法研究[J]. 智能系统学报, 2021, 16(1): 11-20. GAO S P, ZHAO Q Y, QI X G, et al. Research on the improved image classification method of MobileNet[J]. CAAI Transactions on Intelligent Systems, 2021, 16(1): 11-20.
[14]	HOWARD A G, ZHU M, CHEN B, et al. Mobilenets: Efficient convolutional neural networks for mobile vision applications[EB/OL]. [2021-08-10]. https://arxiv.org/abs/1704.04861.
[15]	GANIN Y, LEMPITSKY V. Unsupervised domain adaptation by backpropagation[C]//International Conference on Machine Learning. Cambridge, MA: JMLR, 2015: 1180-1189.
[16]	GRETTON A, BORGWARDT K M, RASCH M J, et al. A kernel two-sample test[J]. Journal of Machine Learning Research, 2012, 13(1): 723-773.
[17]	BOUSMALIS K, SILBERMAN N, DOHAN D, et al. Unsupervised pixel-level domain adaptation with generative adversarial networks[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2017: 3722-3731.
[18]	LIU M Y, TUZEL O. Coupled generative adversarial networks[EB/OL]. [2021-07-13]. https://arxiv.org/abs/1606.07536.
[19]	GOODFELLOW I, POUGET-ABADIE J, MIRZA M, et al. Generative adversarial networks[J]. Neural Information Processing Systems, 2014, 3: 2672-2680.
[20]	ARJOVSKY M, CHINTALA S, BOTTOU L. Wasserstein GAN[EB/OL]. [2021-07-22]. https://arxiv.org/abs/1701.07875v3.
[21]	SAENKO K, KULIS B, FRITZ M, et al. Adapting visual category models to new domains[C]//European Conference on Computer Vision. Berlin: Springer, 2010: 213-226.
[22]	VENKATESWARA H, EUSEBIO J, CHAKRABORTY S, et al. Deep hashing network for unsupervised domain adaptation[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2017: 5018-5027.
[23]	SAITO K, WATANABE K, USHIKU Y, et al. Maximum classifier discrepancy for unsupervised domain adaptation[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2018: 3723-3732.
[24]	WANG Q, BRECKON T. Unsupervised domain adaptation via structured prediction based selective pseudo-labeling[C]//AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI, 2020: 6243-6250.
[25]	NA J, JUNG H, CHANG H J, et al. FixBi: Bridging domain spaces for unsupervised domain adaptation[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2021: 1094-1103.
[26]	VAN DER MAATEN L, HINTON G. Visualizing data using t-SNE[J]. Journal of Machine Learning Research, 2008, 9: 2579-2605.


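Among the various image tasks, image clustering^[1-2] groups similar images into the same cluster and separates dissimilar ones, whereas image classification^[3] must decide which category an image belongs to and plays an important role in image analysis. In recent years, with the spread of smartphones and other handheld camera devices, the volume of image data has grown substantially. Deep neural networks play an important role among image classification methods^[4-5]. However, as image resolution and content complexity rise sharply, a good neural network typically needs a complex architecture and a large number of labeled images for training.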

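To address problems such as the structural complexity of effective neural networks, many excellent image classification networks have been proposed, for example AlexNet^[6] and InceptionV3^[7]. Existing classification tasks use these mature classification networks to initialize their own networks^[8-9]. Typically, after the model is initialized with existing network parameters, task-specific network structures are added and the parameters are optimized against the task objective. For example, Ref. [10] combines unsupervised representation learning with deep convolutional generative adversarial networks (DC-GAN)^[11] and ResNet^[12] for image classification, while Ref. [13] improves MobileNet^[14] by replacing standard convolutions with depthwise separable convolutions. For supervised data, label information guides accurate parameter optimization. In some real-world scenarios, however, labels are scarce and accurate annotation is difficult, so obtaining a well-labeled training set is costly. For unlabeled complex images, the lack of a strong model objective often makes it hard to reach the expected performance.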

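To solve these problems, unsupervised domain adaptation (UDA) emerged. Its goal is to transfer knowledge from labeled source-domain data to the unlabeled target domain in order to improve classification performance on target-domain samples. Existing UDA algorithms fall into two main categories: 1) Methods that optimize a deep neural network using an inter-domain distance function as the loss. For these methods it is hard to find a suitable measure of inter-domain distance, and they are sensitive to the source-domain sample size, which leads to underfitting; for example, Ref. [15] uses maximum mean discrepancy (MMD)^[16] to compute the difference between domains. 2) UDA methods based on generative adversarial networks. Unsupervised pixel-level domain adaptation (PixelDA)^[17] uses a generator to produce "fake" samples and a discriminator to decide whether these generated instances come from the target domain. Coupled generative adversarial networks (Co-GAN)^[18] use a pair of generators whose first few layers are constrained by weight sharing, so that instances from different domains yield the same semantics, and whose last few layers decompose those semantics into different low-level details. The drawback of these models is the instability of GANs: generating complex images with a GAN requires a large number of samples and careful tuning of the model parameters, which seriously reduces clustering and classification efficiency.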
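As an illustration of the distance-based category, the following is a minimal sketch of a squared MMD estimate between two sets of feature vectors. It assumes a Gaussian (RBF) kernel with a hand-picked bandwidth `gamma` and uses the biased estimator; the function names and the toy data are hypothetical and are not taken from the paper or from Refs. [15-16].

```python
import math

def rbf_kernel(x, y, gamma=1.0):
    """Gaussian (RBF) kernel between two equal-length feature vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)

def mean_kernel(xs, ys, gamma):
    """Average kernel value over all pairs (x, y) with x in xs and y in ys."""
    total = sum(rbf_kernel(x, y, gamma) for x in xs for y in ys)
    return total / (len(xs) * len(ys))

def mmd_squared(source, target, gamma=1.0):
    """Biased estimate of the squared MMD between two feature sets."""
    return (mean_kernel(source, source, gamma)
            + mean_kernel(target, target, gamma)
            - 2.0 * mean_kernel(source, target, gamma))

# Toy 2-D "features" (assumed data): the target set is the source set shifted by (3, 3).
source_feats = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
target_feats = [[x + 3.0, y + 3.0] for x, y in source_feats]

same = mmd_squared(source_feats, source_feats)      # identical sets
shifted = mmd_squared(source_feats, target_feats)   # shifted distribution
print(same, shifted)
```

The estimate is zero for identical sets and grows as the two feature distributions move apart. The result depends on the kernel and on `gamma`, which is one concrete form of the difficulty noted above in choosing a suitable inter-domain distance.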

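Therefore, this paper proposes Feature-GAN, an unsupervised domain adaptation algorithm oriented toward feature generation. Following the DAN approach, it performs domain adaptation at the feature level, which makes the model easy to train; it also uses a GAN for feature generation, which avoids the search for a specific inter-domain distance measure. In this way it addresses the problems of existing unsupervised domain adaptation methods.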
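To make the adversarial objective at the feature level concrete, here is a minimal sketch of the two losses involved. It assumes the standard GAN cross-entropy losses of Ref. [19], with the non-saturating generator form; this section does not state which loss Feature-GAN actually uses, so the formulas, function names, and probabilities below are illustrative assumptions only.

```python
import math

def discriminator_loss(d_real, d_fake):
    """Binary cross-entropy: real target features -> 1, generated features -> 0.

    d_real / d_fake hold the discriminator's probabilities that a feature
    vector comes from the target domain.
    """
    real_term = -sum(math.log(p) for p in d_real) / len(d_real)
    fake_term = -sum(math.log(1.0 - p) for p in d_fake) / len(d_fake)
    return real_term + fake_term

def generator_loss(d_fake):
    """Non-saturating generator loss: push D(generated feature) toward 1."""
    return -sum(math.log(p) for p in d_fake) / len(d_fake)

# Case 1 (assumed values): the discriminator cannot tell generated features
# from real target features and outputs 0.5 everywhere.
confused = [0.5, 0.5, 0.5]
d_loss_eq = discriminator_loss(confused, confused)
g_loss_eq = generator_loss(confused)

# Case 2 (assumed values): the discriminator separates the two sets easily.
d_loss_sep = discriminator_loss([0.9, 0.95], [0.1, 0.05])
g_loss_sep = generator_loss([0.1, 0.05])

print(d_loss_eq, g_loss_eq, d_loss_sep, g_loss_sep)
```

When the discriminator is fully confused, its loss equals 2 ln 2 and the generator loss equals ln 2. When the generated features are easy to spot, the discriminator loss falls and the generator loss rises, and that rising loss is the signal that drives the generated features toward the target distribution.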

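4. Conclusion

This paper proposes Feature-GAN, an unsupervised domain adaptation method that operates at the image feature level. It overcomes the difficulty of defining an inter-domain distance function in traditional domain adaptation algorithms and avoids the slow, unstable training of sample-generation-based algorithms. Specifically, Feature-GAN uses the features of source-domain images to generate features of target-domain images. The generated features carry the labels of the source-domain image features and follow the distribution of the target-domain image features. The domain adaptation process and the image classification process use different feature extractors, which prevents domain adaptation from blurring the differences between classes in feature space. The method has few parameters and trains stably.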
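The data flow summarized above, in which generated features keep their source labels and are then used to train a target-domain classifier, can be sketched as follows. This is not the paper's implementation: the trained GAN generator is replaced by a hypothetical stand-in (a per-dimension mean shift), the classifier is a simple nearest-centroid rule, and all feature values are made-up toy data.

```python
def mean_vector(rows):
    """Per-dimension mean of a list of feature vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def stand_in_generator(source_feats, target_feats):
    """Placeholder for a trained generator: shift source features so that
    their mean matches the target mean. A real generator would be learned."""
    mu_s, mu_t = mean_vector(source_feats), mean_vector(target_feats)
    shift = [t - s for s, t in zip(mu_s, mu_t)]
    return [[v + d for v, d in zip(row, shift)] for row in source_feats]

def fit_centroids(feats, labels):
    """Nearest-centroid classifier: one mean vector per class label."""
    by_class = {}
    for row, lab in zip(feats, labels):
        by_class.setdefault(lab, []).append(row)
    return {lab: mean_vector(rows) for lab, rows in by_class.items()}

def predict(centroids, row):
    """Return the label whose centroid is closest to the feature vector."""
    def sq_dist(c):
        return sum((a - b) ** 2 for a, b in zip(row, c))
    return min(centroids, key=lambda lab: sq_dist(centroids[lab]))

def accuracy(centroids, feats, labels):
    hits = sum(predict(centroids, r) == y for r, y in zip(feats, labels))
    return hits / len(labels)

# Toy data (assumed): the target domain is the source domain shifted by (+2, 0).
source_feats = [[0.0, 0.0], [0.2, 0.1], [2.0, 0.0], [2.2, 0.1]]
source_labels = [0, 0, 1, 1]
target_feats = [[2.0, 0.0], [2.2, 0.1], [4.0, 0.0], [4.2, 0.1]]
target_labels = [0, 0, 1, 1]  # used for evaluation only, never for training

# Baseline: classifier trained on raw source features.
acc_source_only = accuracy(fit_centroids(source_feats, source_labels),
                           target_feats, target_labels)

# Adapted: generated features inherit the source labels one-to-one.
generated_feats = stand_in_generator(source_feats, target_feats)
acc_adapted = accuracy(fit_centroids(generated_feats, source_labels),
                       target_feats, target_labels)
print(acc_source_only, acc_adapted)
```

In this toy setting the source-only classifier mislabels the shifted target class, while the classifier trained on the mapped features with the inherited labels classifies every target sample correctly. The point of the sketch is the pairing of generated features with source labels, not the particular mapping.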


Dataset	Domain	Number of classes	Input size
Office-31	Amazon(A)	5	299×299
	DSLR(D)
	Webcam(W)
Office-Home	Clipart(C)	7	299×299
	Product(P)
	Real World(R)

Model	A→D	A→W	D→A	D→W	W→A	W→D	Avg
InceptionV3	77.08	78.33	71.25	94.38	73.33	93.33	81.28
MCD	93.00	73.80	81.40	95.00	81.80	98.40	87.23
PixelDA	88.20	86.00	81.20	96.00	70.40	98.60	86.73
SPL	70.32	84.65	95.60	82.76	94.36	85.73	85.58
FixBi	92.40	85.80	64.40	88.80	59.60	95.40	81.07
Feature-GAN	91.20	92.40	85.60	99.20	86.80	99.00	92.37
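As a quick consistency check on the Office-31 results above, the sketch below recomputes each model's average from its six per-task scores. It assumes that the Avg column is the unweighted mean over the six transfer tasks and that the scores are classification accuracies in percent; neither is stated explicitly in this section.

```python
# Per-task scores and reported averages, copied from the Office-31 table.
office31 = {
    "InceptionV3": ([77.08, 78.33, 71.25, 94.38, 73.33, 93.33], 81.28),
    "MCD":         ([93.00, 73.80, 81.40, 95.00, 81.80, 98.40], 87.23),
    "PixelDA":     ([88.20, 86.00, 81.20, 96.00, 70.40, 98.60], 86.73),
    "SPL":         ([70.32, 84.65, 95.60, 82.76, 94.36, 85.73], 85.58),
    "FixBi":       ([92.40, 85.80, 64.40, 88.80, 59.60, 95.40], 81.07),
    "Feature-GAN": ([91.20, 92.40, 85.60, 99.20, 86.80, 99.00], 92.37),
}

def mean(values):
    return sum(values) / len(values)

# Largest gap between a recomputed mean and the reported average.
max_gap = max(abs(mean(scores) - avg) for scores, avg in office31.values())

# Model with the highest reported average.
best_model = max(office31, key=lambda name: office31[name][1])

for name, (scores, avg) in office31.items():
    print(f"{name}: recomputed {mean(scores):.2f}, reported {avg:.2f}")
print(max_gap, best_model)
```

Under these assumptions the recomputed means agree with the reported averages to within about 0.01, and Feature-GAN has the highest average of the six models.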

Model	C→P	C→R	P→C	P→R	R→C	R→P	Avg
InceptionV3	80.03	86.24	70.76	85.89	75.58	87.83	81.06
MCD	78.32	88.85	76.32	90.07	77.34	86.27	82.86
PixelDA	76.75	84.32	71.49	84.84	78.07	86.73	80.37
SPL	76.20	76.68	80.75	96.83	83.28	75.29	81.51
FixBi	68.95	77.53	61.11	80.14	59.36	78.47	70.93
Feature-GAN	89.39	93.38	82.60	93.55	83.34	92.04	89.05
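The Office-Home results above can also be read task by task. The sketch below finds the top-scoring model for each transfer task, using only the numbers in the table; it assumes that higher scores are better (the scores are presumably accuracies in percent), and the variable names are illustrative.

```python
tasks = ["C→P", "C→R", "P→C", "P→R", "R→C", "R→P"]

# Per-task scores copied from the Office-Home table, in the order of `tasks`.
office_home = {
    "InceptionV3": [80.03, 86.24, 70.76, 85.89, 75.58, 87.83],
    "MCD":         [78.32, 88.85, 76.32, 90.07, 77.34, 86.27],
    "PixelDA":     [76.75, 84.32, 71.49, 84.84, 78.07, 86.73],
    "SPL":         [76.20, 76.68, 80.75, 96.83, 83.28, 75.29],
    "FixBi":       [68.95, 77.53, 61.11, 80.14, 59.36, 78.47],
    "Feature-GAN": [89.39, 93.38, 82.60, 93.55, 83.34, 92.04],
}

# For each task, pick the model with the highest score in that column.
winners = {
    task: max(office_home, key=lambda name: office_home[name][i])
    for i, task in enumerate(tasks)
}

feature_gan_wins = sum(w == "Feature-GAN" for w in winners.values())
print(winners, feature_gan_wins)
```

On these numbers Feature-GAN has the top score on five of the six tasks; on P→R the SPL entry (96.83) is higher than Feature-GAN's (93.55).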
