Rating-Trustworthy Recommendation Model Based on Generative Adversarial Networks

WANG Yong; WANG Songli; DENG Jiangzhou

doi:10.12178/1001-0548.2023116

Existing deep learning-based recommendation models have mainly focused on improving the accuracy of recommendation systems. However, beyond recommendation accuracy, the reliability of the model's recommendations is also of great concern. Therefore, a rating-trustworthy recommendation model based on generative adversarial networks (GANs) is proposed to evaluate the effectiveness of prediction results and achieve a balance between recommendation accuracy and reliability. This model solely employs explicit user rating information to gauge the credibility of predicted ratings and screens out highly credible predicted ratings based on a predefined reliability threshold, thus ensuring the trustworthiness of recommended items. Furthermore, to enhance the prediction performance of the model and ensure fairness in training, a positive sample padding strategy is designed to mitigate the data imbalance problem in the rating reliability matrix. Experimental results on three real datasets show that the proposed model outperforms selected comparison methods in both Recall and NDCG metrics, effectively improving the performance of recommendation systems.


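In the big-data era, information resources are growing explosively and homogeneous content floods the web, making it hard for users to find accurate and trustworthy content; the information overload problem is becoming increasingly severe. Recommender systems^[1] (Recommender System, RS), an efficient information-filtering technique, analyze users' historical preference behavior in depth to provide personalized recommendations and improve user experience. Among them, deep learning-based recommendation methods^[2] have performed well. Generative adversarial networks^[3] (Generative Adversarial Networks, GAN) are an emerging deep learning technique that has been shown to handle data noise and data sparsity effectively^[1], offering a new way to improve recommendation performance.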

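Most recommendation models predict from the raw rating information alone, but because user behavior is subjective and complex, those ratings are not fully reliable, which lowers prediction accuracy^[4]. Reference [5] corrects ratings through a noise-management scheme to supply relatively accurate input. Such methods, however, overlook how much the reliability of the output matters to recommendation quality. References [6-8] also confirm that trustworthy recommendation has a positive effect on user satisfaction, so recommendation reliability should be considered alongside accuracy. In hotel recommendation, for instance, a hotel with a high overall rating but few raters may be trusted less than one with a slightly lower rating from many more raters. Recommender systems therefore need to take rating credibility into account to improve recommendation quality and user experience.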

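On this basis, this paper proposes a rating-trustworthy recommendation model based on generative adversarial networks to provide reliability predictions. First, because implicit feedback cannot accurately describe user preferences, a conditional GAN is trained on users' explicit ratings to capture how strongly different users prefer items. Second, natural-noise detection based on preference consistency identifies noisy ratings in the original matrix, and a purpose-built generation mechanism produces a rating reliability matrix; the GAN framework is then used to predict the reliability probability of each rating. Finally, predicted ratings that fall short of a preset reliability threshold are filtered out, so that the items pushed to the target user are highly credible, achieving trustworthy recommendation.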
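The final filtering step can be illustrated with a minimal Python sketch. It assumes each candidate item already carries a predicted rating and a reliability probability; the function name, the data layout, the `>=` comparison, and the sample values are illustrative assumptions rather than the paper's implementation. The threshold 0.435 is the value listed for ML-100K in the reliability-threshold table.

```python
from typing import Dict, List, Tuple


def trustworthy_top_n(
    predictions: Dict[str, Tuple[float, float]],
    threshold: float,
    n: int,
) -> List[str]:
    """Return up to n item ids whose reliability is at least `threshold`,
    ordered by predicted rating (highest first)."""
    trusted = [
        (item, rating)
        for item, (rating, reliability) in predictions.items()
        if reliability >= threshold  # assumption: ties at the threshold are kept
    ]
    trusted.sort(key=lambda pair: pair[1], reverse=True)
    return [item for item, _ in trusted[:n]]


# Hypothetical predictions for one user:
# item id -> (predicted rating, reliability probability)
example = {
    "item_a": (4.8, 0.30),
    "item_b": (4.5, 0.62),
    "item_c": (4.1, 0.51),
    "item_d": (3.2, 0.90),
}

print(trustworthy_top_n(example, threshold=0.435, n=2))  # ['item_b', 'item_c']
```

Here `item_a` has the highest predicted rating but is dropped because its reliability is below the threshold, which is the trade-off between accuracy and reliability that the model is meant to manage.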

1. Related Work

1.1. Generative Adversarial Networks

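GANs can learn high-dimensional, complex data distributions without relying on any prior assumptions. They were first applied successfully to image generation and later achieved notable results in recommender systems. Reference [9] successfully applied GAN-style adversarial training to collaborative filtering (CF) to improve recommendation accuracy. However, its generator G produces discrete item indices, which leads to conflicting item labels. To address this, Reference [10] trains a GAN with vector-wise adversarial training (GAN-Based Collaborative Filtering Framework, CFGAN) to resolve a series of problems in conventional GAN-based models. Reference [11] adds a memory module to train user-specific condition vectors, generating data that better matches user characteristics. Reference [12] proposes an implicit recommendation algorithm that combines neural collaborative filtering (Neural Network-Based Collaborative Filtering, NCF) with GANs to address insufficient user-item interaction information. None of these methods, however, considers the natural noise present in the raw ratings, which may affect the accuracy and reliability of model training and degrade prediction quality. In addition, existing GAN-based methods have not fully explored using explicit ratings as model input.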
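To make the vector-wise idea concrete, the sketch below shows a single forward pass in plain Python: the generator emits a score for every item at once instead of sampling one discrete item index, and the scores of items the user never interacted with are masked to zero before the discriminator sees them. The single-layer networks, the random weights, and all names are assumptions chosen for brevity; there is no training loop, and this is not the architecture of Reference [10] or of the proposed model.

```python
import math
import random
from typing import List


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def dense(vector: List[float], weights: List[List[float]]) -> List[float]:
    """One linear layer followed by a sigmoid; `weights` has one row per output unit."""
    return [sigmoid(sum(w * v for w, v in zip(row, vector))) for row in weights]


def generate_masked(condition: List[float], g_weights: List[List[float]]) -> List[float]:
    """Generator output for all items at once, with unobserved items zeroed (masking)."""
    scores = dense(condition, g_weights)
    return [s if c > 0 else 0.0 for s, c in zip(scores, condition)]


def discriminate(vector: List[float], condition: List[float], d_weights: List[float]) -> float:
    """Score a (vector, condition) pair; values near 1 mean 'looks real'."""
    return dense(vector + condition, [d_weights])[0]


random.seed(0)
n_items = 5
user_vector = [1.0, 0.0, 1.0, 0.0, 0.0]  # hypothetical interaction vector (the condition)
g_weights = [[random.uniform(-1, 1) for _ in range(n_items)] for _ in range(n_items)]
d_weights = [random.uniform(-1, 1) for _ in range(2 * n_items)]

fake = generate_masked(user_vector, g_weights)
print("generated vector:", fake)
print("D(real):", discriminate(user_vector, user_vector, d_weights))
print("D(fake):", discriminate(fake, user_vector, d_weights))
```

Because the generator output is a continuous vector rather than a sampled index, the same item cannot be labelled both real and fake within one training step, which is the label conflict described above.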

1.2. Trustworthy Recommender Systems

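To mitigate the recommendation bias caused by noise, researchers have proposed various solutions for handling natural noise so that the model input is accurate. Reference [13] introduces a natural-noise management method based on rating preferences to identify and correct erroneous user ratings. Reference [14] draws on fuzzy theory to design a fuzzy noise-identification framework. Reference [15] identifies natural noise by classifying users and items into finer-grained categories. Reference [16] designs models tailored to different recommendation scenarios to preprocess natural noise. Reference [17] proposes a probability-weighted threshold method to correct natural noise. All of these methods preprocess user ratings from the perspective of the information source. Beyond detecting noise in the input, evaluating the validity of the model's output is just as important, since it directly affects the accuracy and reliability of the recommendations.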
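As a rough illustration of preference-based noise detection, the sketch below uses a simplified class-consistency heuristic: each rating, user, and item is assigned a class (weak, average, strong), and a rating is flagged when the user and the item share a class that the rating contradicts. The fixed cut-offs for a 1-5 scale, the strict-majority rule, and all names are assumptions for illustration; this is not the detection rule of any cited reference or of the proposed model. The output is a binary label per observed rating (1 = kept as reliable, 0 = flagged).

```python
from collections import Counter
from typing import Dict, List, Tuple

Ratings = Dict[Tuple[str, str], float]  # (user, item) -> rating on a 1-5 scale


def rating_class(r: float) -> str:
    # Hypothetical fixed cut-offs for a 1-5 scale.
    if r <= 2:
        return "weak"
    if r >= 4:
        return "strong"
    return "average"


def profile_class(values: List[float]) -> str:
    """Label a user or item by its strict-majority rating class, else 'variable'."""
    counts = Counter(rating_class(r) for r in values)
    label, count = counts.most_common(1)[0]
    return label if 2 * count > len(values) else "variable"


def reliability_labels(ratings: Ratings) -> Dict[Tuple[str, str], int]:
    by_user: Dict[str, List[float]] = {}
    by_item: Dict[str, List[float]] = {}
    for (u, i), r in ratings.items():
        by_user.setdefault(u, []).append(r)
        by_item.setdefault(i, []).append(r)
    user_cls = {u: profile_class(v) for u, v in by_user.items()}
    item_cls = {i: profile_class(v) for i, v in by_item.items()}

    labels = {}
    for (u, i), r in ratings.items():
        agree = user_cls[u] == item_cls[i] and user_cls[u] != "variable"
        noisy = agree and rating_class(r) != user_cls[u]
        labels[(u, i)] = 0 if noisy else 1
    return labels


# Hypothetical toy data: every user and item is mostly "strong".
toy = {
    ("u1", "i1"): 5, ("u1", "i2"): 4, ("u1", "i3"): 1,
    ("u2", "i1"): 5, ("u2", "i2"): 5, ("u2", "i3"): 4,
    ("u3", "i1"): 4, ("u3", "i2"): 4, ("u3", "i3"): 5,
}
labels = reliability_labels(toy)
print([pair for pair, ok in labels.items() if ok == 0])  # [('u1', 'i3')]
```

The single low rating from a generally positive user on a generally well-liked item is the only one flagged; every other rating is consistent with its user and item profile.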

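How trustworthy a recommendation model's output is affects users' purchase intentions and decisions. Introducing reliability information to improve the credibility of predictions can therefore both assist user modeling and raise the accuracy and reliability of predicted ratings. Reference [7] introduces a reliability measure together with textual information to build a dynamic trust network that improves the accuracy and reliability of predicted ratings. Reference [18] uses denoising autoencoders to extract multi-faceted latent features from multi-source data and integrates them with reinforcement learning to measure rating reliability. Reference [8] proposes a trustworthy recommendation method from both the user and item views to evaluate the reliability of predictions and correct untrustworthy ones. Reference [19] introduces a matrix factorization recommendation model based on the Bernoulli distribution that obtains reliability probabilities for predictions and outputs only the highly reliable ones. Subsequently, to better capture and fuse high- and low-order user and item features, Reference [20] applies neural collaborative filtering to classify the user rating matrix and obtain discrete predicted ratings with reliability probabilities. All of these methods, however, perform trustworthy recommendation using raw ratings or other auxiliary information without fully accounting for the noise that is pervasive in user ratings, which lowers both training accuracy and the credibility of predictions. Incorporating the reliability of the information source into the recommendation model therefore helps further improve trustworthy recommendation.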

4. Conclusion

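This paper proposes a rating-trustworthy recommendation model based on generative adversarial networks. It attaches a reliability probability to each predicted rating and filters out predictions whose reliability probability falls below a threshold, so that the items finally recommended to the target user have both high predicted ratings and high credibility. Like CF-based models, it obtains reliability values for predicted ratings without any auxiliary information beyond user ratings, which makes it general and extensible. In addition, for fairness in training, a positive sample padding strategy is designed to mitigate data imbalance in the rating reliability matrix, improving the accuracy of the model's output. Experiments show that the proposed model clearly outperforms the comparison methods on three public datasets, effectively improving recommendation quality.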

References

[1]	GAO M, ZHANG J, YU J, et al. Recommender systems based on generative adversarial networks: A problem-driven perspective[J]. Information Sciences, 2021, 546: 1166-1185.
[2]	HE X, LIAO L, ZHANG H, et al. Neural collaborative filtering[C]//Proceedings of the 26th International Conference on World Wide Web. Perth: ACM, 2017: 173-182.
[3]	GOODFELLOW I, POUGET-ABADIE J, MIRZA M, et al. Generative adversarial networks[J]. Communications of the ACM, 2020, 63(11): 139-144.
[4]	DENG J, RAN X, WANG Y, et al. Probabilistic matrix factorization recommendation approach for integrating multiple information sources[J]. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2023, 53(10): 6220-6231.
[5]	JOORABLOO N, JALILI M, REN Y. Improved recommender systems by denoising ratings in highly sparse datasets through individual rating confidence[J]. Information Sciences, 2022, 601: 242-254.
[6]	BOBADILLA J, GUTIÉRREZ A, ORTEGA F, et al. Reliability quality measures for recommender systems[J]. Information Sciences, 2018, 442: 145-157.
[7]	MORADI P, AHMADIAN S. A reliability-based recommendation method to improve trust-aware recommender systems[J]. Expert Systems with Applications, 2015, 42(21): 7386-7398.
[8]	AHMADIAN S, AFSHARCHI M, MEGHDADI M. A novel approach based on multi-view reliability measures to alleviate data sparsity in recommender systems[J]. Multimedia Tools and Applications, 2019, 78: 17763-17798.
[9]	WANG J, YU L, ZHANG W, et al. IRGAN: A minimax game for unifying generative and discriminative information retrieval models[C]//Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. Tokyo: ACM, 2017: 515-524.
[10]	CHAE D K, KANG J S, KIM S W, et al. CFGAN: A generic collaborative filtering framework based on generative adversarial networks[C]//Proceedings of the 27th ACM International Conference on Information and Knowledge Management. New York: ACM, 2018: 137-146.
[11]	WEN J, ZHU X R, WANG C D, et al. A framework for personalized recommendation with conditional generative adversarial networks[J]. Knowledge and Information Systems, 2022, 64(10): 2637-2660.
[12]	SUN J, LIU B, REN H, et al. NCGAN: A neural adversarial collaborative filtering for recommender system[J]. Journal of Intelligent and Fuzzy Systems, 2022, 42(4): 2915-2923.
[13]	BAG S, KUMAR S, AWASTHI A, et al. A noise correction-based approach to support a recommender system in a highly sparse rating environment[J]. Decision Support Systems, 2019, 118: 46-57.
[14]	WANG P, WANG Y, ZHANG L Y, et al. An effective and efficient fuzzy approach for managing natural noise in recommender systems[J]. Information Sciences, 2021, 570: 623-637.
[15]	LI W, LI X, DENG J, et al. Sentiment based multi-index integrated scoring method to improve the accuracy of recommender system[J]. Expert Systems with Applications, 2021, 179: 115105.
[16]	YERA R, CASTRO J, MARTINEZ L. Natural noise management in recommender systems using fuzzy tools[J]. Computational Intelligence for Semantic Knowledge Management: New Perspectives for Designing and Organizing Information Systems, 2020: 1-24.
[17]	LUO C, WANG Y, LI B, et al. An efficient approach to manage natural noises in recommender systems[J]. Algorithms, 2023, 16(5): 228.
[18]	AHMADIAN M, AHMADIAN S, AHMADI M. RDERL: Reliable deep ensemble reinforcement learning-based recommender system[J]. Knowledge-Based Systems, 2023, 263: 110289.
[19]	ORTEGA F, LARA-CABRERA R, GONZÁLEZ-PRIETO Á, et al. Providing reliability in recommender systems through Bernoulli matrix factorization[J]. Information Sciences, 2021, 553: 110-128.
[20]	BOBADILLA J, GUTIÉRREZ A, ALONSO S, et al. Neural collaborative filtering classification model to obtain prediction reliabilities[J]. International Journal of Interactive Multimedia and Artificial Intelligence, 2022, 7(4): 18-26.
[21]	LIU D, WANG Y, LUO C, et al. An improved autoencoder for recommendation to alleviate the vanishing gradient problem[J]. Knowledge-Based Systems, 2023: 110254.

Dataset	Users	Items	Ratings	Sparsity/%
ML-100K	943	1 682	100 000	6.3
ML-1M	6 040	3 952	1 000 209	4.2
YM	15 400	1 000	365 704	2.3
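The last column of the dataset table (sparsity, %) can be cross-checked against the other three. The short sketch below assumes the column reports the share of user-item pairs that actually have a rating, that is, ratings / (users × items); that reading is an inference from the numbers, not something the table states.

```python
# (users, items, ratings, reported last-column value in %), copied from the table above
datasets = {
    "ML-100K": (943, 1682, 100000, 6.3),
    "ML-1M": (6040, 3952, 1000209, 4.2),
    "YM": (15400, 1000, 365704, 2.3),
}


def observed_share(users: int, items: int, ratings: int) -> float:
    """Percentage of user-item pairs that have a rating."""
    return 100.0 * ratings / (users * items)


for name, (users, items, ratings, reported) in datasets.items():
    computed = observed_share(users, items, ratings)
    print(f"{name}: computed {computed:.2f}%, reported {reported}%")
```

Under that assumption the computed values land within 0.1 percentage points of the reported ones, so fewer than 7% of the possible ratings are observed in any of the three datasets.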

Dataset	ML-100K	ML-1M	YM
Reliability threshold	0.435	0.475	0.395

Method	ML-100K	ML-1M	YM
NCF	(1.419, 0.513, 0.532)	(1.535, 0.472, 0.496)	(1.823, 0.375, 0.446)
C_NCF	(1.172, 0.526, 0.541)	(1.048, 0.492, 0.500)	(1.898, 0.369, 0.426)
BeMF	(1.811, 0.503, 0.544)	(1.837, 0.489, 0.525)	(1.973, 0.304, 0.362)
I_CDAE	(0.984, 0.523, 0.541)	(0.987, 0.496, 0.521)	(1.686, 0.389, 0.446)
CFGAN	(1.382, 0.494, 0.527)	(1.348, 0.475, 0.501)	(1.837, 0.361, 0.442)
PRGAN	(1.331, 0.527, 0.537)	(1.724, 0.455, 0.469)	(1.783, 0.380, 0.446)
Rating-GAN	(1.161, 0.516, 0.548)	(1.197, 0.500, 0.514)	(1.722, 0.385, 0.468)
DGAN+R	(0.967, 0.544, 0.571)	(1.019, 0.519, 0.536)	(1.458, 0.400, 0.484)
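The abstract reports results in terms of Recall and NDCG. The comparison table does not label its three values, so the sketch below only illustrates the commonly used top-K definitions with binary relevance; the cut-off K, the relevance criterion, and the function names are assumptions, and the paper's exact evaluation protocol may differ.

```python
import math
from typing import List, Set


def recall_at_k(ranked: List[str], relevant: Set[str], k: int) -> float:
    """Fraction of the relevant items that appear in the top-k of the ranking."""
    if not relevant:
        return 0.0
    hits = sum(1 for item in ranked[:k] if item in relevant)
    return hits / len(relevant)


def ndcg_at_k(ranked: List[str], relevant: Set[str], k: int) -> float:
    """DCG of the top-k (gain 1 per relevant item, log2 position discount),
    divided by the DCG of an ideal ranking."""
    dcg = sum(
        1.0 / math.log2(pos + 2)
        for pos, item in enumerate(ranked[:k])
        if item in relevant
    )
    ideal_hits = min(k, len(relevant))
    idcg = sum(1.0 / math.log2(pos + 2) for pos in range(ideal_hits))
    return dcg / idcg if idcg > 0 else 0.0


# Hypothetical ranking and ground truth for one user.
ranked = ["a", "b", "c", "d"]
relevant = {"a", "c", "e"}

print(recall_at_k(ranked, relevant, 3))  # 2 of the 3 relevant items are in the top 3
print(ndcg_at_k(ranked, relevant, 3))
```

Recall only counts how many relevant items were retrieved, while NDCG also rewards placing them near the top of the list, which is why the two are usually reported together.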
