Entity-Relationship Joint Extraction Model Infused with Reinforcement Learning

ZHAI Sheping; LI Hang; KANG Xinnian; YANG Rui

doi:10.12178/1001-0548.2023107

Existing approaches to joint entity and relation extraction use distant supervision to generate large-scale training data automatically, which introduces severe noise into the data. To address this noise, this paper proposes a joint entity and relation extraction model that integrates reinforcement learning. The model consists of two components: a reinforcement learning component and a joint extraction model, the latter composed of a graph convolutional network and a multi-head self-attention mechanism. First, reinforcement learning is used to remove noisy sentences from the original dataset, and the denoised, high-quality sentences are fed into the joint extraction model. Second, the joint extraction model predicts and extracts entities and relations from the input sentences and returns a reward to the reinforcement learning component to guide it in selecting high-quality sentences. Finally, the reinforcement learning component and the joint extraction model are trained jointly and optimized iteratively. Experiments demonstrate that the proposed model effectively addresses data noise and outperforms baseline methods in entity relation extraction.

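Entity relation extraction^[1] is a fundamental task in natural language processing, applied mainly in information extraction, question answering, and automatic text summarization. Its goal is to formally describe the relation between entities as a triple $ \left\langle E_{1}, R, E_{2}\right\rangle $. Relation extraction is defined as follows: given a sentence $ S $ containing a pair of annotated entities $ E_{1} $ and $ E_{2} $, the aim is to extract the relation $ R $ between $ E_{1} $ and $ E_{2} $. For example, the sentence "Jack Ma founded Alibaba" contains the entities "Jack Ma" and "Alibaba" and the relation "founded" between them. The purpose of entity relation extraction is to identify the annotated entities in a sentence and to extract the relations among them.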
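The triple $ \left\langle E_{1}, R, E_{2}\right\rangle $ can be pictured as a small record. The following is a minimal sketch, not part of the paper: the class name `Triple`, its field names, and the helper `mentions_both_entities` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Triple:
    """A relation triple <E1, R, E2>."""
    head: str      # E1, the first entity
    relation: str  # R, the relation between the two entities
    tail: str      # E2, the second entity


def mentions_both_entities(sentence: str, triple: Triple) -> bool:
    """Check that both entity mentions of the triple occur in the sentence."""
    return triple.head in sentence and triple.tail in sentence


# The example sentence from the text, with its annotated triple.
sentence = "Jack Ma founded Alibaba"
triple = Triple(head="Jack Ma", relation="founded", tail="Alibaba")
```

A frozen dataclass keeps each triple immutable and hashable, so extracted triples can be collected in a set and compared against gold annotations.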

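Over the course of the development of entity relation extraction, researchers have proposed a variety of relation extraction methods. Early methods were mainly rule-based^[2] and dictionary-driven^[3]. Rule-based methods port poorly across domains and consume considerable manpower and resources. Dictionary-driven methods are inflexible and struggle to recognize and extract relations whose relation words are not verbs; they also suffer from feature dependence and relation overlap during extraction, which strongly affects extraction performance. Methods based on traditional machine learning^[4] outperform the early methods and improve the precision and recall of the extraction results, but they still suffer from overlapping entity relations. With the development of deep learning^[5] and distant supervision^[6], the above problems have been addressed reasonably well. However, distant supervision introduces a large amount of noisy data when processing data. Deep learning reduces the number of features that experts must extract, but it still relies on lexical resources to obtain high-level features. The main problems in entity relation extraction therefore center on two aspects: noisy data and feature extraction.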

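With the application of reinforcement learning^[7] to natural language processing, reinforcement learning offers a new line of research for entity relation extraction and an effective way to handle the noise introduced by distant supervision in this task.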

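This paper therefore proposes a new joint entity and relation extraction model that integrates reinforcement learning. The model has two parts: a joint network model and a reinforcement learning (RL) agent. The joint network model extracts entities and the relations between them; the RL agent interacts with the joint network model to identify and filter out noisy sentences, thereby improving the extraction performance of the joint network model.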
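The interaction between the RL agent and the joint network model can be sketched as a sentence selector trained with a policy gradient. This is a minimal illustration under stated assumptions, not the paper's implementation: the logistic policy, the REINFORCE update, and every function name are assumptions, and `toy_reward` is a hypothetical stand-in for the feedback that the joint extraction model would provide.

```python
import math
import random


def keep_probability(weights, features):
    """Policy pi(keep | sentence): a logistic model over sentence features."""
    score = sum(w * f for w, f in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-score))


def select_sentences(weights, batch, rng):
    """Sample one action per sentence: 1 keeps it, 0 filters it out."""
    return [1 if rng.random() < keep_probability(weights, f) else 0 for f in batch]


def toy_reward(actions, is_noisy):
    """Hypothetical stand-in for extractor feedback:
    the share of kept sentences that are clean."""
    kept = [noisy for action, noisy in zip(actions, is_noisy) if action == 1]
    if not kept:
        return 0.0
    return sum(1 for noisy in kept if not noisy) / len(kept)


def reinforce_update(weights, batch, actions, reward, lr):
    """One REINFORCE step: w <- w + lr * reward * grad_w log pi(action | sentence)."""
    updated = list(weights)
    for features, action in zip(batch, actions):
        p = keep_probability(weights, features)
        # For a Bernoulli policy with a logistic link, the gradient of
        # log pi(action) with respect to w is (action - p) * features.
        for i, f in enumerate(features):
            updated[i] += lr * reward * (action - p) * f
    return updated


def train_selector(batch, is_noisy, epochs=50, lr=0.5, seed=0):
    """Alternate between sampling a selection and updating the policy."""
    rng = random.Random(seed)
    weights = [0.0] * len(batch[0])
    for _ in range(epochs):
        actions = select_sentences(weights, batch, rng)
        reward = toy_reward(actions, is_noisy)
        weights = reinforce_update(weights, batch, actions, reward, lr)
    return weights
```

In a full system the reward would come from the joint network model's performance on the selected sentences rather than from known noise labels, and a baseline is commonly subtracted from the reward to reduce the variance of the update.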

1. Related Work

1.1. Joint Entity and Relation Extraction

Pipeline methods and joint extraction methods are the two approaches currently in wide use for entity and relation extraction.

1.1.1. Pipeline Methods

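Pipeline methods first extract token spans from the text to detect entity pairs and then predict the relation type between them. Many traditional named entity recognition (NER) models are linear sequence models^[8], such as the hidden Markov model (HMM)^[9], conditional random fields (CRF)^[10], and the maximum entropy Markov model (MEMM)^[11]. These models depend heavily on appropriate features, and those features require extensive manual feature engineering or external resources.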

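With the advent of deep learning, neural networks are used to construct features and extract appropriate semantic features, without relying on features hand-crafted by experts. In relation extraction tasks, neural-network-based methods clearly outperform traditional NER models. Reference [12] first proposed a simple framework based on a recurrent neural network (RNN); the model delivers better performance for relation classification and is particularly able to learn long-distance relation patterns. Reference [13] used a convolutional neural network (CNN) to extract word-level and sentence-level features, taking all word tokens as input without complicated preprocessing. Reference [14] subsequently improved the CNN and proposed the piecewise convolutional neural network (PCNN), which uses multi-instance learning to address the wrong-label problem and the noisy data arising during feature extraction. However, a CNN is limited by the size of its convolution kernel and therefore has certain limitations in learning sequence features. Reference [15] proposed an attention-based bidirectional long short-term memory network to capture the most important semantic information in a sentence, handling the case where important information may appear at any position in the sentence. To mitigate the drop in relation extraction performance caused by wrong labels, reference [16] proposed a relation extraction model based on sentence-level attention.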
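The "piecewise" step of the PCNN in reference [14] can be illustrated as follows: instead of taking a single maximum over the whole sentence, each convolution filter's output is split into three segments by the two entity positions, and the maximum is taken per segment. This is a minimal sketch in plain Python, not the reference implementation. The function name, the choice to include each entity token in the segment that ends at it, and the `0.0` fallback for an empty segment are all assumptions made for illustration.

```python
def piecewise_max_pool(feature_map, head_pos, tail_pos):
    """Max-pool each filter's output over three segments split at the entities.

    feature_map: list of filters, each a list with one value per token.
    head_pos, tail_pos: token indices of the two entities (in any order).
    Returns a flat list with three pooled values per filter.
    """
    first, second = sorted((head_pos, tail_pos))
    pooled = []
    for channel in feature_map:
        segments = (
            channel[:first + 1],            # up to and including the first entity
            channel[first + 1:second + 1],  # after the first, through the second entity
            channel[second + 1:],           # everything after the second entity
        )
        # Assumption: an empty segment (entity at the sentence end) pools to 0.0.
        pooled.extend(max(segment) if segment else 0.0 for segment in segments)
    return pooled


# Two filters over a six-token sentence, entities at token indices 1 and 3.
features = [
    [1, 5, 2, 7, 3, 4],
    [0, -1, -2, -3, -4, -5],
]
pooled = piecewise_max_pool(features, head_pos=1, tail_pos=3)
```

Keeping one maximum per segment preserves coarse positional information relative to the two entities, which a single sentence-wide maximum discards.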

1.1.2. Joint Extraction Methods

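To address the problems of pipeline methods, joint entity and relation extraction methods were proposed, which can identify entities and determine the relations between them at the same time. Joint extraction methods^[17] capture the connection between entities and relations, and they outperform most pipeline methods. Most traditional joint extraction methods are feature-based and depend on extensive feature engineering.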

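In recent years, researchers have used joint extraction methods to mitigate the cumulative error propagation and overlapping entity relations found in pipeline methods. Reference [18] proposed a new end-to-end neural model for extracting relations between entities, which shares parameters between entity extraction and relation classification. Reference [19] designed a hybrid neural network model to extract entities and their relations, comprising Bi-LSTM-ED, a novel bidirectional encoder-decoder long short-term memory (LSTM) module for entity extraction, and a CNN module for relation classification. Reference [20] proposed a joint neural network model that casts entity recognition and relation extraction as a multi-head selection problem.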

1.2. Reinforcement Learning

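In general, training a neural network model requires a large amount of labeled data. Distant supervision can generate labeled data at scale, but the resulting training data are mixed with a large amount of noise. Reinforcement learning is a computational approach to learning and decision making; its two characteristic features, trial-and-error search and delayed reward, make it effective at handling noisy data. As reinforcement learning has developed in the natural language field, many researchers have applied it to large-scale entity relation extraction.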

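Reference [21] used an RL agent to remove noisy data from the training set and used sentence likelihood as the reward to guide training; the reward comes from the performance change of different joint neural networks. Reference [22] used an RL strategy to generate a false-positive indicator that automatically identifies false positives for each relation type without any supervision, with a focus on making the joint model more robust. Reference [23] modeled joint extraction as a two-step decision process, obtained feedback by designing a reward function for each step, and adopted the Q-Learning algorithm as the policy network to control the decision process of the joint extraction model. Reference [24] adopted a hierarchical RL framework to strengthen the interaction between entity extraction and relation types; the whole extraction process is decomposed into a hierarchy of two-level RL policies, used for relation detection and entity extraction respectively. Reference [25] proposed a novel graph neural network with generated parameters, which uses the graph neural network for multi-hop relational reasoning and improves the accuracy of relational reasoning. Reference [26] proposed a novel graph neural network model for relation extraction that takes the interaction between two entities into account and improves relation extraction accuracy.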

Building on these studies, this paper applies reinforcement learning to a joint extraction model to improve its performance.

4. Conclusion

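This paper proposes a joint entity and relation extraction model that integrates reinforcement learning; the model consists of a reinforcement learning component and a joint extraction model. Reinforcement learning removes noisy sentences from the original dataset, and the denoised sentences are fed into the joint extraction model, which then predicts and extracts the entities and relations in the sentences and provides feedback to the reinforcement learning component to guide sentence selection. Extensive experimental results show that the proposed model handles noisy data effectively and achieves good performance in entity relation extraction.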

References (28)

[1]	TUO M, YANG W. Review of entity relation extraction[J]. Journal of Intelligent & Fuzzy Systems, 2023, 44(5): 7391-7405.
[2]	ZHAO S, SU C, LU Z, et al. Recent advances in biomedical literature mining[J]. Briefings in Bioinformatics, 2021, 22(3): bbaa057.
[3]	ELTYEB S, SALIM N. Chemical named entities recognition: A review on approaches and applications[J]. Journal of Cheminformatics, 2014, 6: 1-12.
[4]	BAI T, GUAN H T, WANG S, et al. Traditional Chinese medicine entity relation extraction based on CNN with segment attention[J]. Neural Computing and Applications, 2022, 34(4): 2739-2748.
[5]	YE Z, KUMAR Y J, SING G O, et al. A comprehensive survey of graph neural networks for knowledge graphs[J]. IEEE Access, 2022, 10: 75729-75741.
[6]	AYDAR M, BOZAL Ö, ÖZBAY F. Neural relation extraction: A review[J]. Turkish Journal of Electrical Engineering and Computer Sciences, 2021, 29(2): 1029-1043.
[7]	VITHAYATHIL VARGHESE N, MAHMOUD Q H. A survey of multi-task deep reinforcement learning[J]. Electronics, 2020, 9(9): 1363.
[8]	ZHU Y, WANG G, KARLSSON B F. CAN-NER: Convolutional attention network for Chinese named entity recognition[EB/OL]. [2023-02-22]. https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8844740&tag=1.
[9]	LIU C, YANG S. Using text mining to establish knowledge graph from accident/incident reports in risk assessment[J]. Expert Systems with Applications, 2022, 207: 117991.
[10]	HU W, HE L, MA H, et al. KGNER: Improving Chinese named entity recognition by BERT infused with the knowledge graph[J]. Applied Sciences, 2022, 12(15): 7702.
[11]	LV C, PAN D, LI Y, et al. A novel Chinese entity relationship extraction method based on the bidirectional maximum entropy Markov model[J]. Complexity, 2021, 2021: 1-8.
[12]	ZHANG D, WANG D. Relation classification via recurrent neural network[EB/OL]. [2023-02-26]. https://arxiv.org/pdf/1508.01006.pdf.
[13]	ZENG D, LIU K, LAI S, et al. Relation classification via convolutional deep neural network[EB/OL]. [2023-03-02]. https://aclanthology.org/C14-1220.pdf.
[14]	ZENG D J, LIU K, CHEN Y, et al. Distant supervision for relation extraction via piecewise convolutional neural networks[C]//Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. Lisbon: [s.n.], 2015: 1753-1762.
[15]	ZHOU P, SHI W, TIAN J, et al. Attention-based bidirectional long short-term memory networks for relation classification[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. Berlin: ACL, 2016: 207-212.
[16]	LIN Y, SHEN S, LIU Z, et al. Neural relation extraction with selective attention over instances[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. Berlin: ACL, 2016: 2124-2133.
[17]	YANG Y, WU Z, YANG Y, et al. A survey of information extraction based on deep learning[J]. Applied Sciences, 2022, 12(19): 9691.
[18]	MIWA M, BANSAL M. End-to-end relation extraction using LSTMs on sequences and tree structures[EB/OL]. [2023-03-08]. https://arxiv.org/pdf/1601.00770.pdf.
[19]	ZHENG S, HAO Y, LU D, et al. Joint entity and relation extraction based on a hybrid neural network[J]. Neurocomputing, 2017, 257: 59-66.
[20]	BEKOULIS G, DELEU J, DEMEESTER T, et al. Joint entity recognition and relation extraction as a multi-head selection problem[J]. Expert Systems with Applications, 2018, 114: 34-45.
[21]	FENG J, HUANG M, ZHAO L, et al. Reinforcement learning for relation classification from noisy data[EB/OL]. [2023-03-08]. https://arxiv.org/pdf/1808.08013.pdf.
[22]	QIN P, XU W, WANG W Y. Robust distant supervision relation extraction via deep reinforcement learning[EB/OL]. [2023-03-08]. https://arxiv.org/pdf/1805.09927.pdf.
[23]	FENG Y T, ZHANG H J, HAO W N, et al. Joint extraction of entities and relations using reinforcement learning and deep learning[J]. Computational Intelligence and Neuroscience, 2017, DOI: 10.1155/2017/7643065.
[24]	TAKANOBU R, ZHANG T, LIU J, et al. A hierarchical framework for relation extraction with reinforcement learning[EB/OL]. [2023-03-18]. https://arxiv.org/pdf/1811.03925.pdf.
[25]	ZHU H, LIN Y, LIU Z, et al. Graph neural networks with generated parameters for relation extraction[EB/OL]. [2023-03-22]. https://arxiv.org/pdf/1902.00756.pdf.
[26]	CHRISTOPOULOU F, MIWA M, ANANIADOU S. A walk-based model on entity graphs for relation extraction[EB/OL]. [2023-03-26]. https://arxiv.org/pdf/1902.07023v1.pdf.
[27]	ZHANG S, ZHENG D, HU X, et al. Bidirectional long short-term memory networks for relation classification[EB/OL]. [2023-03-28]. https://aclanthology.org/Y15-1009.pdf.
[28]	LIU Z, DI X Q, SONG W, et al. A sentence-level joint relation classification model based on reinforcement learning[J]. Computational Intelligence and Neuroscience, 2021, 2021: 1-10.

Type	Sentences	Entity pairs	Relations
Training set	522611	281270	18252
Test set	172448	96678	1950

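Parameter	Value
Word embedding dimension	50
Position embedding dimension	5
Bi-LSTM hidden units	128
GCN dimension	256
GCN layers	3
Learning rate	0.02
Dropout rate	0.5
Update rate	0.01
Weight factor	0.01
Attention heads	8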

Model	Precision	Recall	F1
Bi-LSTM	0.615	0.414	0.495
Bi-LSTM-ATT	0.650	0.447	0.524
GCN	0.635	0.426	0.510
GAT	0.672	0.465	0.551
GCN-MHATT	0.693	0.482	0.569

Model	Precision	Recall	F1
Bi-LSTM-ATT	0.650	0.447	0.524
Bi-LSTM-ATT-RL	0.661	0.452	0.528
GCN-MHATT	0.693	0.482	0.569
GCN-MHATT-RL	0.742	0.527	0.616

Model	Precision	Recall	F1
GCN	0.635	0.426	0.510
GCN-RL	0.681	0.463	0.576
GCN-MHATT	0.693	0.482	0.569
GCN-MHATT-RL	0.742	0.527	0.616

1. School of Computer Science & Technology, Xi’an University of Posts and Telecommunications, Xi’an 710121, China

2. Shaanxi Key Laboratory of Network Data Analysis & Intelligent Processing, Xi’an 710121, China