基于双向门控循环神经网络的事件论元抽取方法

葛唯益; 程思伟; 王羽; 徐建

doi:10.12178/1001-0548.2021153

基于双向门控循环神经网络的事件论元抽取方法

doi: 10.12178/1001-0548.2021153

葛唯益¹,
程思伟²,
王羽¹,
徐建^2, ,

1.
中国电子科技集团公司第二十八研究所信息系统工程重点实验室　南京　210007
2.
南京理工大学计算机科学与工程学院　南京　210094

基金项目: 国家自然科学基金(61872186)

详细信息

作者简介:
葛唯益(1985-)，男，博士，高级工程师，主要从事知识图谱、自然语言处理等方面的研究

通讯作者: 徐建，dolphin.xu@njust.edu.cn

中图分类号: TP183

Bi-GRU-Based Event Argument Extraction Approach

GE Weiyi¹,
CHENG Siwei²,
WANG Yu¹,
XU Jian^{2
, ,}

1.
Science and Technology on Information Systems Engineering Laboratory, the 28th Research Institute of China Electronics Technology Group Corporation　Nanjing　210007
2.
School of Computer Science & Engineering, Nanjing University of Science & Technology　Nanjing　210094

摘要: 事件抽取是构建知识图谱的关键前置任务之一，而事件论元抽取是事件抽取的子任务，对事件抽取质量有显著影响。针对现有的流水线式事件抽取方法在论元抽取时忽略了触发词和论元间、论元和论元间相互关系导致抽取质量低的问题，该文提出了一种基于双向门控循环神经网络(Bi-GRU)的事件论元抽取方法。该方法融合Bert词向量、词性特征、词位置特征和触发词类型特征作为输入，采用Bi-GRU网络对文本中的词进行编码，进而应用改进的多注意力机制为句子不同部分分配权重提取句子级别特征，最后通过全连接层实现论元识别和角色分类。在基准数据集上进行了实验验证，结果表明论元识别和角色分类任务的F1-score值分别达到了69.2%和61.6%，优于现有方法。
- 注意力机制 /
- 事件抽取 /
- 循环神经网络 /
- 词嵌入
Abstract: Event extraction is one of the important precedent tasks for knowledge graphs, while as a sub-task, event argument extraction has a significant impact on the quality of event extraction. The existing pipelined event extraction approaches usually ignore the relationships between triggers and arguments, or among arguments, which leads to low quality of event extraction. To solve this issue, this paper proposes a bidirectional gated recurrent neural network (Bi-GRU)-based event argument extraction approach. The proposed approach considers Bert-based word vector, word part-of-speech, word position, and trigger types as features, applies Bi-GRU to encode these features of each word in a sentence to get a word vector, leverages the improved multi-attention mechanism to assign weights to different parts of the sentence, and finally identify arguments and their roles in the sentence by a full-connection layer. Experiments are conducted on a benchmark dataset, and experimental results show that argument recognition and role classification tasks achieve 69.2% and 61.6% in F1-score respectively, and are better than compared state-of-the-art approaches.
- attention mechanism /
- event extraction /
- recurrent neural network /
- word embedding

图 1 基于双向门控循环单元和多注意力机制的事件论元抽取模型

下载: 全尺寸图片幻灯片

表 1 特征编码层不同的单词编码方式对模型性能的影响

词向量	论元识别/%			论元角色分类/%
词向量	Precision	Recall	F1	Precision	Recall	F1
word2vec	67.7	60.2	63.7	58.6	53.4	55.8
GloVe	68.3	58.1	62.7	59.4	53.2	56.1
ELMo	70.2	65.7	67.9	67.1	54.7	60.2
BERT	73.3	65.6	69.2	69.9	55.1	61.6

下载: 导出CSV

表 2 Bi-GRU-MATT模型相关消融实验

模型	论元识别/%			论元角色分类/%
模型	Precision	Recall	F1	Precision	Recall	F1
Bi-GRU-MATT	73.3	65.6	69.2	69.9	55.1	61.6
Multi Attention	70.9	64.5	67.5	68.2	53.6	60.0
Bi-GRU	72.6	64.4	68.2	68.9	54.4	60.8

下载: 导出CSV

表 3 不同注意力权重函数对模型性能的影响

注意力函数	论元识别/%			角色分类/%
注意力函数	Precision	Recall	F1	Precision	Recall	F1
Multiplicative	72.7	64.9	68.6	69.2	55.2	61.4
Additive	72.8	64.7	68.5	68.8	54.7	60.9
Symmetric multi	73.0	65.2	68.8	69.4	54.9	61.3
Symmetric multi (ReLU)	73.6	65.1	69.0	69.7	55.0	61.5
Scaled multi(ReLU)	73.3	65.6	69.2	69.9	55.1	61.6

下载: 导出CSV

表 4 Bi-GRU-MATT模型在单论元事件句(1/1)和多论元事件句(1/N)上的抽取性能

模型	性能/%
模型	1/1	1/N	F1-score
Embedding+T	37.4	15.5	32.6
CNN	51.6	36.6	48.9
DMCNN	54.6	48.7	53.5
JMEE	59.3	57.6	60.3
JRNN	50.0	55.2	55.4
Bi-GRU-MATT	60.1	58.7	61.6

下载: 导出CSV

表 5 Bi-GRU-MATT与其他先进方法的性能比较

模型	论元识别/%			论元角色分类/%
模型	Precision	Recall	F1	Precision	Recall	F1
Cross-Event	50.9	49.7	50.3	45.1	44.1	44.6
Cross-Entity	53.4	52.9	53.1	51.6	45.5	48.3
DMCNN	68.8	51.9	59.1	62.2	46.9	53.5
S-CNNs	69.2	50.8	58.6	63.3	45.8	53.1
RBPB	63.2	59.4	61.2	54.1	53.5	53.8
JRNN	61.4	64.2	62.8	54.2	56.7	55.4
dbRNN	71.3	64.5	67.7	66.2	52.8	58.7
JMEE	71.4	65.6	68.4	66.8	54.9	60.3
Ding’s model	64.7	65.0	64.8	57.4	55.8	56.6
Joint3EE	59.9	59.8	59.9	52.1	52.1	52.1
Bi-GRU-MATT	73.3	65.6	69.2	69.9	55.1	61.6

下载: 导出CSV

[1]	RILOFF E. Automatically constructing a dictionary for information extraction tasks[C]//Proceedings of the Eleventh National Conference on Artificial Intelligenc. Menlo Park, CA: AAAI, 1993, 1(1): 811-816.
[2]	RILOFF E, SHOEN J. Automatically acquiring conceptual patterns without an annotated corpus[C]//The 3rd Workshop on Very Large Corpora. [S.l.]: Association for Computational Linguistics, 1995: 148-161.
[3]	姜吉发. 一种事件信息抽取模式获取方法[J]. 计算机工程, 2005, 31(15): 96-98. doi: 10.3969/j.issn.1000-3428.2005.15.035 JIANG J F. An event IE pattern acquisition method[J]. Computer Engineering, 2005, 31(15): 96-98. doi: 10.3969/j.issn.1000-3428.2005.15.035
[4]	ARENDARENKO E, KAKKONEN T. Ontology-based information and event extraction for business intelligence[C]//International Conference on Artificial Intelligence: Methodology, Systems, and Applications. Heidelberg: Springer, 2012: 89-102.
[5]	KIM J T, MOLDOVAN D I. Acquisition of linguistic patterns for knowledge-based information extraction[J]. IEEE Transactions on Knowledge and Data Engineering, 1995, 7(5): 713-724. doi: 10.1109/69.469825
[6]	CHIEU H L, NG H T. A maximum entropy approach to information extraction from semi-structured and free text[C]//The 18th National Conference on Artificial Intelligence. [S.l.]: AAAI, 2002: 786-791.
[7]	LLORENS H, SAQUETE E, NAVARRO B. TimeML events recognition and classification: Learning CRF models with semantic roles[C]//Proceedings of the 23rd International Conference on Computational Linguistics. Beijing: Tsinghua University Press, 2010: 725-733.
[8]	丁效, 宋凡, 秦兵, 等. 音乐领域典型事件抽取方法研究[J]. 中文信息学报, 2011, 25(2): 15-21. doi: 10.3969/j.issn.1003-0077.2011.02.003 DING X, SONG F, QIN B, et al. Research on typical event extraction method in the field of music[J]. Journal of Chinese Information Processing, 2011, 25(2): 15-21. doi: 10.3969/j.issn.1003-0077.2011.02.003
[9]	LIAO S S, GRISHMAN R. Using prediction from sentential scope to build a pseudo co-testing learner for event extraction[C]//Proceedings of 5th International Joint Conference on Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2011: 714-722.
[10]	LIU S L, LIU K, HE S Z, et al. A probabilistic soft logic based approach to exploiting latent and global information in event classification[C]//The 30th AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI, 2016: 2993-2999.
[11]	LI P F, ZHU Q M, DIAO H J, et al. Joint modeling of trigger identification and event type determination in Chinese event extraction[C]//Proceedings of COLING 2012. Mumbai: The Coling 2012 Organizing Committee, 2012: 1635-1652.
[12]	LI Q, JI H, HUANG L. Joint event extraction via structured prediction with global features[C]//Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2013: 73-82.
[13]	MCCLOSKY D, SURDEANU M, MANNING C D. Event extraction as dependency parsing[C]//Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: Association for Computational Linguistics, 2011: 1626-1635.
[14]	RIEDEL S, SAETRE R, CHUN H W, et al. Bio-molecular event extraction with Markov logic[J]. Computational Intelligence, 2011, 27(4): 558-582. doi: 10.1111/j.1467-8640.2011.00400.x
[15]	VENUGOPAL D, CHEN C, GOGATE V, et al. Relieving the computational bottleneck: Joint inference for event extraction with high-dimensional features[C]//Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2014: 831-843.
[16]	TONG M H, XU B, WANG S, et al. Improving event detection via open-domain trigger knowledge[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2020: 5887-5897.
[17]	CHEN Y B, XU L H, LIU K, et al. Event extraction via dynamic multi-pooling convolutional neural networks[C]//Proceedings of the 53th Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2015: 167-176.
[18]	NGUYEN T H, GRISHMAN R. Event detection and domain adaptation with convolutional neural networks[C]//Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2015: 365-371.
[19]	NGUYEN T H, CHO K, GRISHMAN R. Joint event extraction via recurrent neural networks[C]//Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: Association for Computational Linguistics, 2016: 300-309.
[20]	ZHANG W B, DING X, LIU T. Learning target-dependent sentence representations for Chinese event detection[C]//China Conference on Information Retrieval. Switzerland: Springer, 2018: 251-262.
[21]	DUAN S Y, HE R F, ZHAO W L. Exploiting document level information to improve event detection via recurrent neural networks[C]//Proceedings of the Eighth International Joint Conference on Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2017: 352-361.
[22]	LEI S, QIAN F, CHANG B B, et al. Jointly extracting event triggers and arguments by dependency-bridge RNN and tensor-based argument interaction[C]//Proceedings of the AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI, 2018: 5916-5923.
[23]	LIU X, LUO Z C, HUANG H Y. Jointly multiple events extraction via attention-based graph information aggregation[C]//Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2018: 1247-1256.
[24]	ZHANG Z K, XU W R, CHEN Q Q. Joint event extraction based on skip-window convolutional neural networks[M]//Natural Language Understanding and Intelligent Applications. Switzerland: Springer, 2016: 324-334.
[25]	DING R X, LI Z J. Event extraction with deep contextualized word representation and multi-attention layer[C]//International Conference on Advanced Data Mining and Applications. Switzerland: Springer, 2018: 189-201.
[26]	NGUYEN T M, NGUYEN T H. One for all: Neural joint modeling of entities and events[C]//Proceedings of the AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI, 2019: 6851-6858.

[1]	陈欣, 李闯, 金凡. 量子自注意力神经网络的时间序列预测 . 电子科技大学学报, 2024, 53(1): 110-118. doi: 10.12178/1001-0548.2022340
[2]	要媛媛, 付潇, 杨东瑛, 王洁宁, 郑文. 结合全局信息增强的医学领域命名实体识别研究 . 电子科技大学学报, 2024, 53(): 1-10. doi: 10.12178/1001-0548.2023064
[3]	刘志刚, 张国辉, 高月, 刘苗苗. 多因素引导的行人重识别数据增广方法研究 . 电子科技大学学报, 2024, 53(2): 235-242. doi: 10.12178/1001-0548.2023056
[4]	崔少国, 独潇, 张宜浩. 基于兴趣注意力网络的会话推荐算法 . 电子科技大学学报, 2024, 53(1): 67-75. doi: 10.12178/1001-0548.2022307
[5]	李阳, 李春璇, 徐灿飞, 方立梅. 基于残差注意力机制的肺结节数据增强方法 . 电子科技大学学报, 2023, 52(6): 880-886. doi: 10.12178/1001-0548.2022363
[6]	黄颖, 许剑, 周子祺, 陈树沛, 周帆, 曹晟. 高效长序列水位预测模型的研究与实现 . 电子科技大学学报, 2023, 52(4): 595-601. doi: 10.12178/1001-0548.2022133
[7]	莫太平, 黄巧人, 陈德鸿, 伍锡如, 张向文. 改进可逆缩放网络的图像超分辨率重建 . 电子科技大学学报, 2023, 52(5): 739-746. doi: 10.12178/1001-0548.2022261
[8]	赵云龙, 田生祥, 李岩, 罗龙, 齐鹏文. 基于注意力模型和Soft-NMS的输电线路小目标检测方法 . 电子科技大学学报, 2023, 52(6): 906-914. doi: 10.12178/1001-0548.2022290
[9]	张凤荔, 王雪婷, 王瑞锦, 汤启友, 韩英军. 融合动态图表示和自注意力机制的级联预测模型 . 电子科技大学学报, 2022, 51(1): 83-90. doi: 10.12178/1001-0548.2021100
[10]	王磊, 孙志成, 王磊, 陈端兵, 蒋家玮. 基于DRSN-CW和LSTM的轴承故障诊断 . 电子科技大学学报, 2022, 51(6): 921-927. doi: 10.12178/1001-0548.2021385
[11]	毛文清, 徐雅斌. 基于深度图卷积网络的社交机器人识别方法 . 电子科技大学学报, 2022, 51(4): 615-622, 629. doi: 10.12178/1001-0548.2021280
[12]	周丰丰, 牛甲昱. 细胞穿膜肽识别问题的多特征融合卷积网络预测算法 . 电子科技大学学报, 2022, 51(4): 494-499. doi: 10.12178/1001-0548.2021391
[13]	代翔. 基于事件模式及类型的事件检测模型 . 电子科技大学学报, 2022, 51(4): 592-599. doi: 10.12178/1001-0548.2021377
[14]	郭磊, 王邱龙, 薛伟, 郭济. 基于注意力机制的光线昏暗条件下口罩佩戴检测 . 电子科技大学学报, 2022, 51(1): 123-129. doi: 10.12178/1001-0548.2021222
[15]	头旦才让, 仁青东主, 尼玛扎西, 于永斌, 邓权芯. 基于改进字节对编码的汉藏机器翻译研究 . 电子科技大学学报, 2021, 50(2): 249-255, 293. doi: 10.12178/1001-0548.2020218
[16]	赵磊, 高联丽, 宋井宽. 面向视觉对话的自适应视觉记忆网络 . 电子科技大学学报, 2021, 50(5): 749-753. doi: 10.12178/1001-0548.2021057
[17]	吴劲, 陈树沛, 杨庆, 周帆. 基于图神经网络的用户轨迹分类 . 电子科技大学学报, 2021, 50(5): 734-740. doi: 10.12178/1001-0548.2020435
[18]	李学明, 岳贡, 陈光伟. 基于多模态注意力机制的图像理解描述新方法 . 电子科技大学学报, 2020, 49(6): 867-874. doi: 10.12178/1001-0548.2019228
[19]	邓钰, 雷航, 李晓瑜, 林奕欧. 用于目标情感分类的多跳注意力深度模型 . 电子科技大学学报, 2019, 48(5): 759-766. doi: 10.3969/j.issn.1001-0548.2019.05.016
[20]	李明奇, 李玉柏, 彭启琮. 基于循环神经网络的OFDM系统的失真补偿 . 电子科技大学学报, 2007, 36(4): 677-680.

点击查看大图

图(1) / 表(5)

计量

文章访问数: 4271
HTML全文浏览量: 1415
PDF下载量: 70
被引次数: 0

全文HTML

文本事件抽取旨在从句子或文档中识别发生的事件，以结构化的方式描述事件的触发词、事件类型、事件论元及其角色，通常是信息检索中的重要前置任务之一，在诸多领域有着广泛应用。如在政府公共事务管理领域，及时捕获社会事件的爆发和掌握演变动态将有助于快速应急响应和事件处置，维护社会安定。因此，面向以自然语言形式存在的文本数据，研究满足应用场景需要的事件抽取方法成为当前热点研究课题之一。

尽管已经开展了相关的研究工作，事件抽取仍然是一项颇具挑战性的任务，主要原因有以下几个方面。首先，自然语言形式表达的文本信息通常具有语义歧义和多样化的话语风格，增加了处理难度。其次，事件抽取还依赖于自然语言处理(natural language processing, NLP)中若干子任务的性能，如命名实体识别、词性标记和语法解析等。为了应对上述挑战，文献[1-5]提出了基于模式匹配的事件抽取方法。该方法先构造一些特定的事件模板，然后执行模板匹配从文本中提取带有参数的事件。代表性的工作有AutoSlog^[1]、GenPAM^[3]、BEECON^[4]和PALKA^[5]。虽然由具有专业知识的专家手动构建事件模式质量非常高，且针对特定领域通常可以实现较高的抽取精度，但是手动构建耗时费力，且无法迁移应用到其他领域中。随着机器学习方法在事件抽取方面的广泛应用，研究人员又提出了基于机器学习的事件抽取方法克服人工构建模板的局限性。该方法的基本思路是从训练数据中学习分类器，并将分类器应用于从新文本中提取事件。由于事件抽取可以进一步分为触发词抽取和论元抽取两个子任务，根据两个子任务的完成时间顺序，可以划分为基于流水线式的事件抽取模型^[6-10]和联合抽取模型^[11-15]。前者将触发词抽取和论元抽取任务以串行的方式进行，且针对任务特点采用不同的分类器，更注重结构性，针对性模型能够收获更好的效果；而后者同时完成触发词抽取和论元抽取任务，考虑两个任务之间的信息交互，注重任务的整体性。最近，神经网络在NLP任务中不断取得突破，基于深度神经网络的事件抽取方法^[16-23]研究得到了很多关注，寻找抽取效果更佳的深度学习模型成为主要难点问题。代表性的工作有：基于卷积神经网络(convolutional neural networks, CNN)的事件抽取方法DMCNN^[17]，基于递归神经网络(recurrent neural networks, RNN)的事件抽取方法JRNN^[19]和dbRNN^[22]，基于图卷积神经网络的事件抽取方法JMEE^[23]。基于CNN的事件抽取方法的缺点是无法很好地捕捉到距离较远的单词之间的相互关系，因为CNN是将单词嵌入级联作为输入的。RNN刻画可以利用直接或者间接连接的两个任意的词之间的潜在依赖关系，但也存在长距离遗忘的问题。此外，现有的基于深度神经网络的事件抽取方法大多忽略触发词与触发词之间的关联，在多事件句上的效果不佳。

针对上述问题，本文提出一种基于双向门控循环神经网络(bidirectional gated recurrent neural network, Bi-GRU)和多注意力机制的事件论元抽取模型，该模型在输入层结合深度上下文词向量和基础特征编码句子，经过Bi-GRU层特征提取后，输入改进的多注意力机制层，从3个方向计算注意力权重，编码语义结构之间的相似度，最后进行分类，完成事件论元抽取任务。

3. 结束语

本文提出了一个基于Bi-GRU和改进注意力机制的事件论元抽取模型Bi-GRU-MATT。该模型在特征编码层同样使用了深度的上下文预训练语言模型BERT，并结合词性特征和位置特征，以及触发词特征来编码单词向量，之后送入Bi-GRU网络中编码长距离的依赖关系，再输入多注意力机制层计算注意力权重，生成事件向量和句子表示向量，级联输入全连接层完成最后的分类工作。实验表明该模型可以显著提升事件论元抽取的效果，在事件论元识别和论元角色分类任务上达到了较好的效果，F1-score值分别为69.2%和61.6%。

本文的研究工作得到了信息系统工程重点实验室开放基金项目(05201901)的支持，在此深表感谢！

参考文献 (26)

姓名
邮箱
手机号码
标题
留言内容
验证码

留言板

基于双向门控循环神经网络的事件论元抽取方法

doi: 10.12178/1001-0548.2021153

作者简介:
葛唯益(1985-)，男，博士，高级工程师，主要从事知识图谱、自然语言处理等方面的研究

通讯作者: 徐建，dolphin.xu@njust.edu.cn

Bi-GRU-Based Event Argument Extraction Approach

计量

基于双向门控循环神经网络的事件论元抽取方法

doi: 10.12178/1001-0548.2021153

1. 中国电子科技集团公司第二十八研究所信息系统工程重点实验室　南京　210007

2. 南京理工大学计算机科学与工程学院　南京　210094

作者简介:
葛唯益(1985-)，男，博士，高级工程师，主要从事知识图谱、自然语言处理等方面的研究

通讯作者: 徐建，dolphin.xu@njust.edu.cn

English Abstract

Bi-GRU-Based Event Argument Extraction Approach

1. Science and Technology on Information Systems Engineering Laboratory, the 28th Research Institute of China Electronics Technology Group Corporation　Nanjing　210007

2. School of Computer Science & Engineering, Nanjing University of Science & Technology　Nanjing　210094

全文HTML

1.1. 框架

1.2. 特征编码层

1.3. 双向门控循环单元层

1.4. 多注意力机制层

1.5. 全连接层

1.6. 损失函数

2.1. 实验设置

2.2. 消融实验

2.2.1. BERT对模型性能的影响

2.2.2. Bi-GRU层和多注意力机制层对模型性能的影响

2.3. 多注意力机制层相关分析

2.4. 模型在多论元事件句上的表现

2.5. 对比实验

目录

期刊在线

编辑办公

友情链接

留言板

基于双向门控循环神经网络的事件论元抽取方法

doi: 10.12178/1001-0548.2021153

作者简介: 葛唯益(1985-)，男，博士，高级工程师，主要从事知识图谱、自然语言处理等方面的研究

通讯作者: 徐建，dolphin.xu@njust.edu.cn

Bi-GRU-Based Event Argument Extraction Approach

计量

出版历程

基于双向门控循环神经网络的事件论元抽取方法

doi: 10.12178/1001-0548.2021153

1. 中国电子科技集团公司第二十八研究所信息系统工程重点实验室 南京 210007 2. 南京理工大学计算机科学与工程学院 南京 210094

作者简介: 葛唯益(1985-)，男，博士，高级工程师，主要从事知识图谱、自然语言处理等方面的研究

通讯作者: 徐建，dolphin.xu@njust.edu.cn

English Abstract

Bi-GRU-Based Event Argument Extraction Approach

1. Science and Technology on Information Systems Engineering Laboratory, the 28th Research Institute of China Electronics Technology Group Corporation Nanjing 210007 2. School of Computer Science & Engineering, Nanjing University of Science & Technology Nanjing 210094

全文HTML

1.1. 框架

1.2. 特征编码层

1.3. 双向门控循环单元层

1.4. 多注意力机制层

1.5. 全连接层

1.6. 损失函数

2.1. 实验设置

2.2. 消融实验

2.2.1. BERT对模型性能的影响

2.2.2. Bi-GRU层和多注意力机制层对模型性能的影响

2.3. 多注意力机制层相关分析

2.4. 模型在多论元事件句上的表现

2.5. 对比实验

目录

期刊在线

编辑办公

友情链接

作者简介:
葛唯益(1985-)，男，博士，高级工程师，主要从事知识图谱、自然语言处理等方面的研究

1. 中国电子科技集团公司第二十八研究所信息系统工程重点实验室　南京　210007

2. 南京理工大学计算机科学与工程学院　南京　210094

作者简介:
葛唯益(1985-)，男，博士，高级工程师，主要从事知识图谱、自然语言处理等方面的研究

1. Science and Technology on Information Systems Engineering Laboratory, the 28th Research Institute of China Electronics Technology Group Corporation　Nanjing　210007

2. School of Computer Science & Engineering, Nanjing University of Science & Technology　Nanjing　210094