Event Detection Model Based on Event Pattern and Type Bias

DAI Xiang

doi:10.12178/1001-0548.2021377

To address the problems of vague criteria for trigger word definition and the high cost of corpus annotation, a deep learning model for event detection called pattern and type based neural network (PTNN) is proposed. First, potential theorems are obtained based on entities' syntactic and semantic features. Then, the potential theorems are abstracted as roles. The embedding representation of PTNN is constructed by combining syntactic, semantic, and role features to enhance the representation of event patterns. Last, event detection and type determination are accomplished by using Bi-LSTM (bidirectional long short-term memory) with an event type-based attention mechanism. The model achieves event detection by enhancing event pattern features instead of identifying trigger words, thus avoiding the challenging problem of trigger word annotation. Such an approach demonstrates the positive effect of event patterns for event detection on neural networks. Experiments demonstrate that it improves the state-of-the-art of event detection by 3%.

HTML

事件是重要的信息表现形式，具有典型的类型及模式特征。事件模式体现事件的结构化特征，由事件的角色及其关系构成。不同类型事件在模式特征上有各自的个性特征。事件检测任务识别事件及其类型，主流研究均基于触发词特征完成检测。方法需专业人员对训练语料标注触发词，标注成本高且部分领域难以形成有效的标注标准。

为解决上述问题，本文提出面向事件模式及类型特征的事件检测深度学习模型(pattern and type bias neural network, PTNN)。模型不检测触发词，通过实体的语法、语义特征获取潜在论元，判断其对应角色进行抽象表示，凸显事件模式信息；融合潜在论元的语法、语义及角色特征构建嵌入表示，利用深度学习模型学习事件模式特征及文本语义，并结合事件类型注意力机制实现事件检测。

主要工作如下：

1)以实体类型、依存关系、词性定义论元特征约束，基于论元特征约束抽取潜在论元；

2)以角色抽象潜在论元并进行表示替换，结合语法、语义、角色特征构建词嵌入表示，凸显输入的事件模式信息；

3)使用双向长短时网络(Bi-long short-term memory, Bi-LSTM)及类型注意力机制构建深度学习模型，根据输入的词、句级特征，学习事件模式及特征类型，实现事件检测。

1. 相关工作

事件检测的主流方法为面向触发词的检测，包括基于特征的方法和基于表示的方法。基于特征的方法以触发词统计特性为依据，定义特征建模触发词。常用特征包括词法、句法特征、篇章信息及外部知识等^[1-8]。特征构建耗时耗力，效果一般且不稳定。基于表示的方法为目前主流方法，其利用标注触发词的训练语料，学习输入的高维特征实现触发词识别^[9-14]。虽避免了人工构建特征，但语料需专业人员标注，具有触发词标准难以制定、标注成本高等问题，且同一词语可触发多类事件的问题未很好解决。

而事件论元、模式等特征同样体现事件本质，因此许多研究从这些特征入手。部分研究从事件实例出发，利用同类事件在实体组成、语句结构上的相似性进行事件检测。文献[15]基于相同事件拥有相似实体的假设，根据实体元素对文本聚类，得到若干事件簇实现面向开放域的事件检测。方法避免了触发词标注，但无法给出事件簇对应的具体类型，得到的结果是若干关键词，无法用于后续任务。文献[16]以实体和其类型构建“槽值对”，基于同类事件具有相似“槽值对”集的假设分析事件类型与槽值对的关联。方法一定程度上实现了类型判定，但“槽值对”不能全面体现事件框架特征，对部分事件区分度较低。部分研究根据不同类型事件在实体、模式上的差别，预先定义事件表示框架实现事件检测。文献[17]结合事件的5W1H分析法，定义7种突发事件类型的事件框架，抽取突发事件用于公共安全预警；相似研究在科技、金融等领域也取得了一定成果^[18-19]。该类方法结合远程监督可自动生成标注数据，解决了语料标注困难的问题。但事件框架表示结构复杂，严重依赖专家知识，且不同领域和任务需要不同的表示框架，重复定义工作量大。文献[20]提出了基于实体的TBNNAM模型。其以实体为原始特征构建输入，使用LSTM与基于类型的注意力完成事件识别。但该方法仅利用了实体信息，没有考虑事件模式特征，且只利用了单向语义信息。文献[21]结合Bi-LSTM与基于类型的多层多头注意力，基于词向量生成长文本的向量表示，识别电影剧本是否包含指定类型事件。但其关注目标为文档级事件，弱化了细粒度事件的识别和判定。

4. 结束语

实验证明，PTNN模型在不依靠触发词的情况下，仅依靠事件模式信息和潜在论元信息达到了触发词模型的检测水平，同时其表现优于同类模型。验证了模式信息对事件检测的意义，并验证了其在深度学习模型中的可用性。

后续拟进一步探究如何更好的表示事件模式特征，同时探究如何进行事件模式与类型的联合识别。

Reference (21)

[1]	AHN D. The stages of event extraction[C]//Proceedings of the Workshop on Annotating and Reasoning about Time and Events. Sydney: Association for Computational Linguistics, 2006: 1-8.
[2]	JI H, GRISHMAN R. Refining event extraction through cross-document inference[C]//Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics. Columbus: Association for Computational Linguistics, 2008: 254-262.
[3]	CHEN Z, JI H. Language specific issue and feature exploration in Chinese event extraction[C]//Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers. [S.l.]: Association for Computational Linguistics, 2009: 209-212.
[4]	LIAO S, GRISHMAN R. Using document level cross-event inference to improve event extraction[C]//Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. [S.l.]: Association for Computational Linguistics, 2010: 789-797.
[5]	LI Q, JI H, HUANG L. Joint event extraction via structured prediction with global features[C]//Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Sofia: Association for Computational Linguistics, 2013: 73-82.
[6]	张建恒, 黄蔚, 胡国超. 基于LDA模型和AP聚类的主题事件抽取技术[J]. 计算机与现代化, 2017(12): 77-81.	ZHANG J H, HUANG W, HU G C. Topic event extraction technology based on LDA model and AP clustering method[J]. Computer and Modernization, 2017(12): 77-81.
[7]	高源, 席耀一, 李弼程. 基于依存句法分析与分类器融合的触发词抽取方法[J]. 计算机应用研究, 2016(5): 1407-1410.	GAO Y, XI Y Y, LI B C. Trigger extraction algorithm based on dependency parsing and classifier fusion[J]. Application Research of Computers, 2016(5): 1407-1410.
[8]	万齐智, 万常选, 胡蓉, 等. 基于句法语义依存分析的中文金融事件抽取[J]. 计算机学报, 2021, 44(3): 508-530.	WAN Q Z, WAN C X, HU R, et al. Chinese financial event extraction base on syntactic and semantic dependency parsing[J]. Chinese Journal of Computers, 2021, 44(3): 508-530.
[9]	CHEN Y, LIU S, HE S, et al. Event extraction via bidirectional long short-term memory tensor neural networks[M]//Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. [S.1.]: Springer, 2016: 190-203.
[10]	WU Y, ZHANG J. Chinese event extraction based on Attention and semantic features: A bidirectional circular neural network[J]. Future Internet, 2018, 10(10): 95.
[11]	LIN H, LU Y, HAN X, et al. Nugget proposal networks for Chinese event detection[EB/OL]. [2021-10-11]. https://arxiv.org/pdf/1805.00249.pdf.
[12]	FENG X, QIN B, LIU T. A language-independent neural network for event detection[J]. Science China Information Sciences, 2018, 61(9): 1-12.
[13]	田梓函, 李欣. 基于BERT-CRF模型的中文事件检测方法研究[J]. 计算机工程与应用, 2021, 57(11): 135-139.	TIAN Z H, LI X. Research on Chinese event detection method based on BERT-CRF model[J]. Computer Engineering and Applications, 2021, 57(11): 135-139.
[14]	陈安南, 叶岩宁, 王畅畅, 等. 基于 BERT-DGCNN 的中文事件抽取方法研究[J]. 计算机科学与应用, 2021, 11: 1572.	CHEN A N, YE Y N, WANG C C, et al. Research on Chinese event extraction method based on BERT-DGCNN[J]. Computer Science and Application, 2021, 11: 1572.
[15]	ZHOU D, ZHANG X, HE Y. Event extraction from Twitter using non-parametric Bayesian mixture model with word embeddings[C]//Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers. [S. l.]: Association for Computational Linguistics, 2017: 808-817.
[16]	YUAN Q, REN X, HE W, et al. Open-schema event profiling for massive news corpora[C]//Proceedings of the 27th ACM International Conference on Information and Knowledge Management. [S.l.]: Association for Computing Machinery, 2018: 587-596.
[17]	PETRONI F, RAMAN N, NUGENT T, et al. An extensible event extraction system with cross-media event resolution[EB/OL]. [2021-11-10]. https://aclanthology.org/E17-1076.pdf.
[18]	刘振. 基于网络科技信息的事件抽取研究[J]. 情报科学, 2018, 36(9): 115-117.	LIU Z. Research on event extraction from networks scientific information[J]. Information Science, 2018, 36(9): 115-117.
[19]	YANG H, CHEN Y, LIU K, et al. Dcfee: A document-level chinese financial event extraction system based on automatically labeled training data[C]//Proceedings of ACL 2018, System Demonstrations. Melbourne: Association for Computational Linguistics, 2018: 50-55.
[20]	LIU S, LI Y, ZHANG F, et al. Event detection without triggers[C]//Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Minnesota: Association for Computational Linguistics, 2019: 735-744.
[21]	XU W, ZHANG W, WANG D. Event detection without trigger words on movie scripts[C]//2020 International Conference on Image, Video Processing and Artificial Intelligence. [S.l.]: SPIE, 2020, 11584: 115841G.

事件类型	参与者可对应实体类型
Life-marry	PER
Life-injure	PER、ORG、GPE、WEA、VEH、SUB
Business-start-org	PER、ORG、GPE

实体类型	依存关系	角色$r$	抽象词${r_w}$
人、组织机构	主谓类	施动者	%sub%
	表被动的主谓类	受动者	%obj%
	动宾类	受动者	%sub%
时间	状语类	时间	%time%
地名、国家名	状语类	地点	%loc%
其他实体	状语类、动宾类	其他	−

数据集	文档数	语句数	事件句数	事件数
训练集	529	12855	2929	3376
验证集	30	826	322	392
测试集	40	634	278	347
总计	599	14365	3529	4117

基线名称	输入	编码器	句子表示
MC-LSTM_avg	<Sentence >	LSTM	隐层均值
MC-LSTM_last	<Sentence >	LSTM	LSTM输出
BC-LSTM_avg	<Sentence,Type>	LSTM	隐层均值
BC-LSTM_last	<Sentence,Type>	LSTM	LSTM输出
TBNNAM	<Sentence,Type>	LSTM	LSTM输出

方法	P	R	F₁
MC-LSTM_avg	0.552	0.421	0.478
MC-LSTM_last	0.561	0.416	0.477
BC-LSTM_avg	0.563	0.597	0.579
BC-LSTM_last	0.589	0.615	0.602
TBNNAM	0.707	0.602	0.65
子模型M₁	0.708	0.636	0.67

Event Detection Model Based on Event Pattern and Type Bias

doi: 10.12178/1001-0548.2021377

Abstract

References

Proportional views

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Related

Proportional views