A BERT-Based Vector Autoregressive Network for Sentiment Analysis of Financial News

ZHANG Dian; WANG Jiening; LI Zhaoying; LIU Runnan; ZHENG Wen

doi:10.12178/1001-0548.2022058

Volume 52 Issue 2

Mar. 2023

Article Contents

Article Navigation > Journal of University of Electronic Science and Technology of China > 2023 > 52(2): 263-270

ZHANG Dian, WANG Jiening, LI Zhaoying, LIU Runnan, ZHENG Wen. A BERT-Based Vector Autoregressive Network for Sentiment Analysis of Financial News[J]. Journal of University of Electronic Science and Technology of China, 2023, 52(2): 263-270. doi: 10.12178/1001-0548.2022058

Citation:

ZHANG Dian, WANG Jiening, LI Zhaoying, LIU Runnan, ZHENG Wen. A BERT-Based Vector Autoregressive Network for Sentiment Analysis of Financial News[J]. Journal of University of Electronic Science and Technology of China, 2023, 52(2): 263-270. doi: 10.12178/1001-0548.2022058

A BERT-Based Vector Autoregressive Network for Sentiment Analysis of Financial News

doi: 10.12178/1001-0548.2022058

1.
College of Data Science, Taiyuan University of Technology　Taiyuan　030060
2.
Guangdong Province Corps General Staff Department, The Chinese Armed Police Force　Guangzhou　510630
3.
Center for Healthy Big Data, Changzhi Medical College　Changzhi Shanxi　046000

Received Date: 2022-03-01
Rev Recd Date: 2022-05-16

Available Online: 2023-04-03

Publish Date: 2023-03-28

Abstract

Stock market forecasting is a difficult problem in the field of financial analysis. The intrinsic information contained in financial news has a great impact on the stock market performance. In this paper, we propose a BERT-based vector autoregressive network (BVANet), which quantifies financial news sentiment by BERT and then combines it with market performance to construct a financial time series vector autoregressive (VAR) model to achieve stock prediction eventually. The results show that BVANet has improved results in extracting news sentiment information and model prediction compared with traditional algorithms, and the sentiment of news has predictive effect on market performance. This study can provide a practical reference for the application of natural language processing in financial prediction.
- deep learning,
- financial news,
- natural language based financial forecasting,
- sentiment analysis,
- time-series analysis

References

[1]	FAMA E F. The behavior of stock-market prices[M]. Chicago: University of Chicago, 1965.
[2]	BENGIO Y, DUCHARME R, VINCENT P. A neural probabilistic language model[EB/OL]. [2021-10-9]. https://jmlr.org/papers/volume3/bengio03a/bengio03a.pdf.
[3]	MIKOLOV T, CHEN K, CORRADO G, et al. Efficient estimation of word representations in vector space[EB/OL]. [2021-10-11]. https://arxiv.org/pdf/1301.3781.pdf.
[4]	VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[EB/OL]. [2021-11-11]. https://arxiv.org/pdf/1706.03762.pdf.
[5]	DEVLIN J, CHANG M W, LEE K, et al. Bert: Pre-Training of deep bidirectional transformers for language understanding[EB/OL]. [2021-11-18]. https://arxiv.org/pdf/1810.04805.pdf.
[6]	顾文涛, 王儒, 郑肃豪, 等. 金融市场收益率方向预测模型研究—基于文本大数据方法[J]. 统计研究, 2020, 37(11): 68-79. GU W T, WANG R, ZHENG S H, et al. Research on the prediction model of the direction of financial market returns: Based on text big data method[J]. Statistical Research, 2020, 37(11): 68-79.
[7]	孟志青, 郑国杰, 赵韵雯. 网络投资者情绪与股票市场价格关系研究——基于文本挖掘技术分析[J]. 价格理论与实践, 2018(8): 127-130. MENG Z Q, ZHENG G J, ZHAO Y W. The Research on the relationship between network investor emotion and stock market price: Empirical analysis based on text mining technology[J]. Price Theory & Practice, 2018(8): 127-130.
[8]	姚加权, 冯绪, 王赞钧, 等. 语调、情绪及市场影响: 基于金融情绪词典[J]. 管理科学学报, 2021, 24(5): 26-46. YAO J Q, FENG X, WANG Z J, et al. Tone, sentiment and market impacts: The construction of Chinese sentiment dictionary in finance[J]. Journal of Management Sciences in China, 2021, 24(5): 26-46.
[9]	冉杨帆, 蒋洪迅. 基于BPNN和SVR的股票价格预测研究[J]. 山西大学学报(自然科学版), 2018, 41(1): 1-14. doi: 10.13451/j.cnki.shanxi.univ(nat.sci.).2018.01.001 RAN Y F, JIANG H X. Stock prices prediction based on back propagation neural network and support vector regression[J]. Journal of Shanxi University A (Natural Science Edition), 2018, 41(1): 1-14. doi: 10.13451/j.cnki.shanxi.univ(nat.sci.).2018.01.001
[10]	GIDOFALVI G, ELKAN C. Using news articles to predict stock price movements[EB/OL]. [2021-12-18]. https://www.researchgate.net/profile/Gyozo-idofalvi/publication/228892903_Using_news_articles_to_predict_stock_price_movements/links/54f58e690cf2ba6150668a52/Using-news-articles-to-predict-stock-price-movements.pdf.
[11]	IZUMI K, GOTO T, MATSUI T. Trading tests of long-term market forecast by text mining[C]//The 10th IEEE International Conference on Data Mining Workshops. Sydney: IEEE, 2011: 935-942.
[12]	YILDIRIM S, JOTHIMANI D, KAVAKLIOĞLU C, et al. Classification of "hot news" for financial forecast using NLP techniques[C]//Proceedings of the 2018 IEEE International Conference on Big Data. [S.l.]: IEEE, 2018: 4719-4722 .
[13]	HEATON J B, POLSON N G, WITTE J H. Deep learning for finance: Deep portfolios[J]. Applied Stochastic Models in Business and Industry, 2017, 33(1): 3-12. doi: 10.1002/asmb.2209
[14]	姚加权, 张锟澎, 罗平. 金融学文本大数据挖掘方法与研究进展[J]. 经济学动态, 2020(4): 143-158. YAO J Q, ZHANG K P, LUO P. Text mining in financial big data and its research progress[J]. Economic Perspectives, 2020(4): 143-158.
[15]	岑咏华, 谭志浩, 吴承尧. 财经媒介信息对股票市场的影响研究: 基于情感分析的实证[J]. 数据分析与知识发现, 2019, 3(9): 98-114. CEN Y H, TAN Z H, WU Z Y. Impacts of financial media information on stock market: An empirical study of sentiment analysis[J]. Data Analysis and Knowledge Discovery, 2019(9): 98-114.
[16]	AKITA R, YOSHIHARA A, MATSUBARA T, et al. Deep learning for stock prediction using numerical and textual information[C]//Proceedings of the 2016 IEEE/ACIS 15th International Conference on Computer and Information Science (ICIS). [S.l.]: IEEE, 2016: 1-6.
[17]	EAPEN J, BEIN D, VERMA A. Novel deep learning model with CNN and bi-directional LSTM for improved stock market index prediction[C]//Proceedings of the 2019 IEEE 9th Annual Computing and Communication Workshop and Conference (CCWC). [S.l.]: IEEE, 2019: 264-270.
[18]	GAO T, LI X, CHAI Y, et al. Deep learning with stock indicators and two-dimensional principal component analysis for closing price prediction system[C]//Proceedings of the 2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS). [S.l.]: IEEE, 2016: 166-169.
[19]	SIM H S, KIM H I, AHN J J. Is deep learning for image recognition applicable to stock market prediction?[J]. Complexity, 2019(3): 1-10.
[20]	VARGAS M R, DE LIMA B S, EVSUKOFF A G. Deep learning for stock market prediction from financial news articles[C]//Proceedings of the 2017 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA). [S.l.]: IEEE, 2017: 60-65.
[21]	邹宗森, 杨素婷. 货币供应量、利率对汇率的影响——基于VAR模型的分析[J]. 金融教育研究, 2020, 33(3): 16-24. doi: 10.3969/j.issn.2095-0098.2020.03.003 ZOU Z S, YANG S T. The impact of money supply and interest rate on exchange rate: Analysis based on var model[J]. Research of Finance and Education, 2020, 33(3): 16-24. doi: 10.3969/j.issn.2095-0098.2020.03.003
[22]	王书芹, 华钢, 徐永刚, 等. AUC的不一致性分析[J]. 江苏师范大学学报(自然科学版), 2013, 31(3): 31-34. WANG S Q, HUA G, XU Y G, et al. The incoherence of the area under the ROC curve[J]. Journal of Jiangsu Normal University (Natural Science Edition), 2013, 31(3): 31-34.

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(6) / Tables(8)

Get Citation

PDF

XML

Article Metrics

Article views(4410) PDF downloads(146) Cited by()

Proportional views

HTML

有效市场假说 (efficient market hypothesis, EMH) 认为，股票市场价格受到所有可观察到的信息影响^[1]。财经新闻涵盖了宏观经济走势、行业政策消息以及上市公司业绩状况等信息，这些新闻都会通过影响投资者情绪进而对金融市场产生作用。在最近十几年里，自然语言处理 (natural language processing, NLP) 发展迅速。文献[2]最早将分布式表征应用于单词，结合神经网络来训练语言模型。Mikolov 引入词向量 (word embedding) 技术^[3]，实现了文本信息的向量表示，进而使得计算机可以更有效地理解和处理文本信息。Google 团队基于自注意力机制提出的Transformer 结构^[4]以及BERT模型^[5]能更好地学习到句子中单词与单词之间的联系，从而能够结合上下文语境来提高挖掘信息的效果。NLP 中不断增强的文本表示能力使得计算机可以更准确地捕捉文本中的语义和情感。而如何将财经新闻中蕴含的信息高效准确地提取出来应用于金融市场是一项具有挑战性的研究课题。

在金融情绪分析研究中有一种基于情感词典提取的方法^[6-8]，这种方法通过人工提取的方式构建金融领域的情感词表，通过一些加权方法得到文本的情绪指数，以此来进一步分析和金融市场的联系。这种方法只能捕捉到文本的表层特征，即词语的频次、重要程度等，无法获取文本的句法和语义特征。随着机器学习在金融领域的广泛应用，研究者提出了基于机器学习的方法来解决上述问题^[9-12]。这种方法通过将文本视为词袋模型，采用贝叶斯分类^[10]、逻辑回归和SVM^[12]等算法训练文本分类器实现文本情绪提取进而分析金融市场。相比基于词典的方式，传统机器学习的方法可以捕捉到句法和语义层次的信息，提高情感表示的准确性。然而，研究表明深度学习在解决金融预测和分类问题时有更好的效果^[13]。此外，当训练集较大时，深度学习的信息提取精度明显高于传统的机器学习方法^[14]。因此，近年来研究者更倾向于采用深度学习的方法^[15-19]。

已有研究表明，循环神经网络(recurrent neural network, RNN)相比卷积神经网络(convolutional neural networks, CNN) 在捕捉上下文信息和建模复杂的时间特征方面更胜一筹^[20]。其中文献[20] 和文献[15]的工作比较有代表性。文献[20]提出了一种新的深度学习模型，通过段落向量将新闻文章转换成分布式表示，将长短期记忆网络 (long short-term memory, LSTM) 应用于金融时间序列预测中，并对多家公司开市价格的过去事件时间影响进行建模。文献[15]利用 LSTM 深度神经网络方法识别和提取金融新闻等文本的情感信息，构建自回归分布滞后模型和面板回归模型，从宏观市场以及微观股票资产两个层面实证揭示财经媒介信息所蕴含的情感对股票市场表现的关联影响。上述研究大多使用 RNN 或基于 RNN 的 LSTM 算法来提取文本信息，这些算法有如下缺点：1)信息提取准确率不高，新闻文本较长，LSTM 在处理长文本时效果不好；2)难以并行处理大规模的文本信息，而且效率低；3)预测部分使用机器学习和深度学习的方法缺乏可解释性，不适用于金融分析。

为解决上述问题，本文构建了基于 BERT 的向量自回归融合网络 BVANet ，该网络在情感信息提取过程中继承了 BERT 模型处理长文本的优势，利用自注意力机制提高了财经新闻情感信息提取的准确性和训练效率，之后通过量化情感信息并融合向量自回归 VAR 模型对市场表现进行时间序列分析。实验结果表明，相比于传统算法，BVANet 有更好的财经新闻情感分类效果，且在股票市场中有更好的可解释性和预测性。

3. 结束语

本文提出的 BVANet 是一种基于 BERT 向量自回归融合网络的股票预测方法，本文使用其分析了个股2020年全年新闻情感与市场表现的关系。结果表明 BVANet 在财经新闻的情感量化和股票预测中均优于其他方法，财经新闻情感信息对股票市场表现具有预测作用。BVANet 完整地进行了模型的训练、模型评估、时间序列分析和股票预测的全过程，对自然语言处理在金融领域的应用提供了实践参考。

未来研究或可从以下2个方面深入展开：

1) 本研究在训练模型时选取的细粒度为字向量，在中文中一个词语可能比字更能表达清楚，接下来可以利用十分完备的分词工具对文本进行分词，选取细粒度为词构建词向量。进一步提升对文章的向量表达效果，提高情感表达的精确性，更好地把握情感。

2) 本研究在进行时序分析时只用了财经新闻的情感特征和市场表现特征，然而实际的市场远比这要复杂的多，接下来可以加入社交媒体信息、股吧评论信息等，更全面地探究不同类型信息以及它们对市场的不同影响。

Reference (22)

[1]	FAMA E F. The behavior of stock-market prices[M]. Chicago: University of Chicago, 1965.
[2]	BENGIO Y, DUCHARME R, VINCENT P. A neural probabilistic language model[EB/OL]. [2021-10-9]. https://jmlr.org/papers/volume3/bengio03a/bengio03a.pdf.
[3]	MIKOLOV T, CHEN K, CORRADO G, et al. Efficient estimation of word representations in vector space[EB/OL]. [2021-10-11]. https://arxiv.org/pdf/1301.3781.pdf.
[4]	VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[EB/OL]. [2021-11-11]. https://arxiv.org/pdf/1706.03762.pdf.
[5]	DEVLIN J, CHANG M W, LEE K, et al. Bert: Pre-Training of deep bidirectional transformers for language understanding[EB/OL]. [2021-11-18]. https://arxiv.org/pdf/1810.04805.pdf.
[6]	顾文涛, 王儒, 郑肃豪, 等. 金融市场收益率方向预测模型研究—基于文本大数据方法[J]. 统计研究, 2020, 37(11): 68-79.	GU W T, WANG R, ZHENG S H, et al. Research on the prediction model of the direction of financial market returns: Based on text big data method[J]. Statistical Research, 2020, 37(11): 68-79.
[7]	孟志青, 郑国杰, 赵韵雯. 网络投资者情绪与股票市场价格关系研究——基于文本挖掘技术分析[J]. 价格理论与实践, 2018(8): 127-130.	MENG Z Q, ZHENG G J, ZHAO Y W. The Research on the relationship between network investor emotion and stock market price: Empirical analysis based on text mining technology[J]. Price Theory & Practice, 2018(8): 127-130.
[8]	姚加权, 冯绪, 王赞钧, 等. 语调、情绪及市场影响: 基于金融情绪词典[J]. 管理科学学报, 2021, 24(5): 26-46.	YAO J Q, FENG X, WANG Z J, et al. Tone, sentiment and market impacts: The construction of Chinese sentiment dictionary in finance[J]. Journal of Management Sciences in China, 2021, 24(5): 26-46.
[9]	冉杨帆, 蒋洪迅. 基于BPNN和SVR的股票价格预测研究[J]. 山西大学学报(自然科学版), 2018, 41(1): 1-14.	RAN Y F, JIANG H X. Stock prices prediction based on back propagation neural network and support vector regression[J]. Journal of Shanxi University A (Natural Science Edition), 2018, 41(1): 1-14.
[10]	GIDOFALVI G, ELKAN C. Using news articles to predict stock price movements[EB/OL]. [2021-12-18]. https://www.researchgate.net/profile/Gyozo-idofalvi/publication/228892903_Using_news_articles_to_predict_stock_price_movements/links/54f58e690cf2ba6150668a52/Using-news-articles-to-predict-stock-price-movements.pdf.
[11]	IZUMI K, GOTO T, MATSUI T. Trading tests of long-term market forecast by text mining[C]//The 10th IEEE International Conference on Data Mining Workshops. Sydney: IEEE, 2011: 935-942.
[12]	YILDIRIM S, JOTHIMANI D, KAVAKLIOĞLU C, et al. Classification of "hot news" for financial forecast using NLP techniques[C]//Proceedings of the 2018 IEEE International Conference on Big Data. [S.l.]: IEEE, 2018: 4719-4722 .
[13]	HEATON J B, POLSON N G, WITTE J H. Deep learning for finance: Deep portfolios[J]. Applied Stochastic Models in Business and Industry, 2017, 33(1): 3-12.
[14]	姚加权, 张锟澎, 罗平. 金融学文本大数据挖掘方法与研究进展[J]. 经济学动态, 2020(4): 143-158.	YAO J Q, ZHANG K P, LUO P. Text mining in financial big data and its research progress[J]. Economic Perspectives, 2020(4): 143-158.
[15]	岑咏华, 谭志浩, 吴承尧. 财经媒介信息对股票市场的影响研究: 基于情感分析的实证[J]. 数据分析与知识发现, 2019, 3(9): 98-114.	CEN Y H, TAN Z H, WU Z Y. Impacts of financial media information on stock market: An empirical study of sentiment analysis[J]. Data Analysis and Knowledge Discovery, 2019(9): 98-114.
[16]	AKITA R, YOSHIHARA A, MATSUBARA T, et al. Deep learning for stock prediction using numerical and textual information[C]//Proceedings of the 2016 IEEE/ACIS 15th International Conference on Computer and Information Science (ICIS). [S.l.]: IEEE, 2016: 1-6.
[17]	EAPEN J, BEIN D, VERMA A. Novel deep learning model with CNN and bi-directional LSTM for improved stock market index prediction[C]//Proceedings of the 2019 IEEE 9th Annual Computing and Communication Workshop and Conference (CCWC). [S.l.]: IEEE, 2019: 264-270.
[18]	GAO T, LI X, CHAI Y, et al. Deep learning with stock indicators and two-dimensional principal component analysis for closing price prediction system[C]//Proceedings of the 2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS). [S.l.]: IEEE, 2016: 166-169.
[19]	SIM H S, KIM H I, AHN J J. Is deep learning for image recognition applicable to stock market prediction?[J]. Complexity, 2019(3): 1-10.
[20]	VARGAS M R, DE LIMA B S, EVSUKOFF A G. Deep learning for stock market prediction from financial news articles[C]//Proceedings of the 2017 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA). [S.l.]: IEEE, 2017: 60-65.
[21]	邹宗森, 杨素婷. 货币供应量、利率对汇率的影响——基于VAR模型的分析[J]. 金融教育研究, 2020, 33(3): 16-24.	ZOU Z S, YANG S T. The impact of money supply and interest rate on exchange rate: Analysis based on var model[J]. Research of Finance and Education, 2020, 33(3): 16-24.
[22]	王书芹, 华钢, 徐永刚, 等. AUC的不一致性分析[J]. 江苏师范大学学报(自然科学版), 2013, 31(3): 31-34.	WANG S Q, HUA G, XU Y G, et al. The incoherence of the area under the ROC curve[J]. Journal of Jiangsu Normal University (Natural Science Edition), 2013, 31(3): 31-34.

数据集	数据来源	样本	新闻数量/条
股票新闻数据集	东方财富网	贵州茅台	1777
		山西汾酒	427
		五粮液	607
		洋河股份	384

参数	值
输入字向量维度	384
Transformer层数	6
Multi-head个数	12
优化器	Adam
Batch Size	1
Dropout	0.4
学习率	1×10⁻⁶

算法	Accuracy	Precision	Recall	F1-score
SVM	0.5882	0.5920	09674	0.7357
BP	0.5902	0.5896	0.9896	0.7450
CNN	0.7535	0.8145	0.7558	0.7824
LSTM	0.8244	0.8108	0.8936	0.8510
BVANet	0.9508	0.9513	0.9521	0.9517

个股	Pearson相关性
贵州茅台	0.793792**
五粮液	0.565619**
山西汾酒	0.880881**
洋河	0.663530**
**表示显著性水平为0.05

Lag	LR	AIC	SC	HQ
0	NA	−6.622052	−6.592961	−6.610329
1	25.14849	−6.695141	−6.607866	−6.659971
2	2.527013	−6.672467	−6.527009	−6.613851
3	4.607531	−6.658855	−6.455212	−6.576792
4	1.972506	−6.633958	−6.372132	−6.528449
5	2.497880	−6.611441	−6.291431	−6.482486

A BERT-Based Vector Autoregressive Network for Sentiment Analysis of Financial News

doi: 10.12178/1001-0548.2022058

Abstract

References

Proportional views

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Related

Proportional views