Short-Term User Load Forecasting Based on Deep Random Forest: Take Jinhua City as an Example

HU Zhaolong; HU Junjian; PENG Hao; HAN Jianmin; ZHU Xiangbin; DING Zhiguo

doi:10.12178/1001-0548.2022172

Volume 52 Issue 3

May 2023


HU Zhaolong, HU Junjian, PENG Hao, HAN Jianmin, ZHU Xiangbin, DING Zhiguo. Short-Term User Load Forecasting Based on Deep Random Forest: Take Jinhua City as an Example[J]. Journal of University of Electronic Science and Technology of China, 2023, 52(3): 430-437. doi: 10.12178/1001-0548.2022172


School of Computer Science and Technology, Zhejiang Normal University, Jinhua, Zhejiang 321004

Received Date: 2022-06-06
Revised Date: 2022-10-27
Accepted Date: 2023-02-01

Available Online: 2023-05-26

Publish Date: 2023-05-28

Abstract

Weather data obtained by web crawling are combined with user load data from Jinhua City, and a deep random forest algorithm is introduced for short-term user load forecasting. Using four evaluation indicators, the deep random forest algorithm is compared with the support vector regression algorithm, the K-nearest neighbor algorithm, the Bayesian ridge regression algorithm, the random forest algorithm, and several neural network algorithms. The deep random forest algorithm performs best, followed by support vector regression, whereas the neural network algorithms perform only moderately on this dataset.
Keywords: deep random forest algorithm; machine learning; short-term load forecasting; weather information

References

[1]	LI H, LIU F, LI J. China's electric power development situation in 2020 and prospects for 2021[J]. Energy of China, 2021, 3: 24-29. doi: 10.3969/j.issn.1003-2355.2021.03.004 (in Chinese)
[2]	XIAO G Q, WANG C, ZHANG F W. Electric load forecasting[M]. Beijing: China Electric Power Press, 2001. (in Chinese)
[3]	CAO A Z, TIAN L. Short-term electric power load forecasting based on neural network model[J]. Journal of University of Electronic Science and Technology of China, 2006, 35(4): 507-509. doi: 10.3969/j.issn.1001-0548.2006.04.023 (in Chinese)
[4]	QIAN W H, YAO J G, LONG L B, et al. Short-term correlation and annual growth based mid-long term load forecasting[J]. Automation of Electric Power Systems, 2007, 31(11): 59-64. (in Chinese)
[5]	XU Q, ZHOU C, ZHAO S S, et al. Research on short-term power load forecasting method based on machine learning[J]. Electrical Measurement & Instrumentation, 2019, 56(23): 70-75. (in Chinese)
[6]	LEE J Y, CHO Y S. National-scale electricity peak load forecasting: Traditional, machine learning, or hybrid model?[J]. Energy, 2022, 239(Part D): 122366.
[7]	LI C, KONG X Y, ZHU S J, et al. Short-term load forecasting of regional power grid considering demand response in energy interconnection environment[J]. Automation of Electric Power Systems, 2021, 45(1): 71-78. (in Chinese)
[8]	WANG Y, HUANG G X, PENG D G. Application of multiple linear-feedback regression analysis to in electric load forecasting[J]. Computer Applications and Software, 2008, 25(1): 82-84. doi: 10.3969/j.issn.1000-386X.2008.01.031 (in Chinese)
[9]	RAMANATHAN R, ENGLE R, GRANGER C W J, et al. Short-run forecasts of electricity loads and peaks[J]. International Journal of Forecasting, 1997, 13(2): 161-174. doi: 10.1016/S0169-2070(97)00015-0
[10]	LI D D, QIN Z S, LIN S F, et al. Short-term load forecasting for microgrid based on method of chaotic time series[J]. Proceedings of the CSU-EPSA, 2015, 27(5): 14-18. doi: 10.3969/j.issn.1003-8930.2015.05.03 (in Chinese)
[11]	BENTO P M R, POMBO J A N, CALADO M R A, et al. Stacking ensemble methodology using deep learning and ARIMA models for short-term load forecasting[J]. Applied Mathematical Sciences, 2021, 14(21): 7378.
[12]	ALBERG D, LAST M. Short-term load forecasting in smart meters with sliding window-based ARIMA algorithms[J]. Vietnam Journal of Computer Science, 2018, 5: 241-249. doi: 10.1007/s40595-018-0119-7
[13]	ZHANG C R. Research on short-term power load forecasting and load curve clustering based on machine learning[D]. Hangzhou: Zhejiang University, 2021. (in Chinese)
[14]	WU X Y, HE J H, ZHANG P, et al. Power system short-term load forecasting based on improved random forest with grey relation projection[J]. Automation of Electric Power Systems, 2015(12): 50-55. doi: 10.7500/AEPS20140916005 (in Chinese)
[15]	ZHOU Z H, FENG J. Deep forest[J]. National Science Review, 2019, 6(1): 74-86. doi: 10.1093/nsr/nwy108
[16]	CHEN L P, YIN L F, YU T, et al. Short-term power load forecasting based on deep forest algorithm[J]. Electric Power Construction, 2018, 39(11): 42-50. (in Chinese)
[17]	SEUNGHYOUNG R, NOH J, KIM H. Deep neural network based demand side short term load forecasting[J]. Energies, 2017, 10(1): 3.
[18]	KONG W, DONG Z Y, JIA Y, et al. Short-term residential load forecasting based on LSTM recurrent neural network[J]. IEEE Transactions on Smart Grid, 2019, 10(1): 841-851. doi: 10.1109/TSG.2017.2753802
[19]	LIU F, DONG T, HOU T, et al. A hybrid short-term load forecasting model based on improved fuzzy C-Means clustering, random forest and deep neural networks[J]. IEEE Access, 2021, 9: 59754-59765. doi: 10.1109/ACCESS.2021.3063123
[20]	MOHAMED M, SHADY S R, INES C, et al. A novel stacked generalization ensemble-based hybrid LGBM-XGB-MLP model for short-term load forecasting[J]. Energy, 2021, 214: 118874. doi: 10.1016/j.energy.2020.118874
[21]	SHARIFF S M. Autoregressive integrated moving average (ARIMA) and long short-term memory (LSTM) network models for forecasting energy consumptions[J]. European Journal of Electrical Engineering and Computer Science, 2022, 6(3): 7-10. doi: 10.24018/ejece.2022.6.3.435
[22]	ZHOU D, MA S, HAO J, et al. An electricity load forecasting model for integrated energy system based on BiGAN and transfer learning[J]. Energy Reports, 2020, 6: 3446-3461. doi: 10.1016/j.egyr.2020.12.010


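With rising living standards, users place ever higher demands on the stability and throughput of the electricity supply [1]. Rapidly growing electricity demand brings complex problems of power resource dispatching and management, and accurate user load forecasting plays a very important role in optimizing energy dispatching and management.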

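User load forecasting starts from known conditions of the power system, the economy, society, and the weather. By analyzing historical data and accounting for uncertain factors, it estimates future user load in advance [2-3]. It is divided into long-term, medium-term, and short-term forecasting: forecasts over a few hours to one week are generally called short-term, while medium- and long-term forecasts cover longer horizons, from several weeks to the next few years [4]. Short-term load forecasting has long been a research hotspot. With advances in technology and growing electricity demand, methods for short-term user load forecasting have moved from traditional regression to deep learning, and forecasting accuracy has improved significantly [5-7]. This paper likewise focuses on short-term load forecasting.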

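Traditional user load forecasting methods are mainly linear regression and time-series regression analysis. Linear regression analyzes the correspondence between a large number of related variables and the load data and builds a mathematical model that captures their intrinsic relationship, which is then used to forecast load [8]. Once its parameters are determined, the method forecasts load quickly; its drawback is that it is hard to find a precise model that describes the relationship between the related variables and the load data [9]. Time-series regression treats a user's historical load as a time-dependent variable and builds a regression model to predict the future load trend, for example the AR, MA, and ARIMA models [10-12].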
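
To make the time-series regression idea concrete, the following is a minimal sketch of a first-order autoregressive (AR(1)) model fitted by ordinary least squares. It is an illustration only, not the configuration used in the cited studies: the function names `fit_ar1` and `forecast` and the short load series are made up for this example.

```python
from statistics import fmean

def fit_ar1(series):
    """Least-squares fit of y[t] = c + phi * y[t-1]; returns (c, phi)."""
    prev, curr = series[:-1], series[1:]
    mean_prev, mean_curr = fmean(prev), fmean(curr)
    var = sum((p - mean_prev) ** 2 for p in prev)
    cov = sum((p - mean_prev) * (q - mean_curr) for p, q in zip(prev, curr))
    phi = cov / var
    return mean_curr - phi * mean_prev, phi

def forecast(series, steps, c, phi):
    """Roll the fitted recursion forward from the last observation."""
    out, last = [], series[-1]
    for _ in range(steps):
        last = c + phi * last
        out.append(last)
    return out

# Made-up series that follows y[t] = 2 + 0.5 * y[t-1] exactly.
load = [10.0, 7.0, 5.5, 4.75, 4.375]
c, phi = fit_ar1(load)
next_two = forecast(load, 2, c, phi)
print(c, phi, next_two)
```

Because the toy series is generated by the same recursion, the fit recovers its coefficients; real load data would also need differencing or moving-average terms, which is what ARIMA adds.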

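Modern intelligent optimization algorithms include traditional machine learning algorithms and deep learning algorithms. Traditional machine learning algorithms can solve nonlinear and high-dimensional classification and regression problems [13]; examples are support vector machines, decision trees, Bayesian algorithms, the K-nearest neighbors (KNN) algorithm, and the random forest algorithm. The random forest algorithm builds multiple decision trees that decide on the same problem, selecting samples and features at random, and studies have found that random forest models give more accurate forecasts [14].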

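Deep random forest is essentially a cascade: the predictions of the previous level are concatenated with the input features to form new input features. It can yield models as good as neural networks, with low computational cost, few hyperparameters, and high efficiency [15]. Reference [16] found that the deep forest algorithm forecasts load more accurately than traditional machine learning algorithms, but it neither compared the results in depth with other deep learning algorithms nor analyzed how parameters affect the forecasts.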
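
The cascade described above can be sketched in a few dozen lines. The code below is a simplified illustration, not the authors' implementation: each "forest" is a bag of depth-1 regression trees (stumps) rather than full trees, augmented features come from in-sample predictions rather than the cross-validated predictions used in the deep forest of [15], and all function names, parameters, and data are hypothetical.

```python
import random
from statistics import fmean

def fit_stump(X, y, feature):
    """Depth-1 regression tree on one feature: (feature, threshold, left_mean, right_mean)."""
    values = sorted({row[feature] for row in X})
    best = None
    for lo, hi in zip(values, values[1:]):
        thr = (lo + hi) / 2
        left = [t for row, t in zip(X, y) if row[feature] <= thr]
        right = [t for row, t in zip(X, y) if row[feature] > thr]
        if not right:                    # guards against thr rounding up to hi
            continue
        lm, rm = fmean(left), fmean(right)
        sse = sum((t - lm) ** 2 for t in left) + sum((t - rm) ** 2 for t in right)
        if best is None or sse < best[0]:
            best = (sse, thr, lm, rm)
    if best is None:                     # feature is constant in this sample
        m = fmean(y)
        return (feature, 0.0, m, m)
    return (feature, best[1], best[2], best[3])

def fit_forest(X, y, n_trees, rng):
    """Bagging: each stump sees a bootstrap sample and one randomly chosen feature."""
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(X)) for _ in X]
        forest.append(fit_stump([X[i] for i in idx], [y[i] for i in idx],
                                rng.randrange(len(X[0]))))
    return forest

def predict_forest(forest, row):
    return fmean(lm if row[f] <= thr else rm for f, thr, lm, rm in forest)

def fit_cascade(X, y, n_levels=2, n_forests=2, n_trees=25, seed=0):
    rng = random.Random(seed)
    levels, feats = [], [list(r) for r in X]
    for _ in range(n_levels):
        forests = [fit_forest(feats, y, n_trees, rng) for _ in range(n_forests)]
        levels.append(forests)
        # Next level's input = original features + this level's predictions.
        feats = [list(r) + [predict_forest(f, a) for f in forests]
                 for r, a in zip(X, feats)]
    return levels

def predict_cascade(levels, row):
    aug = list(row)
    for forests in levels:
        preds = [predict_forest(f, aug) for f in forests]
        aug = list(row) + preds
    return fmean(preds)                  # average the last level's forests

# Hypothetical data: [hour of day, temperature] -> load.
data_rng = random.Random(1)
X = [[h % 24, data_rng.uniform(0, 30)] for h in range(48)]
y = [5 + 0.3 * hour + 0.1 * temp for hour, temp in X]
levels = fit_cascade(X, y)
pred = [predict_cascade(levels, row) for row in X]
```

The point of the sketch is the feature augmentation in `fit_cascade`: every level sees the raw inputs plus the previous level's outputs, which is what distinguishes the cascade from a single random forest.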

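Unlike the deep random forest model, deep neural network models build multilayer networks mainly by simulating how neurons in the human brain work, and make predictions by learning from data. Neural networks have strong learning ability and fault tolerance, but training a good model requires large amounts of data and computing power. Reference [17] used a deep neural network to forecast short-term user load and showed that it is more accurate than shallow neural networks and traditional machine learning algorithms. Reference [18] found that the long short-term memory (LSTM) recurrent neural network forecasts more accurately than traditional machine learning algorithms.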

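In addition, researchers have proposed hybrid models that combine several algorithms, such as a hybrid of random forest and LSTM [19], the hybrid LGBM-XGB-MLP algorithm [20], and an algorithm that combines the ARIMA regression model with the LSTM recurrent neural network [21]. Hybrid algorithms usually forecast well, but they take longer to train. Many studies indicate that deep forest networks train quickly and also forecast well. Existing work has found that the deep forest algorithm forecasts user load more accurately than traditional machine learning algorithms, but it remains unclear whether it is also more accurate than other deep learning algorithms, and the influence of parameters has not been analyzed in depth [16].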

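This paper obtains weather data with a web crawler and combines it with 2020 enterprise load data from Jinhua City to forecast user load. The deep random forest algorithm is compared not only with traditional machine learning algorithms but also with several deep learning algorithms, and the effect of each algorithm's parameters on the forecasts is analyzed.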
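
As an illustration of how crawled weather data might be combined with load data, the sketch below attaches one day's weather record to each load reading of that day. Everything in it is an assumption made for this example: the field names (`t_max`, `t_min`, `rain`), the hourly granularity of the load readings, the chosen features, and the sample values do not come from the paper's dataset.

```python
from datetime import datetime

# Hypothetical daily weather records keyed by date (field names are illustrative).
weather = {
    "2020-07-01": {"t_max": 33.0, "t_min": 25.0, "rain": 0},
    "2020-07-02": {"t_max": 29.0, "t_min": 24.0, "rain": 1},
}

# Hypothetical load readings: (timestamp, load).
load = [
    ("2020-07-01 09:00", 41.2),
    ("2020-07-01 10:00", 44.8),
    ("2020-07-02 09:00", 38.5),
]

def build_samples(load, weather):
    """Attach the day's weather to every reading; skip days without weather."""
    samples = []
    for stamp, value in load:
        ts = datetime.strptime(stamp, "%Y-%m-%d %H:%M")
        day = weather.get(ts.strftime("%Y-%m-%d"))
        if day is None:
            continue
        features = [ts.hour, ts.weekday(), day["t_max"], day["t_min"], day["rain"]]
        samples.append((features, value))
    return samples

samples = build_samples(load, weather)
for features, value in samples:
    print(features, value)
```

Each sample pairs calendar features (hour, weekday) with that day's weather, so readings from the same day share the same weather values.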

3. Conclusion

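Taking user load data from Jinhua City as an example, this paper crawls and analyzes Jinhua weather information and user load data and forecasts user load with the deep random forest algorithm, using four evaluation metrics: RMSE, MAE, R², and MAPE. Compared with several other forecasting algorithms, the deep random forest algorithm gives better forecasts.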
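
For reference, the four metrics named above can be computed as follows. This is a sketch using their standard definitions; reporting MAPE as a percentage is an assumption here, and the function name `regression_metrics` and the sample values are illustrative.

```python
from math import sqrt

def regression_metrics(y_true, y_pred):
    """Return RMSE, MAE, R2 and MAPE (in percent) for two equal-length sequences."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)                  # residual sum of squares
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)   # total sum of squares
    return {
        "RMSE": sqrt(ss_res / n),
        "MAE": sum(abs(e) for e in errors) / n,
        "R2": 1 - ss_res / ss_tot,
        # MAPE is undefined when a true value is zero.
        "MAPE": 100 * sum(abs(e / t) for e, t in zip(errors, y_true)) / n,
    }

metrics = regression_metrics([2.0, 4.0, 6.0, 8.0], [1.0, 5.0, 6.0, 10.0])
print(metrics)
```

Lower RMSE, MAE, and MAPE indicate a better fit, while R² is better the closer it is to 1.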

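Although the crawled weather data contain no hourly information, they still support reasonably good load forecasts; hourly weather data, if obtainable, should give better results. Many studies have found that combining several machine learning algorithms can improve forecasting accuracy, which is the direction of our future work. In addition, classifying users by industry and training and forecasting each industry separately may improve the results. Finally, this paper only studies users with data for the whole year and ignores the effect of sparse data from new users; adversarial network models [22] could address this problem well.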


Model	RMSE	MAE	R²	MAPE
Random forest	3.123	2.559	0.901	18.502
Deep random forest	2.818	1.878	0.919	14.390

Model	Activation function	RMSE	MAE	R²	MAPE
BP	Sigmoid	4.292	3.618	0.812	26.271
BP	ReLU	3.543	2.810	0.872	21.928
BP	ELU	3.489	2.800	0.876	19.378
BP	Tanh	4.978	4.074	0.747	40.964
LSTM	Sigmoid	4.959	3.982	0.749	27.632
LSTM	ReLU	4.200	3.441	0.820	25.540
LSTM	ELU	3.647	2.850	0.864	21.157
LSTM	Tanh	6.100	4.486	0.621	26.361
Bidirectional LSTM	Sigmoid	5.229	4.014	0.721	24.310
Bidirectional LSTM	ReLU	4.964	3.681	0.749	23.726
Bidirectional LSTM	ELU	3.458	2.747	0.878	23.541
Bidirectional LSTM	Tanh	6.102	4.679	0.621	26.958
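
The four activation functions compared in the preceding table have the standard definitions sketched below. The ELU scale `alpha=1.0` is an assumption (a common default), since the setting used in the experiments is not stated here.

```python
import math

def sigmoid(x):
    """Logistic function, output in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))

def relu(x):
    """Rectified linear unit: zero for negative inputs."""
    return max(0.0, x)

def elu(x, alpha=1.0):
    """Exponential linear unit: smooth negative branch that saturates at -alpha."""
    return x if x > 0 else alpha * (math.exp(x) - 1.0)

def tanh(x):
    """Hyperbolic tangent, output in (-1, 1)."""
    return math.tanh(x)

for name, fn in [("sigmoid", sigmoid), ("relu", relu), ("elu", elu), ("tanh", tanh)]:
    print(name, [round(fn(v), 3) for v in (-2.0, 0.0, 2.0)])
```

Sigmoid and tanh saturate for large inputs, whereas ReLU and ELU stay linear for positive inputs.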

Model	RMSE	MAE	R²	MAPE	Time/s
SVR	2.955	2.152	0.911	17.715	4.8
KNN	3.298	2.728	0.889	20.531	0.3
Bayesian ridge regression	4.306	3.561	0.811	23.186	0.1
Random forest	3.123	2.559	0.901	18.502	2.2
Deep random forest	2.818	1.878	0.919	14.390	4.7
BP neural network	3.489	2.800	0.876	19.378	8.4
LSTM	3.647	2.850	0.864	21.157	21.4
Bidirectional LSTM	3.458	2.747	0.878	23.541	73.1
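
The comparison in the table above can be checked with a few lines of arithmetic. The sketch below copies the RMSE and time columns from that table, ranks the models by RMSE, and computes the relative RMSE reduction of the deep random forest against a baseline; the helper name `rmse_reduction` is illustrative.

```python
# (RMSE, time in seconds) copied from the comparison table above.
results = {
    "SVR": (2.955, 4.8),
    "KNN": (3.298, 0.3),
    "Bayesian ridge regression": (4.306, 0.1),
    "Random forest": (3.123, 2.2),
    "Deep random forest": (2.818, 4.7),
    "BP neural network": (3.489, 8.4),
    "LSTM": (3.647, 21.4),
    "Bidirectional LSTM": (3.458, 73.1),
}

# Models ordered from lowest to highest RMSE.
ranking = sorted(results, key=lambda name: results[name][0])

def rmse_reduction(model, baseline):
    """Relative RMSE reduction of `model` over `baseline`, in percent."""
    base = results[baseline][0]
    return 100 * (base - results[model][0]) / base

print(ranking)
print(round(rmse_reduction("Deep random forest", "Random forest"), 2))
print(round(rmse_reduction("Deep random forest", "SVR"), 2))
```

On these figures the deep random forest ranks first and SVR second by RMSE, in line with the abstract, and the deep random forest lowers RMSE by a little under 10% relative to the plain random forest.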

Model	RMSE (next 3 days)	RMSE (next 5 days)	MAE (next 3 days)	MAE (next 5 days)	R² (next 3 days)	R² (next 5 days)	MAPE (next 3 days)	MAPE (next 5 days)
SVR	4.060	4.938	2.779	3.426	0.861	0.794	35.653	54.923
KNN	4.600	4.392	3.344	3.349	0.821	0.837	39.541	52.813
Bayesian ridge regression	5.539	5.819	3.968	4.232	0.741	0.715	52.073	75.965
Random forest	4.398	4.859	3.194	3.612	0.837	0.801	45.540	75.610
Deep random forest	4.0468	4.448	2.846	3.206	0.862	0.833	30.724	46.691
BP neural network	4.787	6.998	3.288	4.924	0.807	0.587	40.51	65.081
LSTM	7.472	9.614	5.804	6.891	0.761	0.726	53.801	62.126
Bidirectional LSTM	5.505	6.363	3.599	4.558	0.716	0.659	47.558	78.369
