Citation: | XIA Ying, LIU Min. Traffic Flow Prediction Based on Spatial-Temporal Attention Convolutional Neural Network[J]. Journal of Southwest Jiaotong University, 2023, 58(2): 340-347. doi: 10.3969/j.issn.0258-2724.20210526 |
In order to fully exploit the complex spatial-temporal dynamic correlation of traffic flow and improve the accuracy of traffic flow prediction, a spatial attention mechanism and an dilated causal convolutional neural network are introduced. A traffic flow prediction model STACNN based on spatial-temporal attention convolutional neural network is proposed. Firstly, the gated temporal convolution network block constructed by dilated causal convolution and gating unit is used to obtain the nonlinear temporal dynamic correlation of traffic flow and avoid gradient disappearance or gradient explosion when training long-term sequences. Secondly, the spatial attention mechanism is used to automatically assign attention weights to the traffic sensor nodes in the road network, which can dynamically pay attention to the spatial relationship between non-adjacent nodes, and combine the graph convolutional neural network to extract the local spatial dynamic correlation of the road network. Then, the final traffic flow prediction result is obtained through the fully connected layer. Finally, a 60-minute traffic flow prediction experiment is carried out using two highway traffic datasets PEMSD4 and PEMSD8. The experimental results show that: compared with the spatio-temporal graph convolutional network (STGCN) model with good performance in the baseline model, the MAE (mean absolute error) value of the prediction results of the proposed STACNN model on the two datasets is improved by 2.79% and 1.18%, the MAPE (mean absolute percentage error) value increased by 1.00% and 0.46%, and the RMSE (root mean square error) value increased by 3.8% and 1.25%, respectively. In addition, introducing dilated causal convolutional neural network and spatial attention mechanism have positively contributed to extraction of spatial-temporal dynamic correlation features.
[1] |
NAGY A M, SIMON V. Survey on traffic prediction in smart cities[J]. Pervasive and Mobile Computing, 2018, 50: 148-163. doi: 10.1016/j.pmcj.2018.07.004
|
[2] |
刘静,关伟. 交通流预测方法综述[J]. 公路交通科技,2004,21(3): 82-85.
LIU Jing, GUAN Wei. A summary of traffic flow forecasting methods[J]. Journal of Highway Transportation Research Development, 2004, 21(3): 82-85.
|
[3] |
周晓,唐宇舟,刘强. 基于卡尔曼滤波的道路平均速度预测模型研究[J]. 浙江工业大学学报,2020,48(4): 392-396,404.
ZHOU Xiao, TANG Yuzhou, LIU Qiang. Research on road average speed prediction model based on kalman filter[J]. Journal of Zhejiang University of Technology, 2020, 48(4): 392-396,404.
|
[4] |
OKUTANI I, STEPHANEDES Y J. Dynamic prediction of traffic volume through Kalman filtering theory[J]. Transportation Research Part B: Methodological, 1984, 18(1): 1-11. doi: 10.1016/0191-2615(84)90002-X
|
[5] |
HAMED M M, AL-MASAEID H R, SAID Z M B. Short-term prediction of traffic volume in urban arterials[J]. Journal of Transportation Engineering, 1995, 121(3): 249-254. doi: 10.1061/(ASCE)0733-947X(1995)121:3(249)
|
[6] |
李洁,彭其渊,杨宇翔. 基于SARIMA模型的广珠城际铁路客流量预测[J]. 西南交通大学学报,2020,55(1): 41-51. doi: 10.35741/issn.0258-2724.55.1.41
LI Jie, PENG Qiyuan, YANG Yuxiang. Passenger flow prediction for Guangzhou−Zhuhai intercity railway based on SARIMA model[J]. Journal of Southwest Jiaotong University, 2020, 55(1): 41-51. doi: 10.35741/issn.0258-2724.55.1.41
|
[7] |
ZIVOT E, WANG J H. Modeling financial time series with S-PLUS®[M]. 2nd editon. New York: Springer, 2006: 385-429.
|
[8] |
姚智胜,邵春福,高永亮. 基于支持向量回归机的交通状态短时预测方法研究[J]. 北京交通大学学报,2006,30(3): 19-22. doi: 10.3969/j.issn.1673-0291.2006.03.005
YAO Zhisheng, SHAO Chunfu, GAO Yongliang. Research on methods of short-term traffic forecasting based on support vector regression[J]. Journal of Beijing Jiaotong University, 2006, 30(3): 19-22. doi: 10.3969/j.issn.1673-0291.2006.03.005
|
[9] |
张晓利,贺国光,陆化普. 基于K-邻域非参数回归短时交通流预测方法[J]. 系统工程学报,2009,24(2): 178-183.
ZHANG Xiaoli, HE Guoguang, LU Huapu. Short-term traffic flow forecasting based on K-nearest neighbors non-parametric regression[J]. Journal of Systems Engineering, 2009, 24(2): 178-183.
|
[10] |
陈丹,胡明华,张洪海,等. 基于贝叶斯估计的短时空域扇区交通流量预测[J]. 西南交通大学学报,2016,51(4): 807-814. doi: 10.3969/j.issn.0258-2724.2016.04.028
CHEN Dan, HU Minghua, ZHANG Honghai, et al. Short-term traffic flow prediction of airspace sectors based on Bayesian estimation theory[J]. Journal of Southwest Jiaotong University, 2016, 51(4): 807-814. doi: 10.3969/j.issn.0258-2724.2016.04.028
|
[11] |
SHAO H X, SOONG B H . Traffic flow prediction with long short-term memory networks (LSTMS)[C]// Proceedings of 2016 IEEE Region 10 Conference (TENCON). Singapore: IEEE, 2016: 2986-2989.
|
[12] |
刘明宇,吴建平,王钰博,等. 基于深度学习的交通流量预测[J]. 系统仿真学报,2018,30(11): 4100-4105,4114. doi: 10.16182/j.issn1004731x.joss.201811007
LIU Mingyu, WU Jianping, WANG Yubo, et al. Traffic flow prediction based on deep learning[J]. Journal of System Simulation, 2018, 30(11): 4100-4105,4114. doi: 10.16182/j.issn1004731x.joss.201811007
|
[13] | |
[14] |
YAO H X, TANG X F, WEI H, et al. Revisiting spatial-temporal similarity: a deep learning framework for traffic prediction[C]//Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence. Honolulu: AAAI, 2019: 5668-5675.
|
[15] |
ZHANG J, ZHENG Y, QI D. Deep spatio-temporal residual networks for citywide crowd flows prediction[C]//Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence. San Francisco: AAAI, 2016: 1655-1661.
|
[16] |
ZHAO L, SONG Y J, ZHANG C, et al. T-GCN: a temporal graph convolutional network for traffic prediction[J]. IEEE Transactions on Intelligent Transportation Systems, 2020, 21(9): 3848-3858. doi: 10.1109/TITS.2019.2935152
|
[17] |
YU B, YIN H T, ZHU Z X. Spatio-temporal graph convolutional networks: a deep learning framework for traffic forecasting[C]//Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence. Stockholm: IJCAI, 2018: 3634-3640.
|
[18] |
TEDJOPURNOMO D A, BAO Z F, ZHENG B H, et al. A survey on modern deep neural network for traffic prediction: trends, methods and challenges[J]. IEEE Transactions on Knowledge and Data Engineering, 2022, 34(4): 1544-1561.
|
[19] | |
[20] |
FENG X C, GUO J, QIN B, et al. Effective deep memory networks for distant supervised relation extraction[C]//Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence. Melbourne: IJCAI, 2017: 4002-4008.
|
[21] | |
[22] |
SIMONOVSKY M, KOMODAKIS N. Dynamic edge-conditioned filters in convolutional neural networks on graphs[C]//Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 29-38.
|
[23] |
YU F , KOLTUN V. Multi-scale context aggregation by dilated convolutions[C]//4th International Conference on Learning Representations. San Juan: ICLR, 2016: 1-13.
|
[24] |
DAUPHIN Y N, FAN A, AULI M, et al. Language modeling with gated convolutional networks[C]//34th International Conference on Machine Learning. Sydney: IMLS, 2017: 1551-1559.
|
[25] |
GUO S N, LIN Y F, FENG N, et al. Attention based spatial-temporal graph convolutional networks for traffic flow forecasting[C]//Proceedings of the AAAI Conference on Artificial Intelligence. Honolulu: AAAI, 2019: 922-929.
|