| Citation: | HU Liwei, HOU Zhi, ZHAO Xueting, LIU Bing, CHEN Chen, HE Yu, ZHANG Ruijie. Improvement of Highway Traffic Risk Prediction Method Based on Traffic Accident Text Mining[J]. Journal of Southwest Jiaotong University, 2025, 60(6): 1487-1498. doi: 10.3969/j.issn.0258-2724.20230290 |
To effectively solve the problems of long highway inspection mileage and control difficulty, the applicability of the existing bidirectional long and short-term memory network (BiLSTM) text classification model and convolutional neural network (CNN) risk prediction model was improved, and the historical road traffic accident text data were analyzed and mined. The road segmentation method was introduced to accurately predict the distribution of highway driving risks and realize the scientific control of highway driving safety. Firstly, the text of traffic accidents was classified by the improved BiLSTM based on a self-attention mechanism (BiLSTM-AT), and the corresponding accident risk level of each accident was obtained. Second, the highway was divided into segments in ArcGIS, and the driving risk level within each segment was counted; kernel density analysis was performed to visualize the text classification results and show the risk level in different areas. Finally, the CNN based on LSTM (CNN-LSTM) was used to conduct time series prediction for the classified risk levels, obtaining the spatial distribution of future highway driving risks and drawing the cloud map of highway driving risk levels. The results show that the accuracy of the BiLSTM-AT model reaches 95.03% in terms of accident text classification, which is 0.91% and 0.67% higher than that of the BiLSTM and gate recurrent unit (GRU), respectively; the average relative error and root mean square error of the CNN-LSTM are 0.04 and 0.07, respectively, in terms of risk prediction, which are lower than that of the suboptimal LSTM model by 9.05% and 6.84%, respectively. The proposed method that closely connects accident text classification, segment division, driving risk prediction, and result visualization can effectively extract and analyze the driving risk information in the traffic accident text and provide a reference for optimizing the highway inspection routes and the traffic control of key segments.
| [1] |
何杰, 叶云涛, 徐扬, 等. 基于多模态参数的高速公路驾驶人压力负荷检测方法[J]. 西南交通大学学报, 2025, 60(5): 1229-1239.
HE Jie, YE Yuntao, XU Yang, et al. Method for stress detection of freeway drivers based on multimodal parameters[J]. Journal of Southwest Jiaotong University, 2025, 60(5): 1229-1239.
|
| [2] |
胡立伟, 吕一帆, 赵雪亭等. 基于数据驱动的交通事故伤害程度影响因素及其耦合关系研究[J]. 交通运输系统工程与信息, 2022, 22(05): 117-124, 134
HU Liwei, LV Yifan, ZHAO Xueting, et al. Research on the influencing factors and coupling relationship of traffic accident injury severity based on data-driven approach[J]. Journal of Transportation Systems Engineering and Information Technology, 2022, 22(05): 117-124, 134.
|
| [3] |
牛世峰, 董景钊, 常东风, 等. 考虑空间稳定性的公路单车事故严重程度影响因素分析[J/OL]. 西南交通大学学报, 1-15[2025-05-03]. http://kns.cnki.net/kcms/detail/51.1277.U.20250324.1433.002.html.
|
| [4] |
PUTRA A D, GIRSANG A S. Analysis of named-entity effect on text classification of traffic accident data using machine learning[J]. Indonesian Journal of Electrical Engineering and Computer Science, 2022, 25(3): 1672-1678. doi: 10.11591/ijeecs.v25.i3.pp1672-1678
|
| [5] |
RIVERA G, FLORENCIA R, GARCÍA V, et al. News classification for identifying traffic incident points in a Spanish-speaking country: a real-world case study of class imbalance learning[J]. Applied Sciences, 2020, 10(18): 6253. doi: 10.3390/app10186253
|
| [6] |
范维克, 张绍阳, 陈博远, 等. 交通信息标准条款BLSTM和CNN链式模型分类方法[J]. 江苏大学学报(自然科学版), 2020, 41(02): 143-148. doi: 10.3969/j.issn.1671-7775.2020.02.004
FAN Weike, ZHANG Shaoyang, CHEN Boyuan, et al. Classification methods of traffic standard terms based on BLSTM and CNN chain model[J]. Journal of Jiangsu University (Natural Science Edition), 2020, 41(02): 143-148. doi: 10.3969/j.issn.1671-7775.2020.02.004
|
| [7] |
YUAN S, WANG Q. Imbalanced traffic accident text classification based on Bert-RCNN[C]//Journal of Physics: Conference Series. [S.l.]: IOP, 2022: 012003.
|
| [8] |
李昀轩, 李萌, 陆建, 等. 基于多任务迁移学习的交通警情信息自动处理方法[J]. 中国公路学报, 2022, 35(9): 1-12.
LI Yunxuan, LI Meng, LU Jian, et al. An auto-processing method of traffic safety information based on a multi-task transfer learning algorithm[J]. China Journal of Highway and Transport, 2022, 35(9): 1-12.
|
| [9] |
HOSSAIN M, MUROMACHI Y. A Bayesian network based framework for real-time crash prediction on the basic freeway segments of urban expressways[J]. Accident Analysis & Prevention, 2012, 45: 373-381.
|
| [10] |
YU R, ABDEL-ATY M. Utilizing support vector machine in real-time crash risk evaluation[J]. Accident Analysis & Prevention, 2013, 51: 252-259.
|
| [11] |
LIN L, WANG Q, SADEK A W. A novel variable selection method based on frequent pattern tree for real-time traffic accident risk prediction[J]. Transportation Research Part C: Emerging Technologies, 2015, 55: 444-459. doi: 10.1016/j.trc.2015.03.015
|
| [12] |
SAMEEN M I, PRADHAN B. Severity prediction of traffic accidents with recurrent neural networks[J]. Applied Sciences, 2017, 7(6): 476-482. doi: 10.3390/app7060476
|
| [13] |
YUAN Z, ZHOU X, YANG T. Hetero-convlstm: a deep learning approach to traffic accident prediction on heterogeneous spatio-temporal data[C]//Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. London: Association for Computing Machinery, 2018: 984-992.
|
| [14] |
ZHOU Z, WANG Y, XIE X, et al. RiskOracle: A minute-level citywide traffic accident forecasting framework[C]//Proceedings of the AAAI Conference on Artificial Intelligence. New York: AAAI, 2020, 34(1): 1258-1265.
|
| [15] |
熊晓夏, 刘擎超, 沈钰杰等. 基于LSTM-BF的高速公路交通事故风险模型[J]. 中国安全科学学报, 2022, 32(5): 170-176.
XIONG Xiaoxia, LIU Qingchao, SHEN Yujie, et al. Study on risk model of highway traffic accidents based on LSTM-BF[J]. China Safety Science Journal, 2022, 32(5): 170-176.
|
| [16] |
袁振洲, 胡嫣然, 杨洋. 考虑多维动态特征交互的高速公路实时事故风险建模[J]. 交通运输系统工程与信息, 2022, 22(3): 215-223.
YUAN Zhenzhou, HU Yanran, YANG Yang. Modeling towards freeway real-time traffic crash prediction considering multi-dimensional dynamic feature interactions[J]. Journal of Transportation Systems Engineering and Information Technology, 2022, 22(3): 215-223.
|
| [17] |
KAFFASH CHARANDABI N, GHOLAMI A, ABDOLLAHZADEH BINA A. Road accident risk prediction using generalized regression neural network optimized with self-organizing map[J]. Neural Computing and Applications, 2022, 34(11): 8511-8524. doi: 10.1007/s00521-021-06549-8
|
| [18] |
王贝贝, 万怀宇, 郭晟楠等. 融合局部和全局时空特征的交通事故风险预测[J]. 计算机科学与探索, 2021, 15(9): 1694-1702.
WANG Beibei, WAN Huaiyu, GUO Shengnan, et al. Local and global spatial-temporal networks for traffic accident risk forecasting[J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(9): 1694-1702.
|
| [19] |
LERIAN J C, CHENAYAN G. The implementation of multi label K-nearest neighbor algorithm to classifying essay answers[J]. Journal of Information System, Technology and Engineering, 2023, 1(3): 89-94. doi: 10.61487/jiste.v1i3.38
|
| [20] |
BANSAL M, GOYAL A, CHOUDHARY A. A comparative analysis of K-nearest neighbor, genetic, support vector machine, decision tree, and long short term memory algorithms in machine learning[J]. Decision Analytics Journal, 2022, 3: 100071. doi: 10.1016/j.dajour.2022.100071
|
| [21] |
DEVLIN J, CHANG M W, Lee K, et al. Bert: pre-training of deep bidirectional transformers for language understanding[C]//2019 Conference of the North American Chapter. Minnesota: Association for Computational Lin uistics, 2019: 4171-4186.
|
| [22] |
LIU Y, OTT M, GOYAL N, et al. Roberta: a robustly optimized bert pretraining approach[EB/OL]. (2019-07-26). https://arxiv.org/abs/190.11692.
|
| [23] |
尹何举, 昝红英, 陈俊怡, 等. 交通事故的自动判案研究[J]. 中文信息学报, 2019, 33(3): 136-144.
YIN Heju, ZAN Hongying, CHEN Junyi, et al. Study on automatic judgment of traffic accidents[J]. Journal of Chinese Information Processing, 2019, 33(3): 136-144.
|
| [24] |
张文峰, 奚雪峰, 崔志明, 等. 多标签文本分类研究回顾与展望[J]. 计算机工程与应用, 2023, 59(18): 28-48. doi: 10.3778/j.issn.1002-8331.2210-0446
ZHANG Wenfeng, XI Xuefeng, CUI Zhiming, et al. Review and prospect of multi-Label text classification research[J]. Computer Engineering and Applications, 2023, 59(18): 28-48. doi: 10.3778/j.issn.1002-8331.2210-0446
|
| [25] |
中华人民共和国国务院. 道路交通事故处理办法(国务院令第89号)[EB/OL]. (1992-01-01). https://www.mps.gov.cn/n2255079/n2256030/n2256036/c3946102/content.html. 2025-05-03.
|
| [26] |
WEI J, ZOU K. EDA: easy data augmentation techniques for boosting performance on text classification tasks[C]//Proceedings of the 2019 Conference on EMNLP-IJCNLP. HongKong: Association for Computational Linguistics, 2019, 11196: 1-15.
|