Abstract:To effectively obtain the conformance clauses in the transportation information standards and simplify the standard methods, combining the bidirectional long short-term memory (BLSTM)based text enhancement representation with the CNN based sentence classification, the classification method was proposed to classify the conformance clauses for solving the problems of lack of context meaning in convolution neural network and gradient disappearance and gradient dispersion in cyclic neural network in the existing text classification methods. The core idea was to add the vectors generated by the forward and backward processes of BLSTM, and the added vectors were spliced with the original vector as vector representation of the text. The text was classified as the input of CNN network. To verify the proposed model, the comparative test with traditional TF-IDF+SVM machine model, single CNN, BLSTM neural network model and classic hybrid model was set up. According to the test of the data set of standard terms of transportation information, the accuracy of the chain-mixed neural network model based on the improved BLSTM and CNN reaches 93.77%.
范维克, 张绍阳, 陈博远, 王珂. 交通信息标准条款BLSTM和CNN链式模型分类方法[J]. 江苏大学学报(自然科学版), 2020, 41(2): 143-148.
FAN Weike, ZHANG Shaoyang, CHEN Boyuan, WANG Ke. Classification methods of traffic standard terms based on BLSTM and CNN chain model[J]. Journal of Jiangsu University(Natural Science Eidtion)
, 2020, 41(2): 143-148.
BERNA A, CAN G M. Semantic text classification: a survey of past and recent advances[J]. Information Processing & Management, 2018, 54(6):1129-1153.
[2]
MIKOLOV T,SUTSKEVER I,CHEN K,et al. Distributed representations of words and phrases and their compositionality[C]∥Advances in Neural Information Processing Systems,2013:3111-3119.
[3]
PENNINGTON J, SOCHER R, MANNING C D. Glove: global vectors for word representation[C]∥Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing,2014: 1532-1543.
[4]
SOCHER R, HUVAL B, MANNING C D, et al. Semantic compositionality through recursive matrix-vector spaces[C]∥Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012: 1201-1211.
ZHANG X C,SANG R T,ZHOU Z H,et al. A short text classification method based on two-channel convolutional neural network[J]. Journal of Chongqing University of Technology(Natural Science),2019, 33(1):45-52.(in Chinese)
[6]
SONG J Y, HE Y, FU G H. Polarity classification of short product reviews via multiple cluster-based SVM classifiers[C]∥Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation,2015: 267-274.
[7]
LIU T F, YU S Y, XU B M, et al. Recurrent networks with attention and convolutional networks for sentence representation and classification[J]. Applied Intelligence, 2018,48(1):3797-3806.
[8]
WANG P, XU J M, XU B, et al. Semantic clustering and convolutional neural network for short text categorization[C]∥Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, 2015: 352-357.
[9]
SHEN Y, ZHANG Q, ZHANG J, et al. Improving me-dical short text classification with semantic expansion using word-cluster embedding[J].Lecture Notes in Electrical Engineering, 2019,514:401-411.
LI Y,DONG H B. Text sentiment analysis based on convolution neural network and BLSTM network feature fusion[J].Journal of Computer Applications,doi:10.11772/j.issn.1001-9081.2018041289.(in Chinese)
CHEN J,SHAO Z Q,ZHANG H H,et al. Short text sentiment analysis based on parallel hybrid neural network model[J]. Journal of Computer Applications,2019,39(8):2192-2197. (in Chinese)
[12]
ZHOU C T, SUN C L, LIU Z Y, et al. A C-LSTM neural network for text classification[J]. Computer Science,2015,1(4): 39-44.
SUN W,HUANG Z,ZHANG X F. Research on method of literature dataset construction for domain analysis based on feature measure [J]. Digital Library Forum,2015(12):9-14. (in Chinese)