The invention discloses an LDA-based Chinese question mapping method, which includes using the LDA
topic model to classify the document base, and then using the Softmax regression model to classify the part-of-speech of the question, and according to the difference of the part-of-
speech classification, the weight of the
content word is higher than that of the
function word High, but the weights of different parts of speech in content words are not the same, and then use the syntactic analysis based on
dependency grammar to find out the dependency relationship of words in the
sentence, and give different weights according to the different components of words in the
sentence, so the problem The weight of each word in is obtained by the product of two parts, and finally, according to Bayesian rules, the connection is established through the weighted distribution of words in the question and the distribution of topics and terms in the document. The
topic model based on LDA classifies the documents, and at the same time, assigns different weights by referring to the
part of speech of the terms in the question
sentence and the components in the sentence, so as to improve the role of important terms in classification and improve the mapping of Chinese questions. accuracy.