A Kalman filter word vector learning method based on Diesel process, the method includes: training and preprocessing the corpus, generating an LDS language model system, initializing the system parameters, assuming that the process noise satisfies a normal distribution, defining the aggregation class theta t =(μ t ,∑ t ), μ t For the frequency of word t in the corpus, calculate θ t The prior distribution of Dirichlet, the posterior distribution is calculated by Kalman filter derivation and Gibbs sampling estimation, the candidate clusters are extracted by MCMC sampling algorithm, the selection probability of the candidate clusters is calculated, and the candidate with the highest probability value is selected Choose the cluster as θ t , calculate the estimated value of the minimum mean square error of the clustering, substitute the calculation result into the LDS language model, train the model through the EM algorithm, make the model parameters stable, input the preprocessed corpus into the trained LDS language model, and pass Carl The Mann filter updates the formula in one step to compute the implicit vector representation.