The invention discloses a microblog
query expansion method based on multiple
layers. The microblog
query expansion method based on the multiple
layers is characterized in that keywords are extracted from a PRF (Pseudo
Relevance Feedback) layer of a corpus corresponding to original microblog query words and a web layer of an
external source to serve as candidate
query expansion words, the candidate query expansion words and original microblog query sentences are merged as a
label set for labeling documents in the PRF layer, moreover, Labeled LDA is utilized to semantically model for the labeled PRF documents, the candidate query expansion words and the microblog query words coming from the different sources are then mapped to a unified
semantic layer, the potential
semantics of the candidate query expansion words and the microblog query words are mined, and according to the
semantic similarity between the candidate query expansion words and the microblog query words, the candidate query expansion words which are irrelevant to the
semantics of the microblog query words are filtered out, so that a new microblog query word is formed for more accurate query and retrieval. Compared with the prior art, the microblog query expansion method based on the multiple
layers has the advantages of less query drifts, high retrieval efficiency and high accuracy, and in particular, the microblog query expansion method based on the multiple layers effectively integrates expansion words to achieve an optimal expansion effect, so that query results can meet the real information requirement of users.