The invention provides a Chinese query expansion method based on pattern mining and word vector similarity calculation. The method comprises the following steps: first, retrieving a Chinese document set with the user query to obtain an initial retrieved document set, and performing word vector semantic learning training on the initial retrieved document set to obtain a word vector set comprising query terms and non-query terms; then mining expansion words from the pseudo-relevance feedback document set by adopting a Copulas-function-based association expansion word mining method, and establishing an association expansion word set; next, computing the cosine similarity between pairs of vectors in the word vector set to obtain a word embedding expansion word set and a word vector association expansion word set; and finally fusing the word embedding expansion word set and the word vector association expansion word set to obtain the final expansion words, combining the final expansion words with the original query to form a new query, and retrieving the document set again to realize query expansion. By fusing association pattern mining with word vector learning, the method can mine high-quality expansion words and improve information retrieval performance, and it has good application value and popularization prospects.
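
The similarity and fusion steps described above can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not state the similarity threshold, the exact fusion rule, or how the two word sets are derived, so the sketch assumes that candidates are kept when their best cosine similarity to any query term reaches a threshold, and that fusion is a set union keeping the higher score. The function names, the toy two-dimensional vectors, the threshold of 0.7, and the list of mined association words are all hypothetical; the Copulas-based mining step itself is not reproduced and its output is simply supplied as input.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def filter_by_similarity(query_terms, candidates, vectors, threshold):
    """Keep candidates whose best similarity to any query term is >= threshold.

    Assumption: the abstract does not define the selection rule; a simple
    threshold on the maximum similarity is used here for illustration.
    """
    kept = {}
    for cand in candidates:
        if cand in query_terms or cand not in vectors:
            continue
        best = max(
            (cosine_similarity(vectors[q], vectors[cand])
             for q in query_terms if q in vectors),
            default=0.0,
        )
        if best >= threshold:
            kept[cand] = best
    return kept


def fuse(embedding_set, association_set):
    """Assumed fusion rule: union of both sets, keeping the higher score."""
    fused = dict(embedding_set)
    for term, score in association_set.items():
        fused[term] = max(score, fused.get(term, 0.0))
    # Rank by descending score, breaking ties alphabetically.
    return sorted(fused, key=lambda t: (-fused[t], t))


def expand_query(query_terms, expansion_words):
    """Append the final expansion words to the original query terms."""
    return list(query_terms) + [w for w in expansion_words if w not in query_terms]


# Hypothetical toy word vectors standing in for the trained word vector set.
vectors = {
    "检索": [1.0, 0.0],
    "查询": [0.9, 0.1],
    "搜索": [0.8, 0.6],
    "天气": [0.0, 1.0],
}
query = ["检索"]
# Hypothetical output of the association-pattern mining step.
mined_association_words = ["搜索", "天气"]

embedding_set = filter_by_similarity(query, vectors.keys(), vectors, 0.7)
association_set = filter_by_similarity(query, mined_association_words, vectors, 0.7)
final_words = fuse(embedding_set, association_set)
new_query = expand_query(query, final_words)
```

In this sketch the new query would then be submitted to the retrieval system for the second retrieval pass; in practice the word vectors would come from a model trained on the initial retrieved document set rather than from hand-written values.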