Text retrieval method based on matrix weighted association rules and mixed expansion of front and back components

A matrix weighting and hybrid expansion technology, applied in the field of information retrieval, can solve problems such as query subject drift and word mismatch

Active Publication Date: 2019-02-01
GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS
View PDF5 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In order to solve the above-mentioned problems, the present invention proposes a text retrieval method based on matrix weighted association rules, which is suitable for the field of information retrieval, improves and improves the performance of information retrieval, and solves the problems of query topic drift and word mismatch in information retrieval

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text retrieval method based on matrix weighted association rules and mixed expansion of front and back components
  • Text retrieval method based on matrix weighted association rules and mixed expansion of front and back components
  • Text retrieval method based on matrix weighted association rules and mixed expansion of front and back components

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] In order to better illustrate the technical solution of the present invention, the relevant concepts involved in the present invention are introduced as follows:

[0056] 1. Antecedents and postconditions of feature word association rules: Let x and y be any set of feature word items, and the implication of the form x → y is called feature word association rule, where x is called the antecedent of the rule, y is called the consequent of the rule.

[0057] 2. Mixed expansion of weighting rules before and after:

[0058] The mixed expansion of weighted rules and contexts means that the expansion words come from the antecedents and consequent itemsets of weighted association rules. Moreover, when the expansion word comes from the antecedent item set, the rule’s consequence must be the query term set. Similarly, when the expansion word belongs to the consequent item set, the rule’s antecedent must be the query term set.

[0059] 3. Feature word item set support

[0060] ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text retrieval method based on mixed expansion of front and back parts of matrix weighted association rules, First, the user queries and retrieves the document set to construct the related document set of the first-checked user. Then, the weighted value and frequency of the item set are fused with the total weighted value of the feature words and the total number of documents of the first-checked user. The frequent item set containing the original query term is mined. The candidate item set is pruned by item weight sorting, and by use of a confidence-correlation evaluation frame is used to excavate association rules from frequent item sets. Finally, the consequent association rules of the original query term and the consequent association rules of the original query term are used as extension words, and the extension words are combined with the original query term to retrieve the document set again to obtain the final retrieval result document and return it tothe user. The invention adopts the pruning method based on item weight sorting, improves the mining efficiency, adopts the weighted association rule before and after parts mixed expansion technology,and improves the text information retrieval performance.

Description

technical field [0001] The invention belongs to the field of information retrieval, and in particular relates to a text retrieval method based on matrix weighted association rules with mixed extensions of context and context. Background technique [0002] How to efficiently and accurately find more needed information from the ocean of information has always been a hot issue in the field of information retrieval. The current web search engines have alleviated people’s difficulties in retrieving information on the Internet to a certain extent. However, the current search engines or web information retrieval systems are often based on keyword mechanical symbol matching retrieval, which is difficult to avoid information overload and word retrieval. issues such as mismatches, for example, the query term is "computer", although "computer" describes almost the same meaning, but, to the information retrieval system, "computer" and "computer" are considered different search terms, I...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/41G06F16/43
Inventor 黄名选
Owner GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products