Intelligent retrieval method and device for calculating patent literature similarity based on word frequency and semantics, electronic equipment and storage medium thereof
A technology of semantic computing and patent documents, applied in the fields of intelligent retrieval, electronic equipment and its storage media, it can solve the problems of strong subjectivity of audit opinions, low accuracy of results, and single use method, so as to reduce the scope of examination and save manpower. and time, the effect of improving accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0032] see figure 1 , an intelligent retrieval method based on word frequency and semantic calculation of patent document similarity provided by this embodiment, the examples given are only used to explain the present invention, and are not intended to limit the scope of the present invention. The method specifically includes the following steps:
[0033] S101, for all the patent data in the question bank, extract the text information related to the content of the test question, organize it into structured data, and form a word segmentation result;
[0034] S102. Carry out word bag statistics and word vector conversion calculations for the word segmentation results of all the above patent data, and obtain the weight value of each word as preloaded data for model prediction;
[0035] S103. Load all the word bags, word vectors, and vocabulary data mentioned above, perform a full matching query according to the test question publication number, compare the similarity predicted b...
Embodiment 2
[0074] see figure 2 , is an intelligent data retrieval method based on a single server provided in this embodiment, and the examples given are only used to explain the present invention, and are not used to limit the scope of the present invention. The method specifically includes the following steps:
[0075] S201, extracting patent information and content from the XML file of the question bank and performing storage operations, the extracted content is downloaded into a CSV file of a specified field after preliminary cleaning and sorting in the patent database;
[0076] S202. After segmenting the full content, removing stop words, and screening high-frequency words, construct a vector model;
[0077] S203. Load the vector model data, and combine multiple sets of fusion results based on the literal-based bag-of-words algorithm and the semantic-based semantic algorithm to predict top-ranked patents.
[0078] Among them, S203 further includes:
[0079] S2031. Segment the co...
Embodiment 3
[0086] see image 3 , an intelligent retrieval device 210 for calculating the similarity of patent documents based on word frequency and semantics provided in this embodiment, the examples given are only for explaining the present invention, and are not intended to limit the scope of the present invention.
[0087] The device specifically includes the following components:
[0088] Data processing module 211: used to extract all patent text content according to fields and importance from the question bank, and obtain the data standard format for modeling;
[0089] Intelligent calculation module 212: used to carry out various calculations to the extracted standard data, and obtain model data reflecting its frequency, semantics and weight in the text;
[0090] Model building module 213: used to model and calculate model data, combine and optimize calculation results, and construct an intelligent retrieval model in combination with business requirements;
[0091] Model predicti...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com