Method and device for extracting domain keywords
A technology for keywords and domain words, applied in the field of extracting domain keywords, can solve the problems of inability to effectively extract keywords, difficult to give results, and inability to effectively reflect the importance of keywords and the distribution of keywords.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0021] figure 1 It is a flow diagram of a method for extracting field keywords provided by Embodiment 1 of the present invention. This embodiment is applicable to when the user enters search words through the browser on the terminal to perform information retrieval, and the corresponding information website server extracts field text In the case of identifying the field to which the search term belongs, the method can be executed by a computer device with a field keyword extraction function such as an information website server. see figure 1 , the method specifically includes the following steps 101-103:
[0022] Step 101, generating a domain word frequency matrix composed of word frequencies of word segmentations in each domain description text.
[0023] The information website server may first obtain the description texts of various fields stored locally or obtained by crawling webpages. In this embodiment, the description text of each field can be the text contained in t...
Embodiment 2
[0035] figure 2It is a schematic flowchart of a method for extracting domain keywords provided by Embodiment 2 of the present invention. In this embodiment, on the basis of the above-mentioned embodiments, the step of decoupling the domain word frequency matrix into a low-rank background word frequency matrix and a sparse keyword frequency matrix according to a set algorithm is further described. see figure 2 , the method includes steps 201-206:
[0036] Step 201, generating a field word frequency matrix composed of word frequencies of word segmentations in each field description text.
[0037] Step 202, constructing the domain term frequency matrix as an additive model of the low-rank first term frequency matrix and the sparse second term frequency matrix.
[0038] Step 203, constructing an objective function with the smallest difference between the word frequency matrix in the field and the sum, wherein the restriction of the objective function is: the first word freque...
Embodiment 3
[0063] image 3 It is a schematic structural diagram of a device for extracting domain keywords provided in Embodiment 3 of the present invention. This embodiment is applicable to the situation when the user enters a search word through the browser on the terminal to search for information, and the corresponding information website server extracts the field keywords in the field text to identify the field to which the search word belongs. The specific structure is as follows:
[0064] The domain term frequency matrix generation module 301 is used to generate a domain term frequency matrix composed of the term frequency of each domain description text segmentation;
[0065] The domain word frequency matrix decoupling module 302 is used to decouple the domain word frequency matrix into the sum of the low-rank background word frequency matrix and the sparse keyword word frequency matrix according to the set algorithm;
[0066] The domain keyword extraction module 303 is configu...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com