Automatic webpage classification method based on network hot word identification
An automatic classification and hot word technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of multiple manual review steps, hot word response lag, etc., to achieve the effect of improving accuracy and reducing workload
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment approach
[0026] Step 1, use a customized crawler to obtain the content information of the webpage.
[0027] Step 2, performing a word segmentation operation on the extracted webpage content.
[0028] Step 3: compare the word segmentation results of the webpage content with the established Internet keyword category library, and then list the total number of categories that the webpage may belong to as M.
[0029] Step 4, if the value of M is greater than or equal to 2, go to step 5, otherwise go to step 7.
[0030] Step 5: Randomly select two categories from M categories each time, and use formula (2) to determine which category the content of the web page belongs to, so that a total of a comparison result.
[0031] Step 6, analyze the According to the comparison results, the category information of the webpage content is obtained, and the word segmentation results are written into the hot word association data thesaurus according to the category.
[0032] Step 7, if the value of ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com