Method for webpage classification
A web page classification and web page technology, applied in the network field, can solve the problems of unknown URL classification, lack of coverage, time-consuming and labor-intensive, etc., and achieve the effect of rich content and high search efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0032] A method for classifying webpages in this embodiment includes a data collection layer, a webpage parsing layer, and an application presentation layer sequentially from bottom to top according to the data flow direction, such as figure 1 shown, including the following specific steps:
[0033] (1) Read the URL list of the preset URL navigation site, which stores many navigation URLs, such as www.hao123.com, www.sohu.com, etc.;
[0034] (2) Judging whether the URL list is empty, if empty, it means that the search has been completed, go to step 8 and end, if not empty, then continue to step 3;
[0035] (3) Take out a URL;
[0036] (4) Query the URL in the V_URL list of the visited URL storage table. V_URL stores all URL addresses that have been visited. If the URL is found in V_URL, it means that it has been visited. Then go to step 3, if not If it is found, it means that it has not been visited, then continue to step 5;
[0037] (5) Use the focused crawler technology to...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com