Webpage classification identifying system and method based on vertical search and focused crawler technology
A technology focusing on crawlers and web page classification, applied in the field of web search engines, it can solve the problems of no directional extraction, difficult to judge crawling, and difficult to identify different types of web pages, so as to save network bandwidth, improve efficiency, and reduce the number of effects.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0034] The navigation website warehousing engine and broadband network user behavior analysis system developed in this embodiment adopts the B / S architecture, and the development platform is vs2005+oracle 9i. Users can easily access existing URL categories according to their needs. In the system. You only need to modify the configuration file during deployment, and it can run on one PC or on multiple PCs at the same time.
[0035] The following is a detailed introduction to the various modules of the design and their web page classification and recognition methods based on vertical search and focused crawlers. The specific processing process of the method of web page classification and recognition is as attached figure 1 , Follow the steps below:
[0036] (1) Read the URL list of the preset URL navigation site and judge whether the URL list is empty,
[0037] If it is empty, go to step (8);
[0038] (2) Take out a site URL and put it in the list of unvisited URLs (UV_URL list).
[00...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com