Collection method for vertical data of web spider
A technology of data collection and web spider, which is applied in the fields of electrical digital data processing, special data processing applications, instruments, etc. Application prospect, accurate effect of resources
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0030] The present invention will be further described below in conjunction with the accompanying drawings.
[0031] Such as figure 1 The system block diagram of the network vertical crawling system is shown. The process of the network vertical crawling system is to obtain input from the Internet (initially contains the user-specified starting seed URL class library collection, which can be one or more), and parse the URL class library The server address indicated in , establish a connection, send a request and receive data, store the obtained webpage data in the original webpage database, extract the link information from it and put it into the webpage structure database, and put the URL to be captured into the URL class Library, to ensure the recursive process of the whole process, until the URL library is empty, the web spider vertical search system provides retrieval services, it is necessary to save the original text of the webpage, and the collected webpage should be sto...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com