A node and system in a distributed crawler cluster
A distributed, node-based technology, applied in transmission systems, digital transmission systems, special data processing applications, etc., can solve performance bottlenecks and large-scale expansion, lack of management, collaboration, url deduplication and network load balancing are difficult to solve and other issues to achieve the effect of realizing a large-scale distributed crawler cluster
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0013] Specific embodiments of the present invention will be further described in detail below in conjunction with the accompanying drawings.
[0014] The embodiment of the present invention builds the underlying overlay network by using the structured p2p algorithm kademlia, and establishes a communication mechanism between nodes; a complete set of crawling modules is run independently on each node, responsible for webpage crawling, data analysis and link extraction, etc. ;At the same time, a control center is configured on each node, which is responsible for receiving and distributing urls, load balancing and handling the transfer of url history records. Since each node has equal status and consistent functions, relying on the internal mechanism of the node to realize crawler cooperation, no additional operations outside the system are required for a single node to join the network, and the entire network can expand the number of crawler nodes at will to realize a large-scale...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com