A method and system for collecting and publishing network information
A technology for information collection and network information, applied in digital data information retrieval, special data processing applications, instruments, etc., can solve problems such as time-consuming and labor-intensive, low efficiency, save labor and time consumption, and improve publishing efficiency Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0042] A kind of network information collection and publishing method provided by the present invention is introduced below, see figure 1 , embodiment one includes:
[0043] Step S101: Receive and analyze the information collection request sent by the user, and determine the keyword and data platform of the information to be collected.
[0044] Step S102: Collecting data from the data platform to obtain a plurality of keyword webpages including the keyword.
[0045] Step S103: Perform information capture on each of the keyword webpages to obtain multiple webpage contents.
[0046] Step S104: Calculate the originality of each item of the webpage content.
[0047] It should be noted that, in the present invention, the degree of originality is a parameter reflecting the originality of webpage content. The calculation process of the degree of originality is not the key point of the present invention, so it will not be described in detail. For example, when the content of the c...
Embodiment 2
[0052] Based on the above considerations, the present invention provides a second embodiment of a method for collecting and distributing network information. The second embodiment will be described below, see figure 2 , embodiment two mainly includes:
[0053]Step S201: Receive and analyze the information collection request sent by the user, and determine the keyword and data platform of the information to be collected.
[0054] Step S202: Create multiple collectors according to the keyword and the data platform, wherein each collector adopts a different user ID.
[0055] Wherein, the collector refers to a web crawler program written by a python program.
[0056] Step S203: Every preset time period, use multiple collectors to collect data on the data platform once, and obtain multiple keyword webpages containing the keyword.
[0057] Wherein, the grabber refers to a program written in java language to grab pages.
[0058] The preset time period can be determined by a timer...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com