Edge computing cloud environment distributed Web page extraction and analysis system and method

An edge computing and analysis system technology, applied in the field of network communication, can solve problems such as spending hours or even days, low processing speed of government affairs office systems, etc., to facilitate horizontal expansion, solve the problem of wasting computing resources, and improve the production capacity ratio. Effect

Pending Publication Date: 2020-03-31
SHANXI TISCO INFORMATION & AUTOMATION TECH CO LTD
View PDF12 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The web vulnerability scanning methods in the prior art basically use scanning tools or hardware devices to scan the crawled website vulnerabilities. For common government affairs office systems (the average page files exceed 20,000), the processing speed is too low, generally from It will take hours or even days from scanning to the end of the analysis, so the premise of scanning each website needs to crawl all the pages under the website

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Edge computing cloud environment distributed Web page extraction and analysis system and method
  • Edge computing cloud environment distributed Web page extraction and analysis system and method
  • Edge computing cloud environment distributed Web page extraction and analysis system and method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0023] like Figures 1~3 As shown, an edge computing cloud environment distributed Web page extraction and analysis system adopts distributed deployment, including: task monitoring unit, a central management node and multiple computing nodes, and crawling strategy analysis module, crawling depth analysis Module, page crawling and deduplication module.

[0024] The task monitoring unit is used to monitor the scanning tasks submitted by the user, and put the new scanning tasks into the message queue.

[0025] A central management node is used to distribute work tasks to each computing node, collect the result data completed by the computing node at the same time, summarize and analyze each result data, and then perform data persistence processing.

[0026] Multiple computing nodes are used to complete the work tasks distributed by the central management node,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of network communication. Cloud computing has security risks such as data loss and leakage, sharing technical vulnerabilities, unsafe application program interfaces and the like, the webpage crawling efficiency is low, a large amount of time is required for subsequent vulnerability scanning, the invention provides an edge computing cloud environment distributed Web page extraction and analysis system and method. The central management node schedules the computing node to complete the work task according to the historical crawling efficiency, the reasonable allocation of resources is improved, and the crawling strategy and depth are analyzed and deduplicated; on the premise that a vulnerability scanning result is correct, the crawling speed of awebsite is effectively increased, transverse expansion of scanning capacity and reasonable utilization of computing resources are facilitated, higher transmission and response speed is brought by edgecomputing, the problem of computing resource waste of nodes in a traditional cloud computing system is solved, the productivity ratio is increased, and the resource utilization rate is obviously increased.

Description

technical field [0001] The present invention relates to the technical field of network communication, and more specifically, relates to a cloud environment distributed Web page extraction and analysis system and method for edge computing. Background technique [0002] Based on cloud computing technology, the main job function is to provide daily automated office support services for relevant work departments and related staff, comprehensively empower social development, improve intelligent management capabilities, improve work execution efficiency, integrate urban planning, and improve service efficiency. Efficiency; the cloud computing software service model has the characteristics of integrating software and hardware resources, lower client requirements and a unified maintenance platform. Applying the cloud computing model to the construction of the e-government platform can maximize the sharing of data resources and save construction and operation Cost, increase the load ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/958G06F16/951
CPCG06F16/958G06F16/951
Inventor 张宏巍张弋兰志超张森玮王帅琪李兆国
Owner SHANXI TISCO INFORMATION & AUTOMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products