Adaptive method and apparatus for automatic regression detection and block matching for web page changes

An automatic regression detection and block matching technology, applied in network data retrieval, website content management, and other database retrieval. It addresses the problem of high requirements for device computing capability, and has the effects of reducing manpower and time costs, reducing economic losses, and lowering the maintenance threshold.

Active Publication Date: 2019-02-15
BEIJING INTERNETWARE LTD
25 Cites, 10 Cited by

AI Technical Summary

Problems solved by technology

However, this method requires the support of a large amount of data, and at the same time has high requirements for the computing power of the device.



Examples


Embodiment Construction

[0028] Specific embodiments of the present invention will be described below in conjunction with the accompanying drawings.

[0029] As shown in Figure 1, the adaptive method of the present invention for automatic regression detection and block matching of webpage changes includes the following steps: detecting webpage changes, that is, detecting whether the webpages of the old and new target systems have changed and producing a report; and performing content block matching, that is, analyzing the webpages after a change report is received, finding the content blocks of the new target system's webpage that correspond to those of the old target system's webpage, and then, based on the changes in the corresponding content blocks, giving code modification suggestions for the existing data extraction system or tool.
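The change-detection step in [0029] can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `TagPathCollector`, `structure_signature`, and `detect_change` are hypothetical, and the choice to compare pages by their sequence of tag paths is an assumption made only for illustration.

```python
import difflib
from html.parser import HTMLParser

# Hypothetical helper: records the path of open tags at every start tag,
# giving a crude structural signature of a page.
class TagPathCollector(HTMLParser):
    VOID_TAGS = {"br", "img", "input", "hr", "meta", "link"}

    def __init__(self):
        super().__init__()
        self.stack = []
        self.paths = []

    def handle_starttag(self, tag, attrs):
        self.paths.append("/".join(self.stack + [tag]))
        if tag not in self.VOID_TAGS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        if tag in self.stack:
            # Pop until the matching open tag has been removed.
            while self.stack and self.stack.pop() != tag:
                pass


def structure_signature(html):
    collector = TagPathCollector()
    collector.feed(html)
    collector.close()
    return collector.paths


def detect_change(old_html, new_html):
    """Return a small report: whether the structure changed, and which paths differ."""
    old_sig = structure_signature(old_html)
    new_sig = structure_signature(new_html)
    matcher = difflib.SequenceMatcher(None, old_sig, new_sig, autojunk=False)
    removed, added = [], []
    for op, i1, i2, j1, j2 in matcher.get_opcodes():
        if op in ("delete", "replace"):
            removed.extend(old_sig[i1:i2])
        if op in ("insert", "replace"):
            added.extend(new_sig[j1:j2])
    return {"changed": bool(removed or added), "removed": removed, "added": added}


old_page = "<html><body><div><span>Price</span></div></body></html>"
new_page = "<html><body><section><span>Price</span></section></body></html>"
report = detect_change(old_page, new_page)
```

Here the report flags the page as changed and lists the tag paths that disappeared and appeared. A real detector would presumably also compare text and rendering, which this sketch ignores.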

[0030] The automatic regression detection and block matching adaptive device for web page changes of the present invention inc...



Abstract

The invention provides an adaptive method and apparatus for automatic regression detection and block matching of web page changes, which can dynamically detect changes in a target web page and give modification suggestions. The adaptive method comprises the following steps: detecting web page changes, that is, detecting whether the web pages of the new and old target systems have changed and providing a report; and carrying out content block matching, in which, after the change report is received, the web pages are analyzed to find the content blocks of the new target system's web page that correspond to those of the old target system's web page. The content block matching includes the following steps: a text analysis step, which obtains the semantic information and the text area; a graphical interface analysis step, which obtains the graphic area; and a mapping step, which performs similarity matching on the semantic information, text area, and graphic area obtained above and then, based on the changes in the corresponding content blocks, gives code modification suggestions for the existing web page data extraction system or tool.
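The mapping step in the abstract can be sketched in Python as follows. This is an illustration under stated assumptions, not the method claimed in the patent: the `Block` type, the 0.7 text weight, the 0.5 threshold, the use of `difflib` for text similarity, intersection-over-union for area similarity, and the greedy pairing are all hypothetical choices.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass(frozen=True)
class Block:
    name: str   # hypothetical label for the content block
    text: str   # stand-in for the block's semantic information
    box: tuple  # (x, y, width, height) of the block on the rendered page


def iou(a, b):
    """Intersection over union of two (x, y, width, height) rectangles."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    ix = max(0, min(ax + aw, bx + bw) - max(ax, bx))
    iy = max(0, min(ay + ah, by + bh) - max(ay, by))
    inter = ix * iy
    union = aw * ah + bw * bh - inter
    return inter / union if union > 0 else 0.0


def similarity(old, new, text_weight=0.7):
    # Assumed scoring: weighted mix of text similarity and area overlap.
    text_sim = SequenceMatcher(None, old.text, new.text).ratio()
    return text_weight * text_sim + (1 - text_weight) * iou(old.box, new.box)


def match_blocks(old_blocks, new_blocks, threshold=0.5):
    """Greedily pair old and new blocks, best-scoring pairs first."""
    scored = sorted(
        ((similarity(o, n), i, j)
         for i, o in enumerate(old_blocks)
         for j, n in enumerate(new_blocks)),
        reverse=True,
    )
    used_old, used_new, mapping = set(), set(), {}
    for score, i, j in scored:
        if score < threshold:
            break
        if i in used_old or j in used_new:
            continue
        used_old.add(i)
        used_new.add(j)
        mapping[old_blocks[i].name] = new_blocks[j].name
    return mapping


old_blocks = [
    Block("old_title", "Breaking news today", (0, 0, 100, 20)),
    Block("old_price", "Price: 10 USD", (0, 30, 100, 20)),
]
new_blocks = [
    Block("new_price", "Price: 12 USD", (0, 0, 100, 20)),
    Block("new_title", "Breaking news today", (0, 30, 100, 20)),
]
mapping = match_blocks(old_blocks, new_blocks)
```

In this toy case the two blocks have swapped positions, so the text term carries the match. The resulting mapping is the kind of input from which a code modification suggestion (for example, an updated locator) could then be derived.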

Description

Technical field

[0001] The invention relates to the technical field of webpage data extraction, in particular to an adaptive method and device for automatic regression detection and block matching of webpage changes.

Background

[0002] With the development of Web technology and the arrival of the era of big data, webpage systems have come to contain more and more information, which has led to the development of various technologies for extracting data (information carriers) from them. To extract data from a webpage, it is first necessary to locate the required data within the page. Common techniques for locating elements in a webpage include positioning based on XPath (XML Path Language), positioning based on CSS (Cascading Style Sheets) selectors, and other simple positioning based on the id attribute, the name attribute, and so on. These positioning techniques depend heavily on the structure of the webpage. Aft...
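The structural dependence of path-based positioning noted in the background can be shown with a short Python sketch. The two pages, the `extract` helper, and both locator paths are hypothetical, and the sketch uses the limited XPath subset of the standard library's `xml.etree.ElementTree` on well-formed markup rather than a full XPath engine.

```python
import xml.etree.ElementTree as ET

# Hypothetical old and new versions of the same page; the new one wraps
# the content in an extra <section> element.
OLD_PAGE = "<html><body><div><span id='price'>10 USD</span></div></body></html>"
NEW_PAGE = (
    "<html><body><section><div><span id='price'>12 USD</span></div>"
    "</section></body></html>"
)

STRUCTURAL_PATH = "./body/div/span"   # depends on the exact nesting
ID_PATH = ".//span[@id='price']"      # depends only on the id attribute


def extract(page, path):
    """Return the text of the first element matching path, or None."""
    node = ET.fromstring(page).find(path)
    return None if node is None else node.text


old_value = extract(OLD_PAGE, STRUCTURAL_PATH)
broken_value = extract(NEW_PAGE, STRUCTURAL_PATH)
id_value = extract(NEW_PAGE, ID_PATH)
```

The structural locator finds the value in the old page but returns `None` once the nesting changes. The id-based locator survives this particular change, although it would fail in the same way if the id were renamed or removed.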


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/958, G06F8/71
CPC: G06F8/71
Inventor: 张颖, 杨威, 徐经纬, 苏星, 黄罡
Owner BEIJING INTERNETWARE LTD