Detection method and scanning engine of web pages
A detection method and scanning engine technology, which are applied in the field of web page detection methods and scanning engines, can solve the problems of false positives and false positives, and reduce user experience, so as to avoid false positives and improve user experience.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0030] Reference figure 1 , Shows a flow chart of the steps of a webpage detection method according to the first embodiment of the present invention.
[0031] The webpage detection method of this embodiment includes the following steps:
[0032] Step S10: Grab the URL or content of the target website, determine that it is a webpage based on the returned result, and visit the webpage.
[0033] The crawling of the URL (Uniform Resource Locator) or content of the target website can be realized by spider or crawler technology. The result returned by the spider or crawler is used to determine whether it is a web page of the website, and if it is determined to be a web page, the web page is visited.
[0034] Step S20: Determine whether the visited webpage meets at least one of the following rules: general abnormal page rules, custom abnormal page rules, and custom abnormal page behavior rules;
[0035] Among them, the general abnormal page rule is used to determine whether a web page is an ab...
Embodiment 2
[0039] Reference figure 2 , Shows a flowchart of the steps of a webpage detection method according to the second embodiment of the present invention.
[0040] This embodiment is a further preferred solution of embodiment 1. In this embodiment, the abnormal page includes other error pages other than the 404 page of the 404 page. Correspondingly, the general exception page rules include general 404 page rules and custom exceptions. Page rules include custom 404 page rules, custom error page rules, custom abnormal page behavior rules, and custom 404 page behavior rules.
[0041] The webpage detection method of this embodiment includes the following steps:
[0042] Step S102: Visit the webpage, and determine whether the visited webpage meets at least one of the following rules: general 404 page rules, custom 404 page rules, custom 404 page behavior rules, and custom error page rules.
[0043] Among them, the general 404 page rule is used to determine whether the web page is a 404 page ac...
Embodiment 3
[0048] Reference image 3 , Shows a flow chart of the steps of a webpage detection method according to the third embodiment of the present invention.
[0049] The webpage detection method of this embodiment includes the following steps:
[0050] Step S202: Collect at least one of general 404 page rules, custom 404 page rules, custom 404 page behavior rules, and custom error page rules.
[0051] In this embodiment, all of the foregoing rules can be set to collect, and in actual applications, only part of the foregoing rules can also be collected as required. When collecting the above rules, you can collect and set up and use it once, and then update the previously collected rules uniformly at an interval setting; you can also dynamically collect the rules and update them in real time.
[0052] The collected general 404 page rules may include: judging whether the webpage status code is 404, and / or judging whether the webpage content includes 404 page content, such as "404NOTFOUND", "404...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com