Cloud-based website homepage structure monitoring method
A structure monitoring and home page technology, which is applied in the direction of website content management, network data retrieval, special data processing applications, etc., can solve the problems that whether the page has been tampered with cannot be sensed and monitored, the website monitoring system cannot detect, and the deformation of the home page cannot be monitored. , to achieve the effect of improving monitoring accuracy and timeliness, improving user experience, and improving timeliness
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0074]在某个服务小程序产品中使用了本发明的系统,具体应用方法如下:
[0075]S1、添加域名:在系统中添加待监测的网站域名清单www.xjtu.edu.cn;
[0076]S2、进行采集:间隔5分钟访问一次域名对应的网站首页http: / / www.xjtu.edu.cn / index.htm,下载首页网页源代码,用程序过滤掉首页网页源代码代码中的文字,IMG标签的src属性、A标签的href属性、标签中的src属性,只保留标签,生成首页标签代码;
[0077]S3、进行保存:在数据库中检查域名www.xjtu.edu.cn是否存在采集记录,如果是第一次采集的话,将首页网页源代码保存在pageCode目录下,命名为2020-08-20-11-30_index_pageCode.txt;首页标签代码保存在labelCode目录下,命名为2020-08-20-11-30_index_labelCode.txt;
[0078]S4、进行计算:如果不是第一次采集,则分别计算下载下来的首页页面代码与pageCode目录下最近一次历史文件2020-08-20-11-25_index_pageCode.txt的相似度;计算方法如下:
[0079](1)以两次采集的标签元素分别为行和列生成二维矩阵,矩阵的元素为两次生成的对应标签是否相等,如果相等则为1,不相等则为0,二维矩阵如下表1所示:
[0080]表1:
[0081]
[0082](2)计算两次标签变化数量为两次标签数量m和n的差值绝对值:
[0083]k=|m-n|=|13-13|=0;
[0084](3)计算矩阵上下三角元素之和为:
[0085]
[0086](4)计算举证对角线元素之和:
[0087]
[0088](5)计算举证对角线为0的元素之和为:
[0089]
[0090](6)计算首页标签相似度:
[0091]
[0092]S5、下载下来的首页标签代码与labelCode目录下最近一次历史文件2020-08-20-11-25index_labelCode.txt的相似度。计算方法如下:
[0093](1)按照本次采集的首页标签代码结构将本次采集首页网页源码和最近一次采集到的首页网页源码中的标签替换成空字符串,然后将空格和换行液体换成空字符串,只保留文本内容。分别记为本次采集首页文本内容NC,和最近一次采集首页文本内容OC,如下表2所示:
[00...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com