Website data anti-scrabbling method, device and equipment and storage medium

A website and data technology, applied in the field of data scraping, can solve problems such as increased website bandwidth burden, crawling of website core text, affecting real user operations, etc., to achieve rapid batch confusion, reduce impact, and improve efficiency.

Pending Publication Date: 2022-06-21
企知道科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] From the perspective of website business security, while crawlers bring considerable traffic to the website, they also bring immeasurable threats and losses
For example, malicious crawlers can lead to crawling of the core text of the website, scanning of registered user information, and increased bandwidth burden on the website; therefore, in order to improve the security of website data, it is necessary to set up an anti-pickup mechanism to reduce the situation that website data is stolen by crawlers
[0004] The key point of the website anti-pickup mechanism is to identify whether the operator is a user or a crawler. It is necessary to avoid affecting the operation of the real user on the basis of identifying the crawler. However, the related technologies adopt limiting the number of user visits, adding image verification code verification and text messages. Anti-cheat methods such as verification code verification have problems affecting real user operations

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Website data anti-scrabbling method, device and equipment and storage medium
  • Website data anti-scrabbling method, device and equipment and storage medium
  • Website data anti-scrabbling method, device and equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] The embodiment of the present application also discloses a computer-readable storage medium.

[0062] Specifically, the computer-readable storage medium stores a computer program that can be loaded by a processor and executes the above-mentioned website data scraping method, and the computer-readable storage medium includes, for example: U disk, mobile hard disk, read-only memory (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disk or optical disk, and various media that can store program codes.

[0063] This specific embodiment is only an explanation of the present invention, and it is not a limitation of the present invention. Those skilled in the art can make modifications to this embodiment without creative contribution as required after reading this specification, but as long as they are within the rights of the present invention All claims are protected by patent law.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a website data anti-scrabbling method, device and equipment and a storage medium, and is applied to the field of data anti-scrabbling, and the method comprises the steps that to-be-processed website information is acquired, and the website information comprises webpage content in a website page; randomly extracting characters from a preset character database, and constructing a mapping relation between the webpage content and the characters to form a character relation file; when a website front-end data acquisition request is received, acquiring interface information corresponding to the login request; comparing the interface information with pre-matched interface information; if the interface information is consistent with pre-matched interface information, outputting the webpage content; otherwise, confusing the webpage content according to the character relation file to generate a confused document, and outputting the confused document. The method and the device have the technical effects that the anti-scrabbling process is non-inductive to a normal user, and the influence of a website anti-scrabbling mechanism on real user operation is reduced.

Description

technical field [0001] The present application relates to the technical field of data anti-pickup, in particular to a website data anti-pickup method, device, equipment and storage medium. Background technique [0002] A web crawler is a program or script that automatically grabs Internet information according to certain rules. According to the crawler survey report released by AberdeenGroup based on the data of hundreds of companies in North America, real people only account for 54.4% of the total traffic, and the remaining traffic consists of 27% friendly crawlers and 18.6% malicious crawlers. [0003] From the perspective of website business security, while crawlers bring considerable traffic to the website, they also bring immeasurable threats and losses. For example, malicious crawlers can lead to crawling of the core text of the website, scanning of registered user information, and increased bandwidth burden on the website; therefore, in order to improve the security ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L9/40
CPCH04L63/145
Inventor 陈龙珠付冠叶彭卓勋程启飞
Owner 企知道科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products