Method and devices of intercepting crawler
A crawler and page technology, applied in the network field, can solve the problems of inefficient interception of web crawlers, and normal users mistakenly think that they are web crawlers, etc., to achieve the effects of high concurrency, reduced pressure, and improved interception rate.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0036] Example 1, in one embodiment,
[0037] 1) The browser sends an HTTP request to the server, requesting the first page of the current category;
[0038] The server generates an image URL path containing the cookie value and saves it to the first page;
[0039] The server side pre-sets the range of pages that allow direct access to pages as 1-10 pages, and the server side judges that the first page belongs to the direct access range, so it returns the first page that includes the image URL path to the browser;
[0040] The browser automatically downloads the picture to the browser according to the URL path of the picture contained in the returned page of the first page of the current category; parses the picture with the JS method, extracts the cookie value, and saves it; carries the cookie value when turning the page later .
[0041] 2) The browser sends an HTTP request carrying a cookie value to the server, requesting page 10 of the current category;
[0042] The serv...
Embodiment 2
[0050] Embodiment 2, in another embodiment,
[0051] If the browser receives a link to page 10 of the category, then,
[0052] The browser sends an HTTP request to the server, requesting page 10 of the current category;
[0053] The server side generates the image URL path containing the cookie value and saves it to page 10;
[0054] The server side pre-sets the range of pages that allow direct access to pages 1-10, and the server judges that the 10th page belongs to the direct access range. Therefore, although the HTTP request does not contain a cookie value at this time, it will directly include pictures. Page 10 of the URL path is returned to the browser.
[0055] The browser automatically downloads the picture to the browser according to the URL path of the picture contained in the returned page of the 10th page of the current classification; parses the picture with the JS method, extracts the cookie value in it, and saves it; carries the cookie value when turning the pa...
Embodiment 3
[0056] Embodiment three, in another embodiment,
[0057] If the browser receives a link to category page 11, then,
[0058] The browser sends an HTTP request to the server, requesting page 11 of the current category;
[0059] The server generates an image URL path containing the cookie value and saves it to page 11;
[0060] The server judges that the 11th page does not belong to the scope of direct access. Therefore, it further judges whether there is a cookie value in the HTTP request. Since it is a link directly received by the browser, the HTTP request does not contain a cookie value. Therefore, to browse The browser returns to the first page of the current category.
[0061] Next, if you want to continue to visit other pages, you can repeat the operation in Embodiment 1 to achieve normal page visits.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com