Method for detecting and filtering duplicate Chinese web page documents based on full-stop feature word strings
A filtering method and feature-word technique, applied in the fields of electrical digital data processing and special data processing applications. It addresses the shortcomings of existing duplicate web page detection methods, which require a large amount of calculation, have poor detection accuracy, and struggle to achieve good processing results on both counts at the same time. The method is simple, easy to implement, and low in cost, and it enables simple and effective real-time detection and processing.
Embodiment Construction
[0044] The present invention is further illustrated below with reference to the accompanying drawings and specific embodiments. It should be understood that these embodiments serve only to illustrate the invention and are not intended to limit its scope. After reading the present disclosure, modifications in various equivalent forms made by those skilled in the art all fall within the scope defined by the appended claims of this application.
[0045] The main design idea and processing flow of the Chinese duplicate document detection method of the present invention are as follows: in order to detect duplicate documents among the huge number of web pages that a search engine retrieves in response to a user's search request, we propose and use a simple and effective feature, the Chinese period. Using the usage characteristics and statistical characteristics of Chinese periods in web page text, the method completes the f...
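The general idea of a period-based feature can be illustrated with a minimal sketch. The text above is truncated, so the details here are assumptions and not the patented procedure: the function names `period_feature_string` and `filter_duplicates`, the `width` parameter, the choice to take only the characters immediately before each Chinese full stop, and the exact-match comparison are all hypothetical.

```python
from typing import List, Set

FULL_STOP = "\u3002"  # the Chinese full stop character "。"


def period_feature_string(text: str, width: int = 2) -> str:
    """Build a feature string from the characters just before each full stop.

    Assumption: `width` characters of left context per full stop. The source
    does not state how the feature string is actually formed.
    """
    parts = []
    for i, ch in enumerate(text):
        if ch == FULL_STOP:
            parts.append(text[max(0, i - width):i])
    return "".join(parts)


def filter_duplicates(pages: List[str], width: int = 2) -> List[str]:
    """Keep the first page for each distinct feature string (hypothetical rule)."""
    seen: Set[str] = set()
    kept = []
    for page in pages:
        key = period_feature_string(page, width)
        if key:
            if key in seen:
                continue  # same feature string as an earlier page: treat as duplicate
            seen.add(key)
        # Pages without any full stop yield no feature and are always kept.
        kept.append(page)
    return kept


pages = [
    "今天天气很好。我们去公园。",
    "广告 今天天气很好。我们去公园。 版权",  # same body with navigation/footer noise
    "明天下雨。大家带伞。",
]
unique_pages = filter_duplicates(pages)
```

In this sketch the second page is dropped because its feature string matches the first, even though the surrounding noise text differs. That is the kind of robustness a period-based feature is meant to provide, since navigation bars and footers rarely contain full stops.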