Network link topology reconstruction method based on content
A technique for network link topology reconstruction, applicable to network data retrieval, network data indexing, and special-purpose data processing applications. It addresses problems such as the difficulty of finding spam web pages and the neglect of web page text information, with the effect of improving efficiency.
Embodiment Construction
[0026] The content-based network link topology reconstruction method of the present invention will be described in detail below with reference to the embodiments and the accompanying drawings.
[0027] The content-based network link topology reconstruction method of the present invention extends the TrustRank algorithm with webpage content analysis and reconstructs the network link topology from the perspective of content, which can improve the efficiency of detecting and identifying spam webpages.
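For orientation, the TrustRank baseline that the method builds on can be sketched as a PageRank-style iteration biased toward a set of trusted seed pages. This is a minimal illustration, not the procedure of the present invention: the function name `trust_rank`, the adjacency-dictionary input, the damping factor of 0.85, and the fixed iteration count are all assumptions chosen for the example.

```python
def trust_rank(out_links, seeds, damping=0.85, iterations=50):
    """Propagate trust from seed pages along outgoing links.

    out_links: dict mapping each page to the list of pages it links to
               (hypothetical input format).
    seeds:     set of pages assumed to be trustworthy.
    """
    # Collect every page that appears as a source or a target.
    pages = set(out_links)
    for targets in out_links.values():
        pages.update(targets)

    # Seed distribution: trust mass is spread evenly over the seed pages.
    seed_score = {p: (1.0 / len(seeds) if p in seeds else 0.0) for p in pages}
    trust = dict(seed_score)

    for _ in range(iterations):
        # Each round restarts from the seed distribution with weight (1 - damping).
        nxt = {p: (1.0 - damping) * seed_score[p] for p in pages}
        for page in pages:
            targets = out_links.get(page, [])
            if not targets:
                continue  # pages without out-links do not pass trust on
            share = damping * trust[page] / len(targets)
            for target in targets:
                nxt[target] += share
        trust = nxt
    return trust


if __name__ == "__main__":
    graph = {"seed": ["a"], "a": ["b"], "spam": ["spam2"], "spam2": ["spam"]}
    for page, score in sorted(trust_rank(graph, {"seed"}).items()):
        print(page, round(score, 4))
```

In this sketch, pages that cannot be reached from any seed receive no trust, and trust decays with link distance from the seeds. The sketch uses link structure only and takes no account of page content.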
[0028] As shown in Figure 1, the content-based network link topology reconstruction method of the present invention comprises the following steps:
[0029] 1) Eliminate redundant and irrelevant feature attributes from the content features and link features, and combine the remaining attributes into a new feature vector;
[0030] 2) Calculate the similarity between two connected webpages and determine the correlation between them; the closer the similarity, ...
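The similarity computation in step 2 can be illustrated with a short sketch. The text above does not state which similarity measure is used, so cosine similarity is an assumption here, as are the function names `cosine_similarity` and `link_similarities` and the representation of each page as a plain list of numeric features produced by step 1.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # an all-zero vector has no direction; treat as unrelated
    return dot / (norm_u * norm_v)


def link_similarities(features, links):
    """Score every directed link by the similarity of its two endpoint pages.

    features: dict mapping page -> feature vector (hypothetical format).
    links:    iterable of (source, target) page pairs.
    """
    return {
        (source, target): cosine_similarity(features[source], features[target])
        for source, target in links
    }


if __name__ == "__main__":
    features = {"a": [1, 0, 1], "b": [2, 0, 2], "c": [0, 3, 0]}
    links = [("a", "b"), ("a", "c")]
    for link, score in link_similarities(features, links).items():
        print(link, round(score, 3))
```

How the resulting per-link scores are used to reconstruct the topology is not covered by this sketch.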