Method for ranking web pages on the basis of hyperlink source analysis
A web page ranking method based on hyperlink source analysis, applied in the field of information retrieval, that addresses problems such as web page cheating
Examples
Example 1
[0082] Example 1: Comparison of the present invention with four existing algorithms at suppressing web page cheating, using a synthetic network
[0083] The experimental data is a synthetic scale-free network generated with the BA model (Barabási-Albert model). The model parameters are shown in Table 1. The generated network contains 100 nodes and 1098 edges, and the network diameter is 4.
[0084] Table 1 Parameter settings of the BA model
[0085] Initial number of nodes: 5
Probability that an edge exists between the initial nodes: 0.3
Average node degree: 10
Total number of nodes in the network: 100
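As an illustration of how a network with the Table 1 parameters might be generated, the following is a minimal sketch of BA-style preferential attachment using only the Python standard library. The function name `generate_ba_network`, the choice of `m = 5` new edges per node (so that the average undirected degree is close to 10), and the degree-plus-one attachment weight are assumptions made for this sketch; the source does not state them, and the sketch does not attempt to reproduce the reported 1098 edges.

```python
import random


def generate_ba_network(n_total=100, n_initial=5, p_initial=0.3, m=5, seed=0):
    """Return an undirected BA-style graph as a dict: node -> set of neighbours."""
    rng = random.Random(seed)
    adj = {i: set() for i in range(n_initial)}

    # Seed graph: each pair of initial nodes is linked with probability p_initial.
    for i in range(n_initial):
        for j in range(i + 1, n_initial):
            if rng.random() < p_initial:
                adj[i].add(j)
                adj[j].add(i)

    # Growth: each new node attaches to m distinct existing nodes,
    # chosen with probability proportional to (degree + 1).
    # The "+ 1" is an assumption so that isolated seed nodes can still be picked.
    for new in range(n_initial, n_total):
        existing = list(adj)
        weights = [len(adj[v]) + 1 for v in existing]
        targets = set()
        while len(targets) < min(m, len(existing)):
            targets.add(rng.choices(existing, weights=weights)[0])
        adj[new] = set()
        for t in targets:
            adj[new].add(t)
            adj[t].add(new)
    return adj


network = generate_ba_network()
edge_count = sum(len(nbrs) for nbrs in network.values()) // 2
print(len(network), "nodes,", edge_count, "undirected edges")
```

The edge count of such a sketch depends on `m` and on whether edges are counted as directed or undirected, so it should be tuned against the figures in paragraph [0083] before being used for any comparison.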
[0086] The experiment uses the following two common cheating methods to test how well each algorithm suppresses cheating:
[0087] (1) Link exchange cheating: several nodes in the network are designated as cheating nodes, and these nodes add links to each other t...
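The link exchange setup can be sketched as follows. Because the paragraph above is truncated, this sketch assumes the simplest reading, in which every designated cheating node links to every other cheating node; the function name `add_link_exchange` and the dict-of-sets graph representation are hypothetical and not taken from the source.

```python
from itertools import permutations


def add_link_exchange(out_links, cheaters):
    """Return a copy of a directed graph (node -> set of targets) in which
    every cheating node links to every other cheating node."""
    spammed = {node: set(targets) for node, targets in out_links.items()}
    # permutations(..., 2) yields each ordered pair, so links go both ways.
    for src, dst in permutations(cheaters, 2):
        spammed.setdefault(src, set()).add(dst)
    return spammed


graph = {0: {1}, 1: {2}, 2: set(), 3: {0}}
spammed = add_link_exchange(graph, cheaters=[1, 2, 3])
print(spammed)
```

The original graph is left unmodified, so the rankings computed before and after the cheating links are added can be compared on the same node set.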
Example 2
[0100] Example 2: Comparison of the present invention with four existing algorithms at suppressing web page cheating, using real network data
[0101] The experimental data is the WEBSPAM-UK2007 data set provided by Yahoo Labs. The data set covers 114,529 websites, together with the web pages and links under them. Volunteers have labeled some of the websites as "non-cheating" or "cheating" at the host level; the details are shown in Table 3. This experiment uses the host-level network: if a page on one website points to a page on another website, there is a directed edge between the two website hosts. Because the TrustRank, DiffusionRank and AIR algorithms all require seed sets, some of the manually labeled "non-cheating" websites are used as seed sets for these algorithms. The remaining "non-cheating" sites, together with sites under domains such as gov, ac, mod, nhs and sch, constitute the collection of auth...
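The host-level construction described in this example can be sketched as follows: page-to-page links are collapsed into directed edges between hosts, and links within the same host are dropped. The function name `host_graph` and the sample URLs are hypothetical; the actual data set is distributed in its own file format, which this sketch does not parse.

```python
from urllib.parse import urlsplit


def host_graph(page_links):
    """Collapse (source_url, target_url) page links into a set of
    directed (source_host, target_host) edges between distinct hosts."""
    edges = set()
    for src_url, dst_url in page_links:
        src = urlsplit(src_url).hostname
        dst = urlsplit(dst_url).hostname
        # Skip malformed URLs and links that stay within one host.
        if src and dst and src != dst:
            edges.add((src, dst))
    return edges


links = [
    ("http://a.example.co.uk/index.html", "http://b.example.co.uk/home"),
    ("http://a.example.co.uk/about.html", "http://b.example.co.uk/news"),
    ("http://a.example.co.uk/index.html", "http://a.example.co.uk/about.html"),
    ("http://b.example.co.uk/home", "http://www.example.gov.uk/"),
]
print(sorted(host_graph(links)))
```

Using a set means that several page links between the same pair of hosts produce a single host-level edge, which matches the "there is a directed edge" wording above; a weighted variant would count the links instead.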