Method and apparatus for machine learning a document relevance function

A relevance-function and relevance-score technology, applied in the field of search engines, that addresses problems such as relevance preferences changing over time.

Inactive Publication Date: 2006-08-30
OVERTURE SERVICE INC

AI Technical Summary

Problems solved by technology

These preferences are often difficult to capture in the set of algorithmic rules used to define the relevance function.
Moreover, these subjective factors may change over time, for example when current events become associated with specific query terms.



Embodiment Construction

[0023] Referring to Figure 1, computer network 100 includes one or more client computers 104 connected to network 105. Network 105 may be the Internet or, in other embodiments, an intranet. In embodiments where network 105 is the Internet, a collection of documents 103 known as the World Wide Web 102 may be accessed by client computers over network 105. On the Internet, documents are located by Uniform Resource Locators (e.g., "http://www.av.com"). By providing the URL to a document server (not shown), the document 103 corresponding to the URL can be accessed.

[0024] In addition to documents and client computers, computer network 100 includes a search engine. Examples of search engines available on the Internet include, but are not limited to, Alta Vista (URL http://www.av.com), Google (URL http://www.google.com), and Yahoo! (URL http://www.yahoo.com). Search engines typically include a database that indexes documents on the World Wide Web. A user of client computer 104-1 w...


Abstract

Provided is a method and computer program product for determining a document relevance function for estimating a relevance score of a document in a database with respect to a query. For each of a plurality of test queries, a respective set of result documents is collected. For each test query, a subset of the documents in the respective result set is selected, and a set of training relevance scores is assigned to documents in the subset. In one embodiment, at least some of the training relevance scores are assigned by human subjects who determine individual relevance scores for submitted documents with respect to the corresponding queries. Finally, a relevance function is determined based on the plurality of test queries, the subsets of documents, and the sets of training relevance scores.
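The steps in the abstract can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration, not taken from the patent: the two-element feature vectors, the `collect_results` and `judge` helpers, the toy corpus, and the choice of a linear function fitted by gradient descent. The abstract does not say what form the relevance function takes or how it is fitted.

```python
import random

# Toy result sets (made-up data). Each document carries a hypothetical
# feature vector: [query-term overlap, 1.0 if the title matches else 0.0].
TOY_RESULTS = {
    "query A": [
        {"id": "a1", "features": [0.9, 1.0]},
        {"id": "a2", "features": [0.2, 0.0]},
        {"id": "a3", "features": [0.6, 0.0]},
        {"id": "a4", "features": [0.4, 1.0]},
    ],
    "query B": [
        {"id": "b1", "features": [0.8, 0.0]},
        {"id": "b2", "features": [0.1, 1.0]},
        {"id": "b3", "features": [0.5, 1.0]},
        {"id": "b4", "features": [0.3, 0.0]},
    ],
}


def collect_results(query):
    """Hypothetical stand-in for running a test query against a search engine."""
    return TOY_RESULTS[query]


def judge(query, doc):
    """Hypothetical stand-in for a human-assigned training relevance score."""
    overlap, title_match = doc["features"]
    return 3.0 * overlap + 1.0 * title_match


def build_training_set(test_queries, k, seed=0):
    """For each test query, sample k result documents and score them."""
    rng = random.Random(seed)
    examples = []
    for query in test_queries:
        results = collect_results(query)
        subset = rng.sample(results, min(k, len(results)))
        for doc in subset:
            examples.append((doc["features"], judge(query, doc)))
    return examples


def fit_relevance_function(examples, epochs=5000, lr=0.5):
    """Fit score = w . x + b by batch gradient descent on squared error."""
    n_features = len(examples[0][0])
    w = [0.0] * n_features
    b = 0.0
    m = len(examples)
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in examples:
            err = sum(wi * xi for wi, xi in zip(w, x)) + b - y
            for j, xj in enumerate(x):
                grad_w[j] += err * xj
            grad_b += err
        w = [wi - lr * g / m for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / m

    def relevance(features):
        return sum(wi * xi for wi, xi in zip(w, features)) + b

    return relevance


training = build_training_set(["query A", "query B"], k=3)
relevance = fit_relevance_function(training)
```

The fitted `relevance` function can then score documents outside the training subsets, for example `relevance([0.7, 1.0])`, and results can be sorted by that score. A linear model is used only because it keeps the sketch short; any regression method could be substituted in `fit_relevance_function`.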

Description

Technical field

[0001] The present invention relates generally to the field of search engines for locating documents in a database, such as an index of documents stored on a server coupled to the Internet or in an intranet. In particular, the present invention relates to a method and apparatus for determining a document relevance function, which is used to estimate the relevance scores of documents in a database with respect to a query.

Background

[0002] The development of a search engine that is capable of indexing a large and diverse collection of documents while returning only a brief list of relevant result documents to the user in response to a query has long been considered a difficult problem. The Internet, which currently contains billions of documents stored on host computers worldwide, represents a particularly large collection of documents. A user of a search engine typically provides the search engine with a short q...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F, G06F7/00
CPC: Y10S707/99931, G06F17/30864, G06F17/30702, Y10S707/99937, G06F16/951, G06F16/337
Inventor: David Cossock (大卫·科索克)
Owner: OVERTURE SERVICE INC