Method and device for excavating search log and page search method and device
A timeliness and log technology, applied in the Internet field, can solve the problems of users being unable to find, understand, and identify the timeliness needs of users.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0059] figure 1 The flow chart of the method for mining search logs provided by the present invention, such as figure 1 As shown, the method can include the following steps:
[0060] Step 101: Perform word segmentation processing on the query captured from the search log.
[0061] When fetching a query from the search log, the fetching strategy can adopt one or any combination of the following strategies:
[0062] Fetching strategy 1: Fetch queries in which the proportion of pages clicked by the user within the first time period in the most recent first time period in the search results exceeds the preset first proportion threshold. For example, suppose that the most recent first time period is within the last 2 days, and the preset first ratio threshold is 50%. If a query’s search results are published within the last 2 days, pages that are If the percentage of clicks on the total page is 70%, the query can be captured. For another example, if the publication time of the page clic...
Embodiment 2
[0092] figure 2 The flow chart of the page search method provided by the present invention, such as figure 2 As shown, the method can include the following steps:
[0093] Step 201: Perform word segmentation processing on the query input by the user.
[0094] Step 202: Utilize the combination of each word and / or the attribute of each word obtained after word segmentation processing, and the distribution probability of each combination to summarize the type corresponding to the query.
[0095] The processing method of the query input by the user in step 201 to step 202 is the same as the processing method of the captured query in step 101 to step 102, and will not be repeated here.
[0096] Step 203: Look up the timeliness probability table, and determine the timeliness probability corresponding to the type summarized in step 202.
[0097] Step 204: If the highest value of the determined timeliness probability exceeds the preset timeliness probability threshold, it is determined that t...
Embodiment 3
[0110] image 3 It is a structural diagram of a search log mining device provided by an embodiment of the present invention, such as image 3 As shown, the mining device may include: a grabbing unit 300, a first word segmentation unit 310, a first type determination unit 320, a screening unit 330, and a probability calculation unit 340.
[0111] The grabbing unit 300 is used to grab the query from the search log.
[0112] The first word segmentation unit 310 is configured to perform word segmentation processing on the query captured by the capture unit 300.
[0113] The word segmentation processing method adopted by the first word segmentation unit 310 may include, but is not limited to: a word segmentation method for string matching, a word meaning word segmentation method, and a statistical word segmentation method.
[0114] The first type determining unit 320 is configured to use the combination of each word and / or attribute composition of each word and the distribution probability ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com