Method for establishing a data index database, method and device for generating search suggestions

A technology for establishing methods and search suggestions, which is applied in the fields of electronic digital data processing, special data processing applications, instruments, etc., can solve the problems of slow database construction, reduced timeliness of search and suggestion services, and decreased response speed, so as to improve efficiency and performance, reduce I/O read and write operations, and improve timeliness

Active Publication Date: 2018-09-04
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When the amount of map data is large and it is impossible to store all the data in the machine memory, the data can only be stored on the hard disk to build the database. In this way, there will inevitably be more frequent I / O read and write operations, which will lead to the speed of building the database. Very slow, the response speed gradually decreases with the expansion of the index size, reducing the timeliness of the search suggestion service

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for establishing a data index database, method and device for generating search suggestions
  • Method for establishing a data index database, method and device for generating search suggestions
  • Method for establishing a data index database, method and device for generating search suggestions

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0055] figure 1 It is a flowchart of the method for establishing a data index library provided by this embodiment, such as figure 1 As shown, the method includes:

[0056] Step S101: Sort all the data to be processed to form a data list to be processed, and perform step S102 and step S103 on the data in the data list to be processed respectively.

[0057] The data to be processed includes the point of interest data on the map or the search term input by the user with a frequency greater than a preset threshold. The data to be processed is sorted first, which is similar to the order of the existing dictionary. All the data to be processed are sorted in the order of letters, numbers, or strokes of the data name to form a list of data to be processed.

[0058] Specifically, for data whose data name is Chinese characters, the alphabetical order of pinyin can be used for sorting, and for homophones, the order can be further determined by strokes or tones. For numerical names, data is sor...

Embodiment 2

[0092] image 3 It is a flowchart of the method for establishing a data index library provided by this embodiment, such as image 3 As shown, the method includes:

[0093] Step S301: Divide all the data to be processed into multiple pieces of data to be processed that are not redundant with each other.

[0094] All the data to be processed need to be deduplicated first to avoid unnecessary repetitive operations, and then randomly or sequentially divided into multiple pieces of complementary and redundant data to be processed, and distributed to each processing device for processing.

[0095] For example, "Tsinghua University", "Tsinghua University West Gate", "Tsinghua University East Gate", "Tsinghua East Road", "Tsinghua West Road", "Tsinghua Park", "Qingqin Road", "Qinglin Road", After removing the duplicates, "Qinglin East Road" is randomly divided into A and B. Among them, A includes "Tsinghua University", "Tsinghua University West Gate", "Qingqin Road", "Qinglin Road", and "Qin...

Embodiment 3

[0109] Figure 4 It is a flowchart of the search suggestion generation method provided in this embodiment. Such as Figure 4 As shown, the method includes:

[0110] Step S401: Receive a search term input by the user.

[0111] Word by word or word by word to receive the search words input by the user on the screen. For example, the user inputs "清", "清华" or "qing".

[0112] Step S402: Use the received search term as a prefix segment, and find data whose key value is the prefix segment from a pre-established data index database.

[0113] The data index library is pre-established using the method for establishing a data index library provided by the present invention. Use the received search term to find matching data from the data index library. For example, when the user enters "clear" on the screen, the index of "clear" will be found from the data index database, and all the data contained in the corresponding "clear" will be obtained.

[0114] Step S403: Sort the searched data accor...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a creating method for a data index base and searching suggest generation method and device. The creating method comprises the following steps of: sequencing all data to be processed to form a sequence of data to be processed; and reading data from the sequence of data to be processed one by one as current data, determining the offset of prefix fragments recorded by the current data, acquiring the prefix fragments of the current data by gradually increasing characters starting from the offset recorded by the current data, acquiring all data containing the same prefix fragments from the sequence of data to be processed by using the current prefix fragment acquired at each time, creating a key value as an index of the current prefix fragment until all the prefix fragments of the current data are completely acquired, and creating key values as indexes of corresponding prefix fragments. Compared with the prior art, the creating method can lower the internal memory usage amount during base creation, reduce I / O read-write operations, reduce the base creation time of the data index base and increase the timeliness of searching suggest service.

Description

【Technical Field】 [0001] The present invention relates to the technical field of Internet information processing, in particular to a method for establishing a data index database, a method and a device for generating search suggestions. 【Background technique】 [0002] An index is a structure for sorting the values ​​of one or more columns in a database table. The index can be used to quickly access specific information in the database table, which can realize the storage and retrieval of large amounts of data. Generally speaking, the structure of an index (or called an index zipper) mainly includes the key and index data of the index. The quality of the index structure will directly affect the retrieval speed of the search engine. According to the keywords entered by the user, the search engine finds the index key values ​​that match these keywords in the index library, and extracts the corresponding index data as related searches The result is returned to the user. [0003] Exis...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 徐旭
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products