A small-scale indexed data storage method for offline search

A small-scale technology for indexing data, applied in network data indexing, network data retrieval, and other database retrieval directions, it can solve the problems of complex retrieval process, poor user experience, limited data volume, etc., and achieve fast query speed, compact data, simple structure

Active Publication Date: 2018-12-14
HOHAI UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For small-scale data such as corporate websites or general technical documents, etc., the amount of data is limited, and the cost of using traditional online full-text retrieval methods is high, especially the retrieval process is complex, greatly affected by the network environment, and the user experience is poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A small-scale indexed data storage method for offline search

Examples

Experimental program
Comparison scheme
Effect test

Embodiment example

[0025] (9) Implementation example: We applied the method of the present invention to an online document help system of a software, and achieved good results. The online document help system has fewer files, more than 100 files. The offline index file after collection (first compression) is about 18KB, and after the second compression (gzip compression), it is only 7KB. The web client obtains the After indexing data, you can do offline search completely on the web client without interacting with the server at all. The average query time is milliseconds, and further subdivided, the individual query time is less than 1 millisecond, and the total time for displaying the results is also less than 1 millisecond. More than 100 milliseconds, that is, within 0.1 second, greatly improves the user experience.

[0026] The invention provides a small-scale index data storage method for off-line search. The index data storage method provided has the characteristics of simple structure, comp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a small-scale index data storage method for off-line searching. The method comprises the following steps: numbering data contents according to the sequence from 0, and storing titles into an array; splitting keywords of the data contents one by one; storing the split keywords into an associative array structure one by one, wherein keys of the associative array are the keywords per se, the value of the associative array is a large binary number, if the bit n is 1, the keywords exist in the nth piece of webpage or document, and if the bit n is 0, the keywords do not exist; after finishing analyzing all contents, conducting content compression on the associative array, namely compressing the value of the associative array, and conducting first compression on the continuous same bits by adopting a stroke compression method; conducting serialization output on the associative array into character strings, and conducting recompression on the character strings. The provided index data storage method has the characteristics of being simple in structure, compact in data, quick in query speed and friendly to combination query computation.

Description

technical field [0001] The invention belongs to the field of data storage, in particular to a small-scale index data storage method for off-line search. Background technique [0002] Full-text search is a common demand, whether it is for websites or documents, full-text search is a very convenient search method. The traditional full-text search is to build an index structure on the server side, input query commands on the client side, that is, the Web side, the server side accepts the query commands, completes the search task on the server side, and then returns the results to the client side, such as Google, Baidu, etc. and other search engines. For small-scale data such as corporate websites or general technical documents, the amount of data is limited, and the cost of using traditional online full-text retrieval methods is high, especially the retrieval process is complex, greatly affected by the network environment, and user experience is poor. Contents of the inventi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06F16/316G06F16/951
Inventor 许军才张卫东赖金辉任青文沈振中
Owner HOHAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products