Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

System and method for creating index database

A technology for creating indexes and index libraries, applied in special data processing applications, instruments, electronic digital data processing, etc., can solve the problems of index record generation, low writing efficiency, complex system architecture, etc. The effect of efficiency

Inactive Publication Date: 2010-06-09
ZTE CORP
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The technical problem to be solved by the present invention is to provide a system and method for creating an index library, which is used to solve the problems in the prior art that the generation and writing efficiency of index records is low and the system architecture is relatively complicated

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • System and method for creating index database
  • System and method for creating index database
  • System and method for creating index database

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The technical solutions of the present invention will be further described in more detail in conjunction with the accompanying drawings and specific embodiments.

[0035] Such as figure 1 Shown is a schematic structural diagram of the index creation system of the present invention. The index creation system is a system that realizes rapid creation of an index library under a large amount of data in a parallel manner, wherein, part of the data sources 21 in the virtual frame are external modules of the system, and others are internal modules of the system. The index creation system 100 includes: a scheduling module 11 , a crawling module 12, a preprocessing module 13, an index generating module 14, and an index library merging module 15.

[0036] The scheduling module 11 is configured to work according to set policies or events, and is responsible for scheduling control of the crawling module 12 and the index generating module 14 .

[0037] Specifically, the scheduling...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a system for establishing an index database and a method. The method consists of the following steps: step 1, extracting text information for preprocessing from a data source and acquiring preprocessed text information; step 2, establishing a plurality of temporary subindex databases and writing index records generated according to the preprocessed text information in a plurality of temporary subindex databases; step 3, combining and processing the index records in a plurality of temporary subindex databases to generate a single target index database. The invention takes full advantage of processing capacity of multiple CPUs in a server and improves the efficiency of index record generation and writing without adding complexity of the system and changing the formatof the original index database at the same time.

Description

technical field [0001] The invention relates to the field of search engines, in particular to a system and method for creating an index database. Background technique [0002] A search engine system is a network application system that can receive query phrases or expressions submitted by users through browsers or other clients, return a list of information that matches the user's query within an acceptable time, and help users Get information for a list guide. In addition to document retrieval in traditional libraries, search engines have been widely used in the current Internet search field, enterprise information retrieval and service fields. [0003] The search engine system mainly includes two subsystems of retrieval and index creation. The index creation subsystem usually includes a crawling module, a preprocessing module, and an index generation and maintenance module. The crawling module in the current search engine system extracts and collects information from va...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 游波李英
Owner ZTE CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products