Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Extensible markup language (XML) data bank full-text indexing method based on integer mapping

An XML database and full-text indexing technology, applied in the field of database indexing, can solve the problems of large physical space, affecting query efficiency, and high cost of string operations, and achieve the effect of occupying less physical space and improving query efficiency

Active Publication Date: 2013-05-29
TONGFANG KNOWLEDGE NETWORK TECH CO LTD (BEIJING)
View PDF2 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] For the full-text query of the XML database, the current solutions can be divided into two types: one is to build an inverted index for the text content, and save the path information in the information corresponding to the word to meet the requirements of the structure query. Locate words based on keywords, and then judge whether the path corresponding to the word satisfies the path entered by the user. The disadvantage of this method is that when the number of words is much greater than the number of paths, the physical space occupied by the index will be too large, and Too much path information is stored repeatedly, which affects the query efficiency; one solution is to establish a path index based on the path structure information, and save the text content of the node in the path end node information located by each path. This method will cause each query String manipulation is expensive when

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Extensible markup language (XML) data bank full-text indexing method based on integer mapping
  • Extensible markup language (XML) data bank full-text indexing method based on integer mapping
  • Extensible markup language (XML) data bank full-text indexing method based on integer mapping

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] In order to make the purpose, technical solutions and advantages of the present invention clearer, the implementation of the present invention will be further described in detail below in conjunction with the accompanying drawings:

[0024] This embodiment provides a method for full-text indexing of an XML database based on integer mapping, such as figure 1 As shown, the method includes the following steps:

[0025] Step 10 establishes a full-text index to the XML document collection;

[0026] Step 20 uses the built full-text index to perform full-text search.

[0027] The above step 10 specifically includes (such as figure 2 shown):

[0028] Step 101 calculates and stores information corresponding to data nodes in the XML document;

[0029] Analyze the XML document collection, calculate the prefix code information and structure summary information corresponding to each data node, and create the structure summary tree corresponding to the XML document collection, a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an extensible markup language (XML) data bank full-text indexing method based on integer mapping. The method comprises: a, full-text index of an XML document collection is established; and b, full-text retrieval is conducted through the established full-text index. The step a specifically comprises: information corresponding to data nodes in the XML document is calculated and stored; an XPath expression input by a user is analyzed and a corresponding node set is obtained; the lasting integer mapping is established and each node in the node set is mapped on an only integer; and the full-text index is conducted on node content. The step b specifically comprises: an XML full-text inquiring sentence input by the user is analyzed; key words are inquired in relationship list full-text index and a virtual column number integer is returned; and the virtual column number integer is inquired in a tree B and matched node structure information and full-text information are returned to the user. The index offered by the indexing method is small in occupied physical space, repeatedly stored path information does not exist and inquiring efficiency is improved.

Description

technical field [0001] The invention relates to the field of database indexing, in particular to an integer mapping-based XML database full-text indexing method. Background technique [0002] XML (Extensible Markup Language), as a widely used extensible markup language, is becoming one of the standards in the field of data information management, and the database technology for XML data management is also developing rapidly. In the field of how to store and retrieve XML data, there are mainly two methods: one is to expand the corresponding functions in the relational database to make it capable of processing XML data; the other is to fully consider the characteristics of XML data and customize The Native XML Database (Native XML Database) handles XML data in a natural way. [0003] In traditional relational databases, full-text indexing and retrieval exist as indispensable functional points, and the corresponding technical solutions are relatively mature. Generally speakin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 陈琳王奎张学伟夏冬
Owner TONGFANG KNOWLEDGE NETWORK TECH CO LTD (BEIJING)
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products