System and method for integrated searching of structured data and unstructured data

A retrieval technology for structured and unstructured data, applied in the information technology field, that improves database retrieval speed and overall performance.

Status: Inactive
Publication Date: 2013-11-27
南京烽火星空通信发展有限公司
Cites: 1; Cited by: 17

AI Technical Summary

Problems solved by technology

[0007] The technical problem addressed by this patent application is to provide a method for integrated retrieval of structured and unstructured data in a database, and to solve the low efficiency of retrieving unstructured fields in current databases.



Embodiment Construction

[0030] In the system for integrated retrieval of structured and unstructured data described in this patent application, the module layout of the storage part is shown in Figure 1. It consists of a data parsing module, a raw data import module, a structured field retrieval module, an unstructured field complete segmentation module, and an inverted index building module. The module layout of the retrieval part is shown in Figure 2. It consists of a request parsing module, a structured retrieval module, an unstructured retrieval module, an index merge module, and a raw data query module.

[0031] When the original data is stored in the database at ingestion time, different types of indexes are built on its index fields according to the scenario: B+ tree indexes for structured fields and inverted indexes for unstructured fields. The retrieval speed of this field is impro...
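The indexing step in [0031] can be sketched as follows. This is a minimal illustration, not the patented implementation: a sorted list searched with `bisect` stands in for the B+ tree, a simple regex tokenizer stands in for the segmentation module, and the record fields (`id`, `age`, `note`) and all function names are hypothetical.

```python
import bisect
import re
from collections import defaultdict

# Hypothetical records: "age" is a structured field, "note" is unstructured text.
ROWS = [
    {"id": 1, "age": 30, "note": "network outage reported in Nanjing"},
    {"id": 2, "age": 25, "note": "database index rebuilt after outage"},
    {"id": 3, "age": 30, "note": "routine database backup"},
]

def build_sorted_index(rows, field):
    """Stand-in for a B+ tree: (value, row_id) pairs kept in sorted order."""
    return sorted((row[field], row["id"]) for row in rows)

def range_lookup(index, low, high):
    """Return ids whose indexed value lies in [low, high], using binary search."""
    start = bisect.bisect_left(index, (low, float("-inf")))
    end = bisect.bisect_right(index, (high, float("inf")))
    return [row_id for _, row_id in index[start:end]]

def tokenize(text):
    """Assumed segmentation: lowercase word tokens (the source does not specify one)."""
    return re.findall(r"\w+", text.lower())

def build_inverted_index(rows, field):
    """Map each term to the set of row ids whose text contains it."""
    postings = defaultdict(set)
    for row in rows:
        for term in tokenize(row[field]):
            postings[term].add(row["id"])
    return postings

age_index = build_sorted_index(ROWS, "age")
note_index = build_inverted_index(ROWS, "note")
```

With these structures, `range_lookup(age_index, 30, 30)` answers a structured condition and `note_index["outage"]` answers a keyword condition, each without visiting every row.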


Abstract

The invention discloses a system and a method for integrated searching of structured data and unstructured data. After the original data is stored in a database, a B+ tree index is built for the structured data and an inverted index is built for the unstructured data. During searching, a structured search queries the B+ tree index, an unstructured search queries the inverted index, and a mixed structured and unstructured search queries both indexes and merges their results. Finally, the original data is fetched according to the index query results. The method avoids the low efficiency caused by full table scans during unstructured searching in current databases, preserves the performance of structured searching, and greatly improves the performance of unstructured searching in the database.
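The search flow described in the abstract can be sketched as below. The source does not say how the two result sets are merged, so this sketch assumes AND semantics (set intersection). The table contents, the pre-built `AGE_INDEX` and `TERM_INDEX` dictionaries, and the `search` function are all hypothetical stand-ins.

```python
# Hypothetical table and pre-built indexes; all names here are illustrative.
TABLE = {
    1: {"age": 30, "note": "network outage reported"},
    2: {"age": 25, "note": "index rebuilt after outage"},
    3: {"age": 30, "note": "routine backup"},
}
AGE_INDEX = {25: {2}, 30: {1, 3}}  # stands in for the B+ tree index
TERM_INDEX = {                     # inverted index: term -> row ids
    "network": {1}, "outage": {1, 2}, "reported": {1},
    "index": {2}, "rebuilt": {2}, "after": {2},
    "routine": {3}, "backup": {3},
}

def search(age=None, term=None):
    """Route each condition to its index, merge the id sets, fetch the rows."""
    id_sets = []
    if age is not None:                      # structured condition
        id_sets.append(AGE_INDEX.get(age, set()))
    if term is not None:                     # unstructured condition
        id_sets.append(TERM_INDEX.get(term.lower(), set()))
    if not id_sets:
        return []
    merged = set.intersection(*id_sets)      # assumed AND semantics
    # Only now is the original data read, and only for the matching ids.
    return [(row_id, TABLE[row_id]) for row_id in sorted(merged)]
```

A call with one argument exercises a single index, while `search(age=30, term="outage")` exercises the mixed path. An OR-style merge would use `set.union` instead.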

Description

Technical field

[0001] The application belongs to the field of information technology, and in particular relates to a system and method for structured and unstructured retrieval in a database.

[0002] Background technique

[0003] An index is a structure that sorts the values of one or more columns in a database table. Using an index can quickly access specific information in a database table, greatly improving the performance of database retrieval. Data includes two categories: structured data and unstructured data, and data in practical applications may be a mixture of structured and unstructured data. Retrieving structured data is called structured retrieval, and conversely, retrieving unstructured data is called unstructured retrieval.

[0004] The B+ tree index used in the current database index technology can index structured data, which greatly improves the query efficiency of the database. The disadvantage is that it cannot index unstructured data (such as attac...
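To illustrate the cost difference behind this background discussion, the sketch below compares a keyword search that scans every row with a lookup in a term-to-rows map. The synthetic table and the function names are assumptions made for this example only.

```python
# Hypothetical table of (row_id, text) pairs with synthetic content.
ROWS = [(i, f"message {i} about topic{i % 100}") for i in range(10_000)]

def full_scan(rows, keyword):
    """Check every row's text; returns (matching ids, rows examined)."""
    examined = 0
    hits = []
    for row_id, text in rows:
        examined += 1
        if keyword in text.split():
            hits.append(row_id)
    return hits, examined

def build_postings(rows):
    """Build a term -> [row ids] map once, at storage time."""
    postings = {}
    for row_id, text in rows:
        for term in set(text.split()):
            postings.setdefault(term, []).append(row_id)
    return postings

POSTINGS = build_postings(ROWS)

def index_lookup(keyword):
    """Read only the posting list; returns (matching ids, entries read)."""
    hits = POSTINGS.get(keyword, [])
    return hits, len(hits)

scan_hits, scan_cost = full_scan(ROWS, "topic7")
index_hits, index_cost = index_lookup("topic7")
```

Both approaches return the same ids, but the scan touches all 10,000 rows while the lookup reads only the matching entries.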


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
Inventor: 孙杰, 阎星娥, 赵万亮, 杨昆
Owner: 南京烽火星空通信发展有限公司