Multi-source heterogeneous data fusion storage

A multi-source heterogeneous data and data fusion technology, applied in the field of multi-source heterogeneous data fusion storage, can solve problems such as limiting the performance of semantic similarity and text similarity recognition, so as to ensure smooth progress, reliability and speed up , the effect of reducing storage space

Pending Publication Date: 2022-06-07
SHANDONG SHENGDA GAOCHENG MEASUREMENT & CONTROLTECH CO LTD
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Aiming at the deficiencies of the prior art, the present invention provides fusion storage of multi-source heterogeneous data, which solves the problem of using hadoop, my sql or oracle as storage in the prior art, which limits the recognition of semantic similarity and text similarity in the fusion process. performance problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-source heterogeneous data fusion storage

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0027] like figure 1 As shown, the embodiment of the present invention provides multi-source heterogeneous data fusion storage, including a HaiNaTable database management system and a storage hard disk. The HaiNaTable database management system has the functions of starting fusion, adding, modifying, primary key retrieval, and ending fusion. For the operations of adding, modifying, primary key retrieval, and ending fusion, the HaiNaTable database management system locks the storage files and index files when the fusion starts and ends. The efficiency of first-level indexing to find stored data files;

[0028] The HaiNaTable database management system is used as the entry to access the data stored in the storage hard disk. Through the HaiNaTable database management system, the data stored in the storage hard disk can be accessed, added, modified and merged. The HaiNaTable database management system stores data files as .Tdb As data storage and .TIndex as index storage, and sto...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides multi-source heterogeneous data fusion storage, and relates to the technical field of data storage. The multi-source heterogeneous data fusion storage comprises a HaiNaTable database management system and a storage hard disk, and the HaiNaTable database management system has the functions of fusion starting, newly adding, modification, primary key retrieval and fusion ending; the HaiNaTable database management system takes. Tdb as data storage and. TIndex as index storage and stores the data files into the storage hard disk in real time, and index files of the index storage are files obtained by storing feature information of the data files into Int128 through character strings generated by Md5. Data storage and index storage are carried out on the data file, so that the storage space occupied by index performance can be reduced, meanwhile, the rapid data retrieval speed can be increased, the performance of semantic similarity and text similarity recognition in the fusion process is improved through the data fusion mode recorded in the method, and the fusion efficiency is improved. And thus, smooth implementation and reliability of data fusion are ensured.

Description

technical field [0001] The invention relates to the technical field of data storage, in particular to multi-source heterogeneous data fusion storage. Background technique [0002] With the exponential growth of data in the information age, users have put forward new requirements for multi-source data fusion judgment, and it is required to quickly retrieve whether the data exists during fusion. [0003] Most of the existing solutions use distributed system infrastructure (hadoop), SQL database management system (mysql) or database management system (oracle) as storage. It limits the performance of semantic similarity and text similarity recognition in the fusion process. SUMMARY OF THE INVENTION [0004] (1) Technical problems solved [0005] In view of the deficiencies of the prior art, the present invention provides multi-source heterogeneous data fusion storage, which solves the problem of using hadoop, mysql or oracle as storage in the prior art, which limits the iden...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/2455G06F16/23G06F16/22G06F16/906G06F16/176G06K9/62
CPCG06F16/24564G06F16/23G06F16/2228G06F16/906G06F16/1774G06F18/22
Inventor 纪风超董海峰刘勇
Owner SHANDONG SHENGDA GAOCHENG MEASUREMENT & CONTROLTECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products