A big data fusion method, system and device

A data fusion technology and method, applied in character and pattern recognition, instruments, computing, and related fields, which addresses problems such as the inability to mine complex entity concepts.

Active Publication Date: 2021-03-30
赵淦森

AI Technical Summary

Problems solved by technology

However, existing methods and models of this kind have the following deficiency: identification based on data records (tuples) can only identify the entity concepts in the overlapping parts between tables, and cannot mine more complex entity concepts from the fused data.



Examples


Embodiment 1

[0114] Referring to Figure 2, the circles represent nodes, each of which stands for an entity concept in the business logic diagram or a data table in the database. The edges linking nodes represent relationships: for example, edges a and b represent relationships between entity concepts, while edges u1 and u2 represent relationships between two data tables in the original data structure diagram. Some of these link relationships already exist in the original data structure diagram or the business logic diagram before data fusion. Sub-graph P1 is a business logic diagram, and sub-graph P2 is a data structure diagram.

[0115] To make the relationships between entity concepts in the business logic correspond to the relationships in the original data structure diagram, simple entity concept recognition requires constructing a mapping f, which maps each entity concept in the business to the data table in the origi...
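The idea of a mapping f between the two sub-graphs can be illustrated with a minimal Python sketch. All concept names, table names, and the helper `map_business_edges` are hypothetical and chosen only for illustration; the edge labels a, b, u1 and u2 follow Figure 2 as described above. The sketch also assumes table relationships can be looked up in either direction, which the source does not state.

```python
# Hypothetical toy data; none of these concept or table names come from the patent.
# Business logic graph (P1): entity concepts joined by relationships a and b.
business_edges = {("Customer", "Order"): "a", ("Order", "Product"): "b"}

# Data structure graph (P2): data tables joined by relationships u1 and u2.
table_edges = {("t_customer", "t_order"): "u1", ("t_order", "t_product"): "u2"}

# Mapping f: entity concept -> data table that describes it.
f = {"Customer": "t_customer", "Order": "t_order", "Product": "t_product"}


def map_business_edges(business_edges, table_edges, f):
    """For each business relationship, find the table relationship it maps to under f."""
    result = {}
    for (src, dst), label in business_edges.items():
        key = (f[src], f[dst])
        # Assumption: table relationships are undirected, so try both orientations.
        result[label] = table_edges.get(key) or table_edges.get((key[1], key[0]))
    return result


correspondence = map_business_edges(business_edges, table_edges, f)
print(correspondence)
```

In this toy case each business relationship lines up with exactly one table relationship; a relationship with no counterpart in the data structure graph would map to `None`.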

Embodiment 2

[0118] The data fusion method in this embodiment includes the following steps:

[0119] Simple entity concept recognition stage: reconstruct the data according to the mapping relationship R between entity concepts and the original data, the original data structure graph Gs, and the business logic graph Gb to obtain the data fusion graph Gfusion;

[0120] Complex entity concept recognition stage: identify the complex entity concepts in the data fusion graph using the central connected subgraph method, obtaining the complex entity concepts and the set GcomplexE of data structure graphs that describe them.
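The text visible here does not define the central connected subgraph method, so the following Python sketch shows only one plausible building block: collecting the connected subgraph that contains a chosen center node. The function `connected_subgraph`, the adjacency-list representation, and the example node names are all assumptions made for illustration and should not be read as the patented procedure.

```python
from collections import deque


def connected_subgraph(adjacency, center):
    """Return the nodes and undirected edges of the connected subgraph containing `center`."""
    seen = {center}
    queue = deque([center])
    while queue:
        node = queue.popleft()
        for neighbor in adjacency.get(node, ()):
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append(neighbor)
    # Keep only edges whose two endpoints were both reached from the center.
    edges = {
        frozenset((u, v))
        for u in seen
        for v in adjacency.get(u, ())
        if v in seen
    }
    return seen, edges


# Hypothetical fusion graph with two disconnected parts.
fusion_graph = {
    "Order": ["Customer", "Product"],
    "Customer": ["Order"],
    "Product": ["Order"],
    "Supplier": ["Warehouse"],
    "Warehouse": ["Supplier"],
}

nodes, edges = connected_subgraph(fusion_graph, "Order")
print(sorted(nodes))
```

Starting from "Order", the sketch gathers only the nodes linked to it and leaves the unrelated "Supplier" and "Warehouse" part of the graph untouched.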

[0121] Referring to Figure 3, the simple entity concept recognition stage includes the following steps:

[0122] A1. According to the mapping relationship R between entity concepts and the original data, link the data in the original data structure diagram Data that describe the same entity concept to obtain the entity concept table;

[0123] According to the mapping ...
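Step A1 above can be pictured as a grouping operation. The Python sketch below assumes, purely for illustration, that the mapping relationship R is a dictionary from (table, column) pairs to entity concept names; the patent text shown here does not specify the form of R, and the table and column names are hypothetical.

```python
from collections import defaultdict

# Hypothetical mapping R: (table, column) in the original data -> entity concept.
R = {
    ("t_customer", "name"): "Customer",
    ("t_order", "buyer_name"): "Customer",
    ("t_order", "order_id"): "Order",
}


def build_entity_concept_table(R):
    """Group the data sources that describe the same entity concept."""
    table = defaultdict(list)
    for source, concept in R.items():
        table[concept].append(source)
    return dict(table)


entity_concept_table = build_entity_concept_table(R)
print(entity_concept_table)
```

Here the two columns that both describe "Customer" end up linked under a single entry, which is the effect step A1 describes.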


Abstract

The invention discloses a big data fusion method, system and device. The method includes the following steps: constructing a data fusion graph according to the mapping relationship between entity concepts and the original data, the original data structure graph, and the business logic graph; and identifying the complex entity concepts in the data fusion graph using the central connected subgraph method, obtaining the complex entity concepts and the set of data structure graphs that describe them. The system includes a data reconstruction module and a complex entity concept recognition module; the device includes a memory and a processor. The invention takes the graph as its core and adds a step that analyzes the data fusion graph with the central connected subgraph method, so that the data fusion method can mine complex entity concepts, and the latent data structures describing them, from the data fusion graph. This overcomes the inability of existing technology to mine more complex entity concepts. The invention can be widely used in the field of data mining.

Description

Technical field

[0001] The invention relates to the field of data mining, in particular to a big data fusion method, system and device.

Background technique

[0002] Glossary:

[0003] DFS algorithm: Depth-first search (DFS) is a graph algorithm. Briefly, it goes as deep as possible along each branch until it can go no further, and each node is visited only once. To traverse a graph, DFS starts from some vertex v in the graph:

[0004] (1) Visit vertex v;

[0005] (2) Starting in turn from each unvisited vertex adjacent to v, perform a depth-first traversal of the graph until every vertex connected to v by a path has been visited;

[0006] (3) If there are still vertices in the graph that have not been visited at this time, start from an unvisited vertex and perform depth-first traversal again until all vertices in the graph have...
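The three DFS steps in the glossary can be sketched in Python as follows. The adjacency-list representation, the function name `dfs_traverse`, and the example graph are illustrative assumptions and do not appear in the patent.

```python
def dfs_traverse(graph):
    """Visit every vertex of `graph` depth-first and return the visiting order."""
    order = []
    seen = set()

    def visit(v):
        # Step (1): visit vertex v.
        seen.add(v)
        order.append(v)
        # Step (2): recurse from each unvisited vertex adjacent to v.
        for w in graph.get(v, ()):
            if w not in seen:
                visit(w)

    # Step (3): restart from any vertex that is still unvisited.
    for v in graph:
        if v not in seen:
            visit(v)
    return order


# Hypothetical graph with two separate parts: {v, a, b, c} and {x, y}.
graph = {"v": ["a", "b"], "a": ["c"], "b": [], "c": [], "x": ["y"], "y": []}
order = dfs_traverse(graph)
print(order)
```

The recursive form keeps the sketch close to the wording of the steps; for very deep graphs an explicit stack would avoid Python's recursion limit.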


Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06K9/62
CPC: G06F18/25
Inventor: 赵淦森, 廖智锐, 王欣明, 庄序填, 席云, 伍昱燊, 余达明
Owner: 赵淦森