Heterogeneous data table merging method and system thereof

A data table, heterogeneous technology, applied in the field of communication, can solve the problem of data merging that cannot be used in heterogeneous database systems.

Inactive Publication Date: 2010-08-11
CHINA MOBILE COMM GRP CO LTD
View PDF0 Cites 54 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the data may come from different database systems, and the existing Map / Reduce (mapping / simplification) mechanism cannot use the traditional data merging method of heterogeneous database systems for data merging, there is an urgent need for a Map / Reduce Mechanism's Data Merging Method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Heterogeneous data table merging method and system thereof
  • Heterogeneous data table merging method and system thereof
  • Heterogeneous data table merging method and system thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] Embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0018] see figure 1 , is a schematic diagram of the data table merging process provided by the embodiment of the present invention, and the process includes steps:

[0019] Step 101. Assign table IDs to multiple heterogeneous data tables to be merged, and add the assigned table IDs to all data records in the corresponding data tables.

[0020] In this step, taking two heterogeneous data tables (Table 1 and Table 2) as an example, the table identifier assigned to Table 1 is flag1, and the table identifier assigned to Table 2 is flag2, and flag1 is added to Table 1. In all data records, add flag2 to all data records in table 2. The operation of adding a table identifier can be realized by adding a table identifier field in the data table and writing the corresponding table identifier in the table identifier field.

[0021] Step 102, according to the s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a heterogeneous data table merging method and a system thereof. The method comprises the following steps: respectively allocating table marks for a plurality of heterogeneous data tables; adding the table marks into all data records in the corresponding data tables; merging the data records with the same keyword field values and different table marks into a novel data record according to the set keyword fields; deleting the table marks in the novel data record; and storing the data record with the deleted table mark into a novel data table. The invention can be used for realizing the data merging processing of the heterogeneous data table, and can improve the efficiency of the data merging operation.

Description

technical field [0001] The invention relates to data mining technology in the communication field, in particular to a method and system for merging heterogeneous data tables. Background technique [0002] Data mining is the process of extracting hidden, unknown but potentially useful information and knowledge from a large number of incomplete, noisy, fuzzy, and random practical application data. [0003] The existing data mining process includes three main steps: data preprocessing (ETL), data mining algorithm implementation, and result display. Among them, through the ETL step, the source data can be preprocessed to obtain the data to be mined; through the data mining algorithm implementation step, the data mining algorithm that meets the business needs can be realized to obtain the analysis results; through the result display step, the data mining algorithm can be The results are displayed to the user. [0004] ETL operations include merging heterogeneous data tables. S...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 高丹邓超徐萌罗治国周文辉何清谭庆马旭东郑诗豪沈亚飞陈磊
Owner CHINA MOBILE COMM GRP CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products