Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Multi-source heterogeneous data quality detection method based on petri network

A multi-source heterogeneous data and quality detection method technology, applied in the direction of structured data retrieval, structured data browsing, digital data information retrieval, etc., to achieve the effect of enhancing the value of data

Active Publication Date: 2021-03-23
中科大数据研究院
View PDF14 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] Aiming at the defects and problems existing in existing equipment, the present invention provides a multi-source heterogeneous data quality detection method based on petri net, which effectively solves the problem that the existing data quality detection often only targets a single data source and a single data format. , lack of system-wide considerations, interpretable results, and cannot be used in the current big data environment

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-source heterogeneous data quality detection method based on petri network
  • Multi-source heterogeneous data quality detection method based on petri network
  • Multi-source heterogeneous data quality detection method based on petri network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0031] Embodiment 1: This embodiment aims to provide a multi-source heterogeneous data detection method based on petri nets. For existing data quality detection, it is often only for a single data source and a single data format. The quality detection lacks system-wide considerations and results. It can be explained, but it is not suitable for the current big data environment. This embodiment is a system based on the petri net model and suitable for multi-source heterogeneous data detection.

[0032]In this embodiment, a petri net-based multi-source heterogeneous data detection system includes a data quality rule library module, a data quality task management module, a data quality operation module, a scheduling plan management module, a data quality analysis module and a data quality knowledge base module ; The data quality rule library module is used as a collection of data quality rules to provide quality rules for quality inspection; the data quality task management module ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a petri network-based multi-source heterogeneous data quality detection method, which can use a scheduling plan module to configure data quality jobs and then use a message feedback mechanism of a petri network to form a table and chart form quality analysis report for each time of quality jobs and finally accumulate data quality detection problems. A data quality problem knowledge base is formed, sustainable development opinions and suggestions are provided for subsequent data quality detection, it is guaranteed that all data sources are independent of one another, SQLforms are unified, and the problem that multi-source heterogeneous data needs a specific query method for different data sources is solved. A graphical and draggable quality task and job management method is provided for a user, a quality detection scheduling plan can be configured, a data quality analysis result is generated for the user to check, and a data quality knowledge base is establishedto improve the capability of solving quality problems. Effective support is provided for system data quality improvement, so that the data value is improved.

Description

technical field [0001] The invention belongs to the technical fields of big data, data governance and data analysis, and in particular relates to a petri-net-based multi-source heterogeneous data quality detection method. Background technique [0002] The information age has now transitioned to the era of big data, artificial intelligence, and the Internet of Things. In the fields of big data analysis, data warehouse construction, machine learning, neural networks, and Internet of Things data transmission, data is the cornerstone of all of these. Data quality is crucial in the use of data. Only with quality-assured data can algorithms get closer to the correct answer. [0003] In the context of big data, data governance has the problems of huge data volume, various data sources, and various data representation methods. With the introduction of the concept of Data Lake, big data governance does not require prior structured processing of data, and data is stored in relational...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/215G06F16/2455G06F16/26G06F16/28
CPCG06F16/215G06F16/2455G06F16/26G06F16/28G06F16/284
Inventor 贵恒冯凯王元卓王洪显
Owner 中科大数据研究院
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products