Multi-source multi-modal data preprocessing method and system for shared learning

A data preprocessing and multi-source data technology, applied in the field of data processing, can solve problems such as ignorance, user information leakage, easy exposure of data privacy, etc., and achieve the effects of strong security, guaranteed data privacy, and high execution efficiency

Active Publication Date: 2021-01-05
ZHEJIANG UNIV
View PDF6 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although these technologies can solve the problem of data preprocessing in the process of multi-source data joint modeling to a certain extent, each has certain shortcomings: some patents do not use encryption technology to easily expose data privacy, and some patents only use sample alignment instead of Other steps in the preprocessing link such as data screening are ignored, and some encryption techniques are time-consuming, which makes practical application difficult
Document 2 reduces the complexity of model calculation and the overhead of communication between participants, but its data preprocessing process is relatively simple, and the sample alignment process does not mention encryption technology, so it may cause user information leakage

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-source multi-modal data preprocessing method and system for shared learning
  • Multi-source multi-modal data preprocessing method and system for shared learning
  • Multi-source multi-modal data preprocessing method and system for shared learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] First of all, it should be explained that the present invention relates to data processing technology, which is an application of computer technology in the field of big data. During the implementation of the present invention, the application of multiple software function modules will be involved. The applicant believes that, after carefully reading the application documents and accurately understanding the realization principle and purpose of the present invention, combined with existing known technologies, those skilled in the art can fully implement the present invention by using their software programming skills. The foregoing software functional modules include but are not limited to: data communication subsystem, data encryption subsystem, data alignment subsystem, data filtering subsystem, network transport layer, data nodes, alignment nodes, filter nodes, and so on. Everything mentioned in the application documents of the present invention belongs to this categ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a data processing technology, and aims to provide a multi-source multi-modal data preprocessing method and system for shared learning. The method comprises the following steps: performing data communication across servers; encrypting multi-source data; aligning privacy protection samples; and filtering the multi-source data to form final unified data. According to the invention, a preprocessing stage before training of the shared learning model is abstracted into a system, and a preprocessing process is completed step by step by utilizing a plurality of subsystems, sothat cross-server data communication, multi-source data encryption, privacy protection sample alignment and multi-source data filtering are realized, the shared learning system preprocessing stage issystematic and more specific and complete, the multi-source data communication, encryption, alignment and filtering method is provided, the execution efficiency is higher, the security is higher, andthe data privacy can be ensured; and the method is independent of a specific shared learning task, and can be integrated in any multi-source data processing task as a single module.

Description

technical field [0001] The invention relates to data processing technology, in particular to a multi-source and multi-modal data preprocessing method and system for shared learning. Background technique [0002] With the rise of artificial intelligence algorithms, data privacy issues are gradually being mentioned, and countries around the world are establishing and improving relevant laws to protect data security and privacy. In this context, the shared learning technology based on the principles of cryptography has emerged. It mainly guarantees the security of data in the case of multi-source data joint modeling through security proof. [0003] Artificial intelligence is driven by big data. AlphaGO used a total of 300,000 game data to achieve such results. So people began to expect such big data-driven AI to be applied in all aspects of life. However, the actual situation may be pessimistic: due to the limitations of various external and internal conditions, it is quite ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06N20/00G06F9/54G06F21/60
CPCG06N20/00G06F21/602G06F9/547
Inventor 郑小林应森辞吴锐
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products