Method and device for multi-party joint dimension reduction processing of private data

A dimensionality reduction technique for private data, applied in the fields of electrical digital data processing, digital data protection, computer security devices, etc.

Active Publication Date: 2020-07-10
ALIPAY (HANGZHOU) INFORMATION TECH CO LTD
7 Cites, 10 Cited by

AI Technical Summary

Problems solved by technology

Although a large amount of high-dimensional data can enrich the training samples available for machine learning, such high-dimensional data often contains redundant information.
The help of redundant information to the effect

Examples

Embodiment Construction

[0063] The solutions provided in this specification will be described below in conjunction with the accompanying drawings.

[0064] Figure 1 is a schematic diagram of an implementation scenario of an embodiment disclosed in this specification. As shown in Figure 1, in the shared learning scenario, the data set is jointly provided by multiple holders 1, 2, ..., M (M is a natural number), and each holder owns part of the data in the data set. The data set may be a training data set for training a neural network model, a test data set for testing a neural network model, or a data set to be predicted. The data set can include attribute feature data of business objects, and the business objects can be the objects to be analyzed in various businesses, such as users, merchants, commodities, and events.

[0065] At least two data distributions are possible here. In the first, each holder owns data for the same attribute items of different business objects. For exa...

Abstract

The embodiments of the invention provide a method and device for multi-party joint dimensionality reduction of private data. In the method, when the private data is vertically (longitudinally) distributed, a first holder applies zero-mean centering to a first original matrix to obtain a first center matrix, obtains an N*N asymmetric orthogonal matrix, multiplies the asymmetric orthogonal matrix by the first center matrix to obtain a first secret matrix, and sends the first secret matrix to a trusted third party. The trusted third party splices the secret matrices to obtain a global secret matrix, multiplies the global secret matrix by its transpose to obtain a covariance matrix, solves for the eigenvalues of the covariance matrix to obtain a dimension reduction transformation matrix, splits the dimension reduction transformation matrix into split matrices, and sends the split matrices to the holders. The first holder processes the first original matrix with the first split matrix to obtain a first dimension reduction matrix, which is used for business prediction analysis of the business objects by means of machine learning.
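The flow in the abstract can be illustrated with a minimal NumPy sketch. It is not the patented implementation, and it rests on several assumptions the abstract does not spell out: all holders use the same orthogonal matrix Q, the rows (business objects) are aligned across holders, Q is drawn by QR decomposition of a Gaussian matrix, and the covariance is left unnormalized. The function names `random_orthogonal`, `holder_mask`, and `third_party_reduce` are hypothetical. The sketch relies on the identity (QX)^T (QX) = X^T X for orthogonal Q, so the third party can form the covariance without seeing the centered data X.

```python
import numpy as np


def random_orthogonal(n, rng):
    """Draw an n x n orthogonal matrix (assumption: via QR of a Gaussian matrix).

    Such a matrix is generally not symmetric.
    """
    q, _ = np.linalg.qr(rng.standard_normal((n, n)))
    return q


def holder_mask(x_local, q):
    """Holder side: zero-center each column, then left-multiply by the shared Q."""
    centered = x_local - x_local.mean(axis=0)
    return q @ centered


def third_party_reduce(secret_blocks, k):
    """Third-party side: splice, form the covariance, eigen-decompose, split.

    Returns one (d_i x k) split matrix per holder, in the order received.
    """
    global_secret = np.hstack(secret_blocks)       # N x D, equals Q @ X_centered
    cov = global_secret.T @ global_secret          # D x D, unnormalized (assumption)
    eigvals, eigvecs = np.linalg.eigh(cov)         # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:k]          # indices of the k largest
    transform = eigvecs[:, order]                  # D x k transformation matrix
    widths = [block.shape[1] for block in secret_blocks]
    return np.split(transform, np.cumsum(widths)[:-1], axis=0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = 8                                          # number of business objects
    x1 = rng.standard_normal((n, 3))               # holder 1: 3 attribute columns
    x2 = rng.standard_normal((n, 2))               # holder 2: 2 attribute columns

    q = random_orthogonal(n, rng)                  # shared by both holders (assumption)
    w1, w2 = third_party_reduce([holder_mask(x1, q), holder_mask(x2, q)], k=2)

    # Each holder applies its split matrix to its own original matrix.
    reduced_1 = x1 @ w1
    reduced_2 = x2 @ w2
    print(reduced_1.shape, reduced_2.shape)
```

In this sketch each holder's result has N rows and k columns. Summing the local results gives the same projection as multiplying the spliced original matrix by the full transformation matrix, because a block-partitioned product is the sum of its block products.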

Description

Technical field

[0001] One or more embodiments of this specification relate to the field of machine learning, and in particular to a method and device for multi-party joint dimensionality reduction processing on private data.

Background

[0002] The data needed for machine learning often involves multiple platforms and fields. For example, in a merchant classification analysis scenario based on machine learning, the electronic payment platform holds the merchants' transaction flow data, the e-commerce platform stores the merchants' sales data, and the banking institution holds the merchants' loan data. Data often exists in silos. Because of industry competition, data security, user privacy, and similar issues, data integration faces great resistance. How to integrate data scattered across various platforms while ensuring that the data is not leaked has become a challenge.

[0003] On the other hand, as the amount of data increases, the d...

Application Information

IPC(8): G06K9/62, G06F21/62, G06N20/00
CPC: G06F21/6245, G06N20/00, G06F18/2135
Inventor: 刘颖婷, 陈超超, 王力, 周俊
Owner: ALIPAY (HANGZHOU) INFORMATION TECH CO LTD