Model training method and device, additional feature data obtaining method and device, equipment and medium

A technology of adding features and data, applied in the field of data science, can solve the problems of source data privacy leakage, source data security is difficult to be guaranteed, hindering the application and development of transfer learning technology, etc.

Pending Publication Date: 2020-03-03
THE FOURTH PARADIGM BEIJING TECH CO LTD
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the migration process will touch the source data, the existing migration process may cause the privacy of the source data to be leaked, making it difficult to guarantee the data security of the source data
Therefore, many data owners, such as banks, insurance, medical, financial and government departments, are unwilling to open their own data as source data, which greatly hinders the application and development of transfer learning technology

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model training method and device, additional feature data obtaining method and device, equipment and medium
  • Model training method and device, additional feature data obtaining method and device, equipment and medium
  • Model training method and device, additional feature data obtaining method and device, equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0065] Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like numerals refer to like parts throughout. The embodiments are described below in order to explain the present invention by referring to the figures.

[0066] Before describing the present invention, firstly, a brief description will be made on the nouns and concepts involved in the present invention.

[0067] Transfer learning: The goal of transfer learning is to use knowledge learned from one environment to improve the use of data in a new environment.

[0068] Source Dataset: The data source used for migration.

[0069] Source data: The data in the data source used for migration.

[0070] Target Dataset: The dataset for which transfer learning works.

[0071] Target data: The data in the dataset where transfer learning works.

[0072] target task: one or more tasks on the target data.

[0073] Common features: T...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a model training method and device, an additional feature data obtaining method and device, equipment and a medium. Obtaining a feature prediction model, the feature predictionmodel being trained based on the source data set, the feature prediction model being used for predicting at least one part of unique features of the source data based on at least one part of common features between the source data and the target data; obtaining a target data set; for each part of target data in the target data set, inputting at least one part of common features in the target datainto the feature prediction model to obtain at least one part of unique features predicted by the feature prediction model for at least one part of input common features; and taking at least one partof the predicted unique features as additional feature data of the target data. According to the method, the common features are taken as springboards, the unique features of the source data are migrated to the target data in a model migration mode, and the source data and the target data are not contacted in the process, so that the leakage risk of the source data is greatly reduced.

Description

technical field [0001] This application claims the priority of the Chinese patent application with the application number 201810929755.8, the application date is August 15, 2018, and the title is "Method, device, equipment and medium for model training and obtaining additional characteristic data". The present invention generally relates to the field of data science, and more specifically, relates to a method, device, device and medium for model training and acquisition of additional feature data. Background technique [0002] The goal of transfer learning is to transfer the knowledge acquired from the source data to the target data, so as to improve the use effect of the target data. [0003] Existing transfer learning algorithms are usually based on the circulation of data, and the source data needs to be brought into the environment of the target data during the implementation process. Since the migration process will touch the source data, the existing migration process...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06N20/00G06F21/62
CPCG06F21/6245
Inventor 李京涂威威
Owner THE FOURTH PARADIGM BEIJING TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products