A method and apparatus for training a model based on a gradient lifting decision tree

A decision tree and gradient technology, applied in the information field, can solve problems such as inability to train qualified ones and less data

Inactive Publication Date: 2019-03-15
ADVANCED NEW TECH CO LTD
View PDF4 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to solve the problem that a qualified model cannot be trained due to the lack of accumulated data in some special business scenarios, the embodiment of this specification provides a model training method and device based on a gradient boosting decision tree. The technical solution is as follows:

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and apparatus for training a model based on a gradient lifting decision tree
  • A method and apparatus for training a model based on a gradient lifting decision tree
  • A method and apparatus for training a model based on a gradient lifting decision tree

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The invention draws on the transfer learning idea in the field of machine learning technology. When faced with the need to train a model applied to the target business scenario, if the data accumulated in the target business scenario is insufficient, then the data accumulated in the business scenario similar to the target business scenario can be used for model training.

[0038] Specifically, the present invention combines the migration learning idea with the gradient boosting decision tree (Gradient Boosting Decision Tree, GBDT) algorithm to improve the GBDT algorithm flow. In the embodiment of this specification, for a GBDT algorithm process, first use data generated in a business scenario similar to the target business scenario for training, and after meeting certain training suspension conditions, suspend training and calculate the current training residual; then , Use the data generated in the target business scenario, and continue training based on the training resi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a model training method and a device based on a gradient lifting decision tree. A GBDT algorithm is divided into two phases. In the former phase, several decision trees are trained by obtaining labeled samples from the data domain of the business scenarios similar to the target business scenarios, and the training residuals generated after the training in the previous phaseare determined. At a later stage, that annotated sample are obtained from the data domain of the target business scenario, and a number of decision trees are continue to be trained based on the training residuals. Finally, the model applied to the target business scenario is actually integrated from the decision tree trained in the previous stage and the decision tree trained in the next stage.

Description

Technical field [0001] The embodiments of this specification relate to the field of information technology, and in particular to a model training method and device based on a gradient boosting decision tree. Background technique [0002] As we all know, when it is necessary to train a predictive model applied to a certain business scenario, it is usually necessary to obtain a large amount of data from the data domain of the business scenario for labeling, as a labeled sample, for model training. If the number of labeled samples is small, a qualified model cannot usually be obtained. It should be noted that the data domain of a certain business scenario is actually a collection of business data generated based on the business scenario. [0003] However, in practice, less data is accumulated in certain special business scenarios. As a result, when it is necessary to train a model applied to a certain special business scenario, it is impossible to obtain enough labeled samples from ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/2155G06F18/214G06N20/20G06N5/01G06F18/2148G06N20/00G06N20/10G06N5/02G06N5/025G06N5/027
Inventor 陈超超周俊
Owner ADVANCED NEW TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products