Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A user classification method based on multi-model fusion

A classification method, multi-model technology, applied in character and pattern recognition, instruments, computer parts, etc., can solve the problems of complex iteration, unhelpful model training, low generalization ability, etc., to reduce iterative complexity and improve The effect of recognition accuracy and strong generalization ability

Active Publication Date: 2019-06-14
成都新希望金融信息有限公司
View PDF5 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] In view of the above-mentioned research problems, the purpose of the present invention is to provide a user classification method based on multi-model fusion, which solves the problems of traditional model classification with few dimensions, low precision, complex iteration, inability to handle data sparsity well, and weak univariate discrimination. Problem; to solve the problem that the correlation between the features and labels obtained by the first-level model in the most existing multi-mode fusion is strong, which does not help the subsequent model training, and also results in low generalization ability, and is biased towards the first-level model. Weight considerations do not refer to the weight of each model, resulting in poor classification results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A user classification method based on multi-model fusion
  • A user classification method based on multi-model fusion
  • A user classification method based on multi-model fusion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0053] The XGBoost+LR fusion model processing flow includes two processes, each of which includes four processes including data preparation, feature engineering, model training, and result output, such as figure 1 As shown, for example: the data preparation (that is, data processing) step is mainly to clean and reshape the data set; the feature engineering is derived from the user's multi-dimensional features, and is selected through feature importance, and the top 100 features are respectively Pass in two XGBoost models, and after traversing 400 trees, record the number of leaf nodes of the training set samples in each tree (referring to the tuned XGBoost model, the number of training set samples in the leaf nodes of each tree), and pass One-hot encoding conversion to obtain all the LR features corresponding to the sample, then train and tune the LR model, and output the model result, which is the improved Gaussian transformation of the result. Specific steps are as follows: ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a user classification method based on multi-model fusion, belongs to the technical field of machine learning, and solves the problems of less dimension, low precision and complex iteration in traditional model classification. The method comprises the steps that a data set containing user characteristics is acquired and processed; the method is based on an XGBoost algorithmand two different Y value intersection interval settings, different derivation methods are selected to derive characteristics in the processed data set; the data set features and all the derived features are input into two XGBoost models as a new data set for training and optimization, and the leaf node number of each training set sample at each tree is output by the two XGBoost models after optimization; one-hot coding is trained according to leaf node number ser, and the leaf node number output by each XGBoost model is subjected to the trained One-hot code conversion to obtain all LR features corresponding to the training set samples, and training an optimization LR model based on the LR features corresponding to all the training set samples; and an XGBoost + LR fusion model is obtainedto classify the users. The method is used for classifying users based on multi-model fusion.

Description

technical field [0001] A user classification method based on multi-model fusion is used for classifying users based on multi-model fusion, and belongs to the technical field of machine learning. Background technique [0002] Machine learning is a multi-field interdisciplinary subject, involving probability theory, statistics, approximation theory, convex analysis, algorithm complexity theory and other disciplines. Specializes in the study of how computers simulate or implement human learning behaviors to acquire new knowledge or skills, and reorganize existing knowledge structures to continuously improve their performance. [0003] Machine learning is a subset of artificial intelligence and the core of artificial intelligence. It is the fundamental way to make computers intelligent. Its application pervades all fields of artificial intelligence. It mainly uses induction and synthesis rather than deduction. At its core, machine learning is "using algorithms to parse data, le...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
Inventor 冯诗炀程序段银春
Owner 成都新希望金融信息有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products