Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Financial default probability prediction model based on LightGBM

A probabilistic prediction and model technology, applied in finance, prediction, character and pattern recognition, etc., can solve problems such as too many, slow training speed, easy to produce overfitting, etc.

Inactive Publication Date: 2020-06-19
百维金科(上海)信息科技有限公司
View PDF0 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Big data is worthy of its name. The data dimension shows explosive growth, with many dimensions and high sparsity. In risk control modeling, structured data cleaning and processing are heavy, data transformation has sparse matrix and too much loss information, feature extraction is difficult, tens of thousands of Dimensionality exceeds the range that traditional scorecard models can handle, and the requirements for machine learning algorithms are getting higher and higher. Algorithms such as LR, SVM, RF, GBDT, XGBoost, and LightGBM have emerged as the times require. XGBoost is currently the mainstream algorithm, and XGBoost On the basis of traditional Boosting, integrate the advantages of RF and GBDT, use the multi-thread parallelization of CPU, introduce regularization items, support column sampling and parallel approximate histogram algorithm, etc., but in practical applications, it also presents a large amount of calculation Huge, slow training speed, high memory usage, easy to produce overfitting and other shortcomings, which gave birth to a more advanced algorithm such as LightGBM

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Financial default probability prediction model based on LightGBM

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0018] A financial default probability prediction model based on LightGBM, combined with figure 1 As shown, its modeling includes the following steps: Step 1: Sample data acquisition, select customer samples required for modeling analysis, obtain customer application information, credit data and third-party data authorized by customers, and combine the application information, The credit data, third-party data and third-party data are analyzed and converted into on...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a financial default probability prediction model based on LightGBM. Modeling of the method comprises the steps of sample data acquisition, data preprocessing, feature engineering, data set division, model training and parameter adjustment, and model deployment and monitoring, and automatic, comprehensive and process credit risk assessment and prediction of borrowers are realized, so that credit overdue fraud risks are reduced, and improvement of the Internet financial risk control capability and healthy development are promoted. The core of the prediction model of the technical scheme is as follows. According to the method, a LightGBM model based on a histogram algorithm and a level-wise splitting strategy is utilized; the method is advantaged in that internet financial mass data with many abnormal values, high missing values and wide dimensions can be optimized and rapidly processed, a big data processing capability is realized, because of excellence of the algorithm, reliability, flexibility and expandability of the model are all improved, and the method is more suitable for present big data risk control demands.

Description

technical field [0001] The invention relates to the technical field of Internet financial risk control, in particular to a financial default probability prediction model based on LightGBM. Background technique [0002] In recent years, Internet finance represented by P2P lending and consumer finance has developed rapidly. Most Internet finance companies based on logistic regression (LR), support vector machine (SVM), random forest (RF), gradient boosting decision tree (GBDT), Algorithms such as extreme gradient boosting tree (XGBoost) are used for risk control modeling. Generally, the risk control model mainly adopts WOE conversion, and then uses logistic regression model for fitting to construct a credit score card. This method is more effective than traditional financial industries in Internet finance. The effect has declined. With the development of big data, Internet financial risk control and machine learning complement each other. It is very meaningful to use more adv...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06Q40/02G06Q10/04G06K9/62
CPCG06Q10/04G06Q40/03G06F18/214G06F18/24323
Inventor 江远强
Owner 百维金科(上海)信息科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products