Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Visualization method and device of random forest model and storage medium

A random forest model and training sample technology, applied in the field of machine learning, can solve problems such as inability to display random forest models, reduced interpretability, and difficult to understand models

Pending Publication Date: 2020-10-16
南京星云数字技术有限公司
View PDF6 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Because the random forest algorithm integrates weak classifiers into bags, which is similar to the black box algorithm to a certain extent, resulting in a greatly reduced interpretability of the output results. At present, there is no relevant technical solution to integrate the random forest classification results It is displayed through visual means, which further makes it difficult for users (for example, risk control personnel) to understand the model, and in the case of multiple variables, it is difficult to identify which variables play what role in the decision, and it is impossible to make the decision-making process of the random forest model Reach relevant audiences

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Visualization method and device of random forest model and storage medium
  • Visualization method and device of random forest model and storage medium
  • Visualization method and device of random forest model and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0060] In order to make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only Some, but not all, embodiments of the invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0061] It should be noted that, unless the context clearly requires, the words "include", "include" and other similar words in the entire specification and claims should be interpreted as an inclusive meaning rather than an exclusive or exhaustive meaning; that is, " including but not limited to ".

[0062] In addition, in the description of the present invention, it ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a visualization method and device for a random forest model and a storage medium, and relates to the technical field of machine learning, and the method comprises the steps ofscreening a target training sample meeting a preset condition from a training sample set corresponding to each decision tree of the random forest model, so as to form a target training sample set forconstructing a classification tree; obtaining the variable importance degree of each characteristic variable in each decision tree, and carrying out descending sorting on all the characteristic variables according to the variable importance degrees; according to the target training sample set and all the feature variables after descending sorting, starting from a root node of the classification tree, optimal feature variables and optimal segmentation values corresponding to all the nodes in the classification tree are sequentially determined by taking the Gini coefficient as a splitting rule,so that the classification tree is constructed; and generating a tree-shaped visual graph corresponding to the classification tree and outputting the tree-shaped visual graph. According to the invention, the decision process of the random forest model can be visually displayed, and the interpretability of the model is improved.

Description

technical field [0001] The invention relates to the technical field of machine learning, in particular to a visualization method, device and storage medium of a random forest model. Background technique [0002] The random forest algorithm is an algorithm that uses the idea of ​​ensemble learning to integrate multiple decision trees. By generating several weak classifiers—decision trees, and using a bagging algorithm that randomly selects variables and samples, the voting results of the weak classifiers are Output as the final prediction result, so that a more reasonable classification decision boundary can be obtained, the overall error can be reduced, and a better classification effect can be achieved. It is now widely used in financial risk control and other fields, and has strong prediction stability and anti-oversimulation Compatibility. [0003] In the process of realizing the present invention, the inventor finds that there are at least the following problems in the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G06F16/26
CPCG06F16/26G06F18/24323
Inventor 刘师雨
Owner 南京星云数字技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products