Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Modeling variable selection method based on correlation and principal component analysis

A technique of principal component analysis and principal component analysis, applied in character and pattern recognition, special data processing applications, instruments, etc., can solve problems such as redundancy, failure to reflect system output, irrelevance, etc.

Inactive Publication Date: 2020-07-17
JIANGSU FRONTIER ELECTRIC TECH
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the complexity of the object, there are often a large number of candidate variables that may be related to the target variable, and there are correlations and couplings between some of the candidate variables, and some of the candidate variables cannot reflect the system output, and there are noises and irrelevance. or redundant

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Modeling variable selection method based on correlation and principal component analysis
  • Modeling variable selection method based on correlation and principal component analysis
  • Modeling variable selection method based on correlation and principal component analysis

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The present invention will be further described below in conjunction with the accompanying drawings and specific preferred embodiments.

[0037] A modeling variable selection method based on correlation and principal component analysis, characterized in that: using the information entropy theory to calculate the correlation information coefficient between influencing factors, eliminating redundant variables, and then using principal component analysis to extract the remaining variable principal components, To reduce the number of modeling variables, the specific steps are as follows:

[0038] A modeling variable selection method based on correlation and principal component analysis, characterized in that: using the information entropy theory to calculate the correlation information coefficient between influencing factors, eliminating redundant variables, and then using principal component analysis to extract the remaining variable principal components, To reduce the num...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a modeling variable selection method based on correlation and principal component analysis. An information entropy theory is used for calculating correlation information coefficients among influence factors, redundant variables are removed, then a principal component analysis method is used for extracting residual variable principal components, the number of modeling variables is reduced, and the method comprises the specific steps that 1, the information entropy theory is used for calculating the correlation information coefficients between to-be-selected variables and target variables; step 2, rejecting to-be-selected variables with relatively small associated information coefficients; 3, extracting residual variable principal components to be selected by utilizing a principal component analysis method; and step 4, obtaining a final modeling variable. According to the method, the number of modeling variables can be reduced on the premise that modeling variable information is reserved as much as possible, and help is provided for establishing an object model.

Description

technical field [0001] The invention relates to the technical field of data-driven modeling, in particular to a modeling variable selection method based on correlation and principal component analysis. Background technique [0002] In recent years, with the continuous development of computer and database technology, data-driven modeling technology has attracted more and more attention. However, due to the complexity of the object, there are often a large number of candidate variables that may be related to the target variable. Among them, there are correlations and couplings among some of the candidate variables, and some of the candidate variables cannot reflect the system output, and there are noises and irrelevance. or redundancy. If all possible variables to be selected are included in the model, it will greatly increase the modeling time and the uncertainty of the model, weaken the generalization ability of the model, and reduce the accuracy of the model. Therefore, in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/21G06K9/62
CPCG06F16/212G06F18/2135
Inventor 李逗孙栓柱孙和泰周春蕾王林孙彬黄翔李春岩杨晨琛潘苗
Owner JIANGSU FRONTIER ELECTRIC TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products