An improved random forest method for air quality classification

A random forest method and air quality technology, applied in the direction of measuring devices, analysis materials, computer parts, etc., can solve the problems of low classification accuracy of minority classes, misleading people's production and life, and errors in training data sets, etc., to achieve accurate imbalanced data Reliable, improve generalization ability and anti-overfitting ability, and maintain consistency

Active Publication Date: 2019-10-15
CHONGQING UNIV
View PDF6 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When it classifies unbalanced data, due to the imbalance in the number of samples of the majority class and the samples of the minority class, the classification model will be biased towards the majority class and ignore the minority class, resulting in low classification accuracy of the minority class, which will cause training problems. The error development of the data set eventually leads to wrong data classification; it affects the indirect judgment of users who use sample data, and then misleads people's production and life, which has a high cost of misclassification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An improved random forest method for air quality classification
  • An improved random forest method for air quality classification
  • An improved random forest method for air quality classification

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, a clear and complete description will be made below in conjunction with the technical solutions in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, and Not all examples. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0040] In Example 1, such as figure 2 and 3 As shown, an improved random forest method for air quality classification, an improved random forest method for air quality classification, including an air quality classification model, the air quality classification model includes an original data module, a data preprocessing module, Classification generation module, classification data module, whe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an improved random forest method for air quality classification, which comprises an air quality classification model, the air quality classification model comprises an originaldata module, a data preprocessing module, a classification generation module and a classification data module; the original data module is used for collecting original air data; the data preprocessing module is used for performing data cleaning, data integration, data conversion and other operations on the original air data; the classification generation module is used for randomly sampling the data processed by the data preprocessing module and classifying a decision tree based on a CART algorithm; the classification data module is used for receiving the classification model output by the classification generation module and outputting an air data classification result, and the random sampling comprises a self-service sampling method based on sample category grouping and a random featuresubspace method. According to the method, the classification precision of minority class samples is improved, and the overall misclassification cost of a sample set is reduced.

Description

technical field [0001] The invention relates to the technical field of air quality monitoring, in particular to an improved random forest method for air quality classification. Background technique [0002] Air pollution refers to a phenomenon in which human beings carry out production and life in the natural environment, and some pollutants are discharged into the atmosphere due to some inappropriate behaviors. When the concentration of substances reaches a certain value, it will cause harm to human health and the natural environment. The atmospheric environment is complex and changeable and has dynamic uncertain characteristics. There are many related factors that cause air pollution, such as PM(2.5), SO 2 and O 3 Such as air pollutants exceeding a certain concentration, as well as factors such as precipitation, wind direction, and humidity, all of which have a strong nonlinear relationship with the air quality in the future. More accurate air quality forecasts can help ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62G01N33/00
CPCG01N33/0031G06F18/24323G06F18/214
Inventor 熊庆宇易华玲吴丹吉皇余洋高旻王楷
Owner CHONGQING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products