Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Identification method for potential customers of auto industry based on tri-training

A customer identification and potential technology, which is applied in the field of potential customer identification in the automotive industry, can solve the problems of slow manual identification, poor model generalization ability, and high capital cost, so as to save labor costs and capital costs, improve prediction accuracy, and information Take advantage of the full effect

Active Publication Date: 2018-07-06
成都达拓智通科技有限公司
View PDF9 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] Method 1: It is necessary to designate an experienced salesperson to score each piece of data. Usually, the amount of data of potential customers is very large and is generated every day. Although manual recognition is more accurate than machines to a certain extent, artificial The speed of recognition is very slow, and it may only take a few minutes to use the model to make predictions, while manual recognition may take a day or even a few days, so this method is inefficient and increases labor costs
[0008] Method 2: Hand over the potential customer data to a third-party data company for identification. Since the third-party data companies in the market are uneven, the accuracy of the identification results is difficult to guarantee. At the same time, this kind of customer identification is a long-term demand, so In the long run, the capital cost of seeking a third party is high, and it is difficult to ensure that the third party can fully abide by the data confidentiality treaty, so there is a risk of data leakage
[0009] Method 3: Use a commonly used supervised model to learn the model from the customer data of the purchased car, but in practice, there is a small amount of labeled data, so the model trained with a small amount of data cannot fully grasp the inherent laws of data generation. Therefore, the generalization ability of the model is poor, that is, the prediction accuracy for new data is not high, and it also wastes a lot of information that exists in unlabeled data.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Identification method for potential customers of auto industry based on tri-training

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0039] A method for identifying potential customers in the automotive industry based on tri-training, comprising the following steps:

[0040] Step (1): data preprocessing, including the following steps:

[0041] Step (1.1): Abnormal value processing: For each attribute, check whether there is an abnormal attribute value in all data, and delete it if so;

[0042] Step (1.2): Missing value processing: For categorical attributes, the missing value is regarded as a new type; for continuous attributes, an attribute containing missing values ​​is regarded as the dependent variable Y, and other attributes without missing values ​​are regarded as independent variables X, and then use the samples without missing values ​​in Y and the corresponding samples in X as the training set, select the random forest model to train on the training set, and use the trained model to predict the missing values ​​in Y, so as to Fill in the missing values ​​in the original data; do the above processi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an identification method for potential customers of an auto industry based on tri-training. The method comprises the following steps of data preprocessing and data modeling, wherein the data preprocessing comprises abnormal value processing, missing value processing, classification attribute processing, combination feature generation, feature selection and data normalization; the data modeling adopts a tri-training cooperative training algorithm in semi-supervised learning and selects a BP neural network as a base learner in the process of cooperative training. The historical selling data (customer data with category labels) of an automobile brand dealer and the collected potential customer data (customer data without labels) are utilized to construct a semi-supervised customer identification model, so that accurate marketing targets are provided for accurate marketing of the auto industry, the manpower cost and cost of funds are saved.

Description

technical field [0001] The invention relates to a method for identifying potential customers, in particular to a method for identifying potential customers in the automotive industry based on tri-training. Background technique [0002] With the vigorous development and universal access of the Internet, people's behaviors on the Internet have generated a large amount of data. The demand for storage, processing, and analysis of these data has driven the development of databases, cloud computing and other related technologies. These data collection, storage The development of processing technology has in turn promoted the analysis and application of massive data by enterprises, and promoted the development of big data. At present, more and more enterprises are beginning to pay attention to the accumulation, application and realization of data assets. [0003] In the automobile industry, dealers of various automobile brands can obtain clues of potential consumers who are concer...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30G06K9/62G06N3/04G06N3/08G06Q30/02
CPCG06F16/215G06F16/285G06N3/084G06Q30/0201G06N3/045G06F18/24323Y02P90/30
Inventor 姚黎明李晓非张胤
Owner 成都达拓智通科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products