Software defect prediction method based on transfer learning

A software defect prediction and transfer learning technology, applied in software testing/debugging, computer components, error detection/correction, and related fields. It addresses the problems of too few training samples, useful information being discarded, and models that are difficult to train, with the effect of achieving good defect prediction accuracy.

Active Publication Date: 2018-08-24
NANJING UNIV OF POSTS & TELECOMM

AI Technical Summary

Problems solved by technology

[0003] However, these classification techniques usually face a problem: when defects must be predicted for a new project, there are too few training samples, and it is difficult to train an accurate model.
The disadvantage of this method is that discarding dissimilar data during sample selection also discards the useful information that data contains.
However, judging from existing cross-project prediction performance, this method does not produce good prediction results.



Embodiment Construction

[0036] The technical solution of the present invention is described in further detail below with reference to the accompanying drawings:

[0037] Unlike the above methods, the present invention proposes a software defect prediction method based on the Joint Distribution Based Feature Map (JDBFM), addressing the problem that the prediction classifier must be retrained for each new project. The method uses the idea of dimensionality-reduction transfer learning: it considers both the probability distribution and the conditional probability distribution of the data samples from different projects, establishes a new feature representation shared by the source project and the target project, minimizes the difference between the two in the new space, and trains a new classifier, thereby realizing transfer learning.
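The excerpt is truncated before it states its own formula, so the following is only a sketch of how such a joint objective is commonly written when the distribution difference is measured as a distance between sample means (often called maximum mean discrepancy, MMD). All symbols are assumptions introduced here for illustration: A is the feature map to be learned, x^s and x^t are source and target samples, n_s and n_t are their counts, and D_s^(c), D_t^(c) are the samples of class c (defective or non-defective). Because the target project is unlabeled, the target class sets would have to come from pseudo-labels, which is also an assumption.

```latex
\min_{A}\;
\underbrace{\Bigl\| \frac{1}{n_s}\sum_{i=1}^{n_s} A^{\top}x_i^{s}
  - \frac{1}{n_t}\sum_{j=1}^{n_t} A^{\top}x_j^{t} \Bigr\|^{2}}_{\text{distribution difference}}
\;+\;
\underbrace{\sum_{c=1}^{C} \Bigl\| \frac{1}{n_s^{(c)}}\sum_{x_i^{s}\in D_s^{(c)}} A^{\top}x_i^{s}
  - \frac{1}{n_t^{(c)}}\sum_{x_j^{t}\in D_t^{(c)}} A^{\top}x_j^{t} \Bigr\|^{2}}_{\text{conditional distribution difference}}
```

The first term pulls the overall source and target means together in the new space; the second does the same class by class, which is what distinguishes a joint formulation from one that aligns only the overall distribution.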

[0038] Figure 1 is a flowchart of the method of the present invention, which comprises the following steps:

[0039] Step 1, use Principal Component Analysis (PCA) for data reconst...


Abstract

The invention discloses a software defect prediction method based on transfer learning. The method uses the idea of dimensionality-reduction transfer learning: it considers both the probability distribution and the conditional probability distribution of the data samples from different projects, establishes a new feature representation shared by the source project and the target project, minimizes the difference between the two in the new space, and trains a new classifier, thereby realizing transfer learning. First, a distance measure between distributions is adopted: maximum mean discrepancy (MMD) is used to quantify the distribution difference and the conditional distribution difference between the source data and the target data, and a mapping model is obtained by minimizing this metric, so that the training data and test data mapped by the model have almost the same probability distribution and conditional probability distribution. Second, the test data is classified with a traditional machine learning algorithm.
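As a minimal sketch of the two quantities the abstract describes, the Python below measures the distribution difference and the conditional distribution difference between a source and a target sample. It assumes a linear kernel, under which the squared mean-based distance reduces to the squared distance between sample means; the source does not state which kernel it uses. The function names, the toy data, and the use of pseudo-labels for the unlabeled target project are all hypothetical choices made for illustration, not the patent's implementation.

```python
from collections import defaultdict
from typing import Hashable, List, Sequence

Vector = Sequence[float]


def mean_vector(rows: Sequence[Vector]) -> List[float]:
    """Column-wise mean of a non-empty list of equal-length feature vectors."""
    n = len(rows)
    return [sum(column) / n for column in zip(*rows)]


def squared_distance(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def marginal_mmd(source: Sequence[Vector], target: Sequence[Vector]) -> float:
    """Linear-kernel estimate: squared distance between the two sample means."""
    return squared_distance(mean_vector(source), mean_vector(target))


def conditional_mmd(
    source: Sequence[Vector],
    source_labels: Sequence[Hashable],
    target: Sequence[Vector],
    target_pseudo_labels: Sequence[Hashable],
) -> float:
    """Sum of per-class mean distances over classes present in both samples.

    Target labels are assumed to be pseudo-labels (for example, predictions
    from a classifier trained on the source project).
    """
    source_by_class = defaultdict(list)
    target_by_class = defaultdict(list)
    for row, label in zip(source, source_labels):
        source_by_class[label].append(row)
    for row, label in zip(target, target_pseudo_labels):
        target_by_class[label].append(row)
    shared_classes = set(source_by_class) & set(target_by_class)
    return sum(
        marginal_mmd(source_by_class[c], target_by_class[c])
        for c in shared_classes
    )


# Toy data: two features per module, label 1 = defective, 0 = non-defective.
source = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]
source_labels = [0, 0, 1, 1]
target = [[2.0, 2.0], [4.0, 4.0], [6.0, 8.0], [8.0, 10.0]]
target_pseudo_labels = [0, 0, 1, 1]

print(marginal_mmd(source, target))
print(conditional_mmd(source, source_labels, target, target_pseudo_labels))
```

In a full method, a feature map would be chosen to make both values small after mapping, and an ordinary classifier would then be trained on the mapped source data and applied to the mapped target data.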

Description

Technical field

[0001] The invention relates to the technical field of software engineering, in particular to a software defect prediction method based on transfer learning.

Background

[0002] Over the past 30 years, software defect prediction has gradually become a significant research direction, dedicated to estimating how many defects remain in a software system. Software defect prediction can predict, promptly and accurately, whether software modules contain defects early in system development, so that test resources can be allocated sensibly and defective modules analyzed in a targeted way to improve product quality. In recent years, with the development of statistical learning and machine learning technology and its excellent prediction performance, software defect prediction methods based on statistical learning methods and machine learning methods have been gradually adopted by researchers and become the mainstream defect prediction t...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/62, G06F11/36
CPC: G06F11/3668, G06F18/24143, G06F18/214
Inventor: 张洋洋, 荆晓远, 吴飞, 孙莹
Owner: NANJING UNIV OF POSTS & TELECOMM