A Method for Detecting Software Repeated Defect Reports

Keywords: bug reports, software technology; applied in software testing/debugging, etc.

Active Publication Date: 2017-02-22
重庆优霓空科技有限公司

AI Technical Summary

Problems solved by technology

Compared with the method of Runeson P., accuracy increased by about 20%, but it remained lower than the results reported by Wang X. Y. et al.

Embodiment Construction

[0073] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0074] A method for detecting repeated software defect reports, specifically comprising the following steps:

[0075] S1: Extract a training sample set M and a test sample set W from the defect report database of the software. The training sample set is $M = \{m_1, m_2, \ldots, m_i, \ldots, m_Q\}$, $i = 1, 2, \ldots, Q$, where $m_i$ denotes the i-th training sample and Q is the number of training samples. The test sample set is $W = \{w_1, w_2, \ldots, w_j, \ldots, w_P\}$, $j = 1, 2, \ldots, P$, where $w_j$ denotes the j-th test sample and P is the number of test samples, with $Q > P$;

[0076] S2: Use the Mallet tool to create a topic model for the training sample set M;

[0077] S2a: Transform the training sample set M into a set of feature vectors, in which each element denotes the training sample feature vector corresponding to training sample $m_i$ ...
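Steps S1 and S2a can be illustrated with a minimal Python sketch. The text available here does not say how the samples are drawn or how the feature vectors are defined, so the random split, the `test_fraction` parameter, the whitespace tokenizer, and the term-count representation are all assumptions made for illustration; the function names are hypothetical and are not part of the patent or of Mallet.

```python
import random
from collections import Counter


def split_reports(reports, test_fraction=0.2, seed=0):
    """Split reports into a training set M and a test set W with Q > P.

    The random split and test_fraction are illustrative assumptions.
    """
    if len(reports) < 3:
        raise ValueError("need at least 3 reports to guarantee Q > P")
    if not 0 < test_fraction < 0.5:
        raise ValueError("test_fraction must be in (0, 0.5) so that Q > P")
    shuffled = list(reports)
    random.Random(seed).shuffle(shuffled)
    p = max(1, int(len(shuffled) * test_fraction))  # P test samples
    return shuffled[p:], shuffled[:p]  # (M, W)


def build_vocabulary(training_reports):
    """Map every token seen in the training set to a vector index."""
    tokens = sorted({tok for text in training_reports
                     for tok in text.lower().split()})
    return {tok: idx for idx, tok in enumerate(tokens)}


def to_feature_vector(text, vocabulary):
    """Term-count vector for one report; unknown tokens are ignored."""
    counts = Counter(text.lower().split())
    vector = [0] * len(vocabulary)
    for tok, n in counts.items():
        if tok in vocabulary:
            vector[vocabulary[tok]] = n
    return vector
```

In this sketch the vocabulary is built from the training set only, so test reports are projected onto the same dimensions that the topic model would be trained on.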

Abstract

The invention relates to a method for detecting repeated software defect reports. First, a training sample set and a test sample set are extracted from a software defect report database, and a topic model is built on the training sample set. The topic model is then applied to the test samples to obtain a document-topic matrix, from which the topic similarity between two test samples is calculated. The classification information of the test samples is extracted to calculate a classification information similarity, and the classification information similarity is multiplied by the topic similarity to obtain the LDA (latent Dirichlet allocation) similarity between the two test samples. Second, N-gram sequences of the test samples are extracted to calculate an N-gram similarity, and a weighted sum of the N-gram similarity and the LDA similarity gives the overall similarity between the two test samples. Finally, if the overall similarity is greater than or equal to a preset threshold, the two test samples are judged to be repeated defect reports. The method greatly improves the accuracy of the detection result, keeps repeated defect reports from being dispatched to developers as far as possible, and saves human resources.
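The scoring pipeline described in the abstract can be sketched as follows. The abstract does not state which measures are used for each similarity, so cosine similarity over topic distributions, field-wise matching of classification information, Jaccard overlap of word bigrams, the `weight` of 0.5, and the `threshold` of 0.6 are all assumptions chosen for illustration; only the product, the weighted sum, and the threshold comparison come from the text.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity of two topic distributions (assumed measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def category_similarity(cat_a, cat_b):
    """Fraction of shared classification fields with equal values (assumed)."""
    keys = set(cat_a) & set(cat_b)
    if not keys:
        return 0.0
    return sum(cat_a[k] == cat_b[k] for k in keys) / len(keys)


def word_ngrams(text, n=2):
    """Set of word n-grams in a report."""
    tokens = text.lower().split()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def ngram_similarity(text_a, text_b, n=2):
    """Jaccard overlap of the two n-gram sets (assumed measure)."""
    a, b = word_ngrams(text_a, n), word_ngrams(text_b, n)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def overall_similarity(topics_a, topics_b, cat_a, cat_b,
                       text_a, text_b, weight=0.5, n=2):
    """Weighted sum of N-gram similarity and LDA similarity."""
    # LDA similarity = topic similarity * classification similarity.
    lda_sim = (cosine_similarity(topics_a, topics_b)
               * category_similarity(cat_a, cat_b))
    return weight * ngram_similarity(text_a, text_b, n) + (1 - weight) * lda_sim


def is_duplicate(similarity, threshold=0.6):
    """Two reports count as repeated when the score reaches the threshold."""
    return similarity >= threshold
```

Because the classification similarity multiplies the topic similarity, two reports filed against entirely different classification values get an LDA similarity of zero in this sketch, and only the N-gram term can push them over the threshold.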

Description

Technical field

[0001] The invention belongs to the technical field of software maintenance, and in particular relates to a method for detecting repeated defect reports.

Background art

[0002] As software projects grow in size, software becomes increasingly complex. The maintenance phase already accounts for 2/3 of the cost of the software life cycle. A software defect report is a document in which software testers or users describe possible defects and errors found during software maintenance. For open source software such as Eclipse, Firefox, and Open Office, different users submit defect reports to the defect tracking system every day as the software grows and new versions are released, and the system accumulates a large number of repeated defect reports. In the defect database of the open source software Firefox, for example, the proportion of repeated defect reports is as high as 30%. In 2005, Mozilla programmers reported that "everyd...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F11/36
Inventor: 徐玲, 邹杰, 葛永新, 杨梦宁, 洪明坚, 张小洪, 蒋欣志
Owner: 重庆优霓空科技有限公司