Code reviewer recommendation system and method based on random forest classifier
A technology of random forest and recommendation system, which is applied in the direction of instruments, computer parts, software testing/debugging, etc. It can solve the problems of large feature dimension, unsatisfactory recommendation effect, and difficulty in integration, and achieve the effect of saving communication costs
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0075] see figure 1 In the embodiment of the present invention, a code reviewer recommendation system based on a forest classifier is proposed, including an input module 310 , a calculation module 320 , a model training module 330 and a recommendation result output module 340 .
[0076] In the technical solution of the embodiment of the present invention, by converting the reviewer recommendation problem into a multi-classification task of machine learning, the historical review log and historical code change log of the project are analyzed, and the personnel information and code change of the project are mined. Information and file path information, and converted into personnel activity features, code change features, and file weight features, all review records contained in the project are converted into feature vectors, and used as a data set, input into the random forest model, training random Forest classifier; extract features from the historical code review records of t...
Embodiment 2
[0112] see image 3 As shown, the technical solution of this embodiment proposes a code reviewer recommendation method based on a random forest classifier. This method recommends a suitable reviewer for the code to be reviewed according to the historical code review records, saving the time required for no recommendation. The communication time, the specific steps are as follows:
[0113] Step 210, acquire project historical code review records, said historical code review records include:
[0114] Obtaining a code submission log that matches the software project, the code submission log includes: code submitter, submission time, branch, number of newly added code lines, number of deleted code lines, and file path set;
[0115] Obtaining a code review log that matches the software project, where the code review log includes: reviewer, review time;
[0116] Step 220, according to the historical code review records of the project, mining personnel activity characteristics, cod...
Embodiment 3
[0136] refer to Figure 4 , this embodiment provides a time series verification model to evaluate the effect of the code reviewer recommendation method based on the random forest classifier in the second embodiment;
[0137] In this embodiment, the time series model is used to verify the method, and the processed historical code review records are used as a data set, which is divided into a training set and a test set according to time series, such as Figure 4As shown, first sort the data set according to the review time and divide it into N time slices, N depends on the time interval between the first code review of the project and data collection, for example, the first code review of project A The first code review time is January 2019, and the data is collected in November 2019, that is, 10 months after the first submission. When "month" is used as the time slice, all records included in project A will be collected Divided into 10 time slices, then, based on the divided ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com