Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Constraint-restricted clustering and information metric software birthmark feature selection method, computer

A feature selection method, software feature technology, applied in software engineering design, calculation, software maintenance/management, etc., can solve the problems of lack of dynamic feature selection, low software recognition rate, no feature group, etc., to improve practicability and wide application Resilience, guaranteed anti-attack, and unique effects

Active Publication Date: 2021-04-13
XIAN UNIV OF FINANCE & ECONOMICS
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] To sum up, the existing problems in the prior art are as follows: First, the traditional software feature acquisition method without clustering and screening is characterized by a large feature library, which contains relatively large feature redundant information, and feature-based software identification The rate is low; or there is no contextual semantic analysis feature group combined with software, lack of dynamic feature selection

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Constraint-restricted clustering and information metric software birthmark feature selection method, computer
  • Constraint-restricted clustering and information metric software birthmark feature selection method, computer
  • Constraint-restricted clustering and information metric software birthmark feature selection method, computer

Examples

Experimental program
Comparison scheme
Effect test

Embodiment approach

[0104] Before feature extraction, equivalent semantic transformation is performed on the original software, and then the sub-behavioral features of the original software, similar software (software after equivalent semantic transformation) and heterogeneous software (reference software) are analyzed. The analysis and identification process is divided into two parts, such as Figure 4 , the first is the construction of birthmark features, and the second is the detection of unknown software. The specific implementation process is as follows:

[0105] The process of "construction of birthmark feature software recognition scheme" includes:

[0106] a) Extract the sub-behavior feature set of each type of software, and count the frequency of the sub-behavior features.

[0107] b) Measure each sub-line feature set of the original software, and compare its similarity with the same sub-behavioral features in similar software and heterogeneous software.

[0108] c) Calculate the simil...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of birthmark-based software identification, and discloses a software birthmark feature extraction method based on constraint clustering and information measurement, which adopts constraint-limited cluster analysis, and measures feature homogeneity and inter-category distances based on mutual information. In software feature selection, the equivalent semantic transformation of the software is performed first, and then the feature segmentation is performed, combined with the program slicing technology to classify the limited groups of features, and construct gain function and penalty function evaluation for the segmented feature fragment set, based on different structural groups Hierarchical clustering selection, which screens out invariant features in the same class and eliminates common features in different classes. The present invention considers the correlation between features, and the set of selected birthmark features has the largest amount of information and the smallest redundancy; it not only ensures the anti-attack performance of the birthmark features, but also ensures the uniqueness. The invention improves the robustness and reliability of the software birthmark feature, and greatly improves the feature-based recognition rate of the software.

Description

technical field [0001] The invention belongs to the technical field of software birthmarks, and in particular relates to a feature selection method of constraint-defined clustering and information measurement software birthmarks and a computer. Background technique [0002] Traditional software feature extraction techniques are summarized into two categories: one is segmental extraction, which is a fixed-length or variable-length segmentation mode combined with grammar. After feature extraction, it can be cut into segments by static word segmentation. Segmented feature selection, or selection based on variable-length P-gram segmentation; without combining semantic analysis and analyzing the correlation between features, the obtained birthmark features are less robust to software encryption, deformation, polymorphic attacks, etc. ; The other is slicing extraction, combined with the selection method of software semantic structure, the program slicing is based on the program de...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F8/75
CPCG06F8/751
Inventor 罗养霞
Owner XIAN UNIV OF FINANCE & ECONOMICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products