Multi-architecture binary similar code detection system and method

A binary code and code detection technology, applied in the field of multi-architecture binary similar code detection system, can solve the problem that program code cannot achieve cross-architecture matching, and achieve the effect of improving accuracy

Inactive Publication Date: 2017-11-17
SHANGHAI JIAO TONG UNIV
View PDF4 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Aiming at the defect that the existing technology cannot match program codes lacking API semantic information and cannot realize cross-architecture matching, the present invention proposes a multi-architecture binary similar code detection system and method, and automatically locates similar codes, thereby analyzing completed Code information is migrated to different platforms synchronously, reducing additional manual analysis costs and improving analysis efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-architecture binary similar code detection system and method
  • Multi-architecture binary similar code detection system and method
  • Multi-architecture binary similar code detection system and method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] like figure 1 As shown, this embodiment includes: a preprocessing module, a parameter identification module, an indirect jump identification module, a semantic generation module and a comparison module, wherein: the preprocessing module receives the binary code to be processed and outputs binary functions to the parameter identification module, The indirect jump recognition module and the semantic generation module, the parameter recognition module extracts the sorting table from it and outputs it to the semantic generation module, the indirect jump recognition module extracts the indirect jump statement from it and outputs it to the semantic generation module, and the semantic generation module uses dynamic insertion The pile simulates the execution of the binary function, extracts the semantic features and outputs them to the comparison module, and the comparison module uses the longest common subsequence algorithm alignment method to compare the received semantic feat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A multi-architecture binary similar code detection system and method are provided. The system comprises a preprocessing module, a parameter identification module, a Switch indirect jump identification module, a meaning generation module and a comparison module; the preprocessing module receives binary codes to be processed and outputs a binary function to the parameter identification module, the Switch indirect jump identification module and the meaning generation module; the parameter identification module extracts an ordering table from the binary function and outputs the ordering table to the meaning generation module; the indirect jump identification module extracts Switch indirect jump statements from the binary function and outputs the statements to the meaning generation module; according to the parameter information generated by the parameter identification module, the meaning generation module performs simulating execution of the binary function and extracts and outputs the meaning feature sequence to the comparison module; the comparison module adopts sequence alignment mode to compare the received meaning feature sequence and outputs similarity values. By automatically locating similar codes, the analyzed code information is synchronously migrated to different platforms, reduces the additional manual analysis cost and increases the analysis efficiency.

Description

technical field [0001] The present invention relates to a technology in the computer field, in particular to a multi-architecture binary similarity code detection system and method. Background technique [0002] With the popularity and application of smart devices, more and more programs have been migrated from desktop platforms to embedded devices based on ARM, MIPS, etc. Although the binary program analysis technology of desktop platforms (such as x86) has been relatively mature, However, because different architectures have huge differences in instruction sets, code offsets, and calling conventions, it is difficult to migrate program information analyzed by these techniques from one architecture (such as x86) to another architecture (such as ARM). Contents of the invention [0003] Aiming at the defect that the existing technology cannot match program codes lacking API semantic information and cannot realize cross-architecture matching, the present invention proposes a ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F9/44
CPCG06F8/751
Inventor 张媛媛胡易坤王晴李卷孺谷大武
Owner SHANGHAI JIAO TONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products