Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Digital forensic file fragment classification method based on digital image transformation and deep learning

A technology of file fragmentation and deep learning, which is applied in electronic digital data processing, character and pattern recognition, computer parts, etc., can solve the problems of low classification accuracy, unsatisfactory classification results, and low degree of automation, and achieve fragment classification. , high-precision fragment classification, and the effect of improving accuracy

Inactive Publication Date: 2018-10-23
HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL
View PDF5 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Traditional file fragment classification methods include using full file extensions, file metadata, byte frequency distribution features (BFD) or byte frequency correlation features (BFC), linear discriminant analysis (LDA), Fisher linear discriminant (FLD) and the longest common subsequence, extraction of N-gram, Shannon entropy, Hamming weight and byte and other methods with statistical regularity, support vector machine (SVM), etc., but in practical applications, there are unsatisfactory classification results and the degree of automation Problems with low classification accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Digital forensic file fragment classification method based on digital image transformation and deep learning
  • Digital forensic file fragment classification method based on digital image transformation and deep learning
  • Digital forensic file fragment classification method based on digital image transformation and deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0024] The invention firstly converts file fragments into grayscale images, and then uses data-driven deep learning to extract more hidden features of the images, so as to improve the performance of file fragment classification.

[0025] figure 2 Shows the process of converting file fragments to grayscale images. The first and last fragments of the original file are removed from the original file data to obtain file fragments, and each N bit is converted into 1 pixel to obtain a one-dimensional array, and then the one-dimensional array is converted into a two-dimensional matrix, which represents a grayscale image.

[0026] image 3 It shows the CNN network structure modified and optimized by the present invention for the file fragment classification problem. The first convolutional layer uses a convolution kernel with a ratio of 1x1, and th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a digital forensic file fragment classification method based on digital image conversion and deep learning. Firstly, file fragments are converted to grayscale images, and then deep learning is utilized to extract more hidden features of the images to improve performance of file fragment classification. The more hidden features include different texture features, random features and compressibility used for classification. Deep learning uses a modified and optimized CNN model. A first convolutional layer of the model uses a 1x1 ratio convolutional kernel, and the same uses many pipes to enable a network structure to be complicated. All layers have different-numbers-and-scales filtering kernels, and thus feature maps best matching the classification model are obtainedby training through gradient descents and reverse training. According to the method, high-dimensional features of the file fragments are extracted through utilizing the advantages of local connectionand weight sharing of a CNN, and the scheme can also realize high-precision fragment classification for files such as composite files and compressed files which are not easy to classify in previous schemes.

Description

technical field [0001] The invention relates to the technical field of digital forensics, in particular to a method for classifying fragments of digital forensics files. Background technique [0002] Classification of file fragments plays an important role in digital forensics, figure 1 It is an early step in the process of digital forensics. Correctly classifying file fragments is a necessary step to support effective file engraving. The accuracy of file fragment classification directly affects the efficiency and performance of file engraving. [0003] Traditional file fragment classification methods include the use of complete file extensions, file metadata, byte frequency distribution features (BFD) or byte frequency correlation features (BFC), linear discriminant analysis (LDA), Fisher linear discriminant (FLD) and the longest common subsequence, extraction of N-gram, Shannon entropy, Hamming weight and byte and other methods with statistical laws, support vector machin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06F17/30G06N3/04
CPCG06N3/045G06F18/241
Inventor 蒋琳方俊彬王轩陈倩李晔
Owner HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products