Network protocol identification method and system based on semi-supervised learning

A semi-supervised learning and protocol recognition technology, applied in the field of network security, can solve the problems of inaccurate classification and dependence on data sets, and achieve the effect of fast recognition speed and high recognition accuracy.

Inactive Publication Date: 2012-07-25
TSINGHUA UNIV
View PDF1 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

These methods either use unsupervised machine learning or supervised machine learning, either cannot accurately classify, or rely on a large number of calibrated data sets
But there is currently no published method that satisfies all practical needs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Network protocol identification method and system based on semi-supervised learning
  • Network protocol identification method and system based on semi-supervised learning
  • Network protocol identification method and system based on semi-supervised learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0043] figure 1 is a flow chart of the network protocol identification method based on semi-supervised learning of the present invention, such as figure 1 As shown, the method comprises the steps of:

[0044] A: Train multiple classifiers based on semi-supervised learning methods;

[0045] Described step A specifically comprises the steps:

[0046] A1: Carry out protocol identification and labeling on the first predetermined amount of first single streams. The calibration results include: the length information of the predetermined value network packets at the front of each of the first single streams, and the length information of each of the first single streams Desc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a network protocol identification method and system based on semi-supervised learning, relating to the technical field of network safety. The method comprises the following steps of: training a plurality of classifiers based on a semi-supervised learning method; carrying out split-flow treatment on a network flow rate to be identified, and counting a single flow to be identified, which is obtained by the split-flow treatment, so as to obtain length information of a pre-set quantity of network packets in each single flow to be identified; and utilizing the classifiers to carry out protocol identification on the single flow to be identified according to the length information of the pre-set quantity of the network packets in the single flow to be identified. According to the network protocol identification method and system based on the semi-supervised learning, disclosed by the invention, a semi-supervised machine learning method is introduced into the protocol identification, namely that less calibrated sample information is used and a lot of unlabelled information is sufficiently utilized, so that the classification performance and the practical expansibility are met simultaneously; and the network protocol identification method and system have higher identification accuracy and higher identification speed, and can identify an encrypted protocol without depending on a lot of calibration data sets.

Description

technical field [0001] The invention relates to the technical field of network security, in particular to a network protocol identification method and system based on semi-supervised learning. Background technique [0002] Network application layer protocol identification technology, referred to as protocol identification, is a key component and core technology of security gateway systems such as Deep Inspection Firewall (deep inspection firewall), NIDS / NIPS (network intrusion detection / defense system), UTM (unified threat management) It is also the basic technology for operators to conduct traffic statistics and billing management. [0003] Protocol identification technology mainly relies on certain characteristics of network traffic to identify and judge, and infers the type of protocol to which the traffic belongs. Its core technical points include: the selection of features used for identification, and the method of using features for identification. [0004] At presen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L29/06H04L12/56G06N5/00H04L47/43
Inventor 杨保华李军
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products