The invention discloses a semi-
supervised clustering integrated protocol identification method. The method comprises the following steps: various data packets in a network are acquired; received
network data is analyzed, and each field of the data packets is extracted and counted;
feature code of
network data obtained after the
network data is analyzed is matched with various feature codes preset in a data base, if the match is successful, the data packets are corresponding protocols; data not successfully matched is subject to
cluster analysis, a plurality of base clustering devices are used to cluster the data packets, and the result is fed back, and a priori
label value is modified; and a semi-supervised
statistical learning is carried out for the result of the clustering of the network data packets and each known protocol, and a
discriminant learner is trained. According to the invention, the terminal protocol
identification rate is improved, and the amount of calculation is moderate, so that the efficiency is high; one time of dialog generate less flow, inaccurate identification is not easy; and besides, the method integrates a plurality of identification methods, so as to achieve multi-dimension identification. The invention also discloses a corresponding semi-
supervised clustering integrated protocol
identification system.