Extraction method of TCP session reassembling and statistic data based on streaming processing

A statistical data and stream processing technology, applied in the transmission system, electrical components, etc., can solve the problems of low processing flexibility, slowing down the system speed, waste of physical and human resources, etc., and achieve flexible data stream transmission and flexible processing ways to improve overall performance

Active Publication Date: 2018-07-17
SOUTH CHINA UNIV OF TECH
View PDF6 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] Current session data extraction tools often rely on open source tools such as libnids, netflow, etc., and rely on the computing resources and storage resources of sensors, which can easily reach computing bottlenecks, lack reliability, redundancy and other mechanisms, and have low fault tolerance, hindering the overall performance and reliability of the system. sex, ultimately leading to more waste of physical and human resources
Tools such as Netflow rely on special network devices and are not universal, and the lack of flexibility in tools such as libnids is difficult to meet the needs of customized output, which brings more labor costs for subsequent data processing
In the actual production environment, the transmission of data streams often uses the original text to flow in the data pipeline. The processing flexibility is low, resulting in a waste of network resources, and it is easy to reach the network bottleneck, which in turn leads to the stagnation of the entire system.
In other statistical data extraction methods, offline calculations are often used. This calculation method will cause high data delay and cannot deliver data in the first time, thus slowing down the speed of the entire system.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Extraction method of TCP session reassembling and statistic data based on streaming processing
  • Extraction method of TCP session reassembling and statistic data based on streaming processing
  • Extraction method of TCP session reassembling and statistic data based on streaming processing

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0013] A method for extracting TCP session reorganization and statistical data based on stream processing, comprising the following steps:

[0014] (1) if figure 1 As shown, a data pipeline layer and a real-time computing layer are built between the data collection layer and the data storage layer. The data collection layer collects network data packets and sends them to the data pipeline layer for caching. The real-time computing layer is used to extract data from the data pipeline layer. , for processing, and the processing results are stored in the data storage layer;

[0015] (2), Build three Kafka distributed message queues in the data pipeline layer as data pipeline services;

[0016] (3) Build three Flink stream processing engines in the real-time computing layer as stream computing clusters;

[0017] One of them is the main node, and three are secondary nodes at the same time. When the stream processing task is running, the main node will automatically assign the tas...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides an extraction method of TCP session reassembling and statistic data based on streaming processing. The method comprises the following steps of: constructing a data pipeline layer and a real-time calculation layer between a data collection layer and a data storage layer, wherein the data collection layer collects a network data package and sends the network data package to the data pipeline layer for cache, the real-time calculation layer is configured to extract data from the data pipeline layer for processing, and a processing result is stored in the data storage layer; the real-time calculation layer extracts data from the data pipeline to perform deserialization to form an object; and the object is taken as a data element in a streaming calculation process, and TCP session data and statistic data are output through data operation. The extraction method of TCP session reassembling and statistic data based on streaming processing effectively reassemblesthe TCP session of the network, excavates session statistic data according to the extracted session data, provides support for flow information excavation and abnormal behavior analysis, provides efficient and reliable session data and session statistic data calculation service and guarantees efficiency and stability of the system.

Description

technical field [0001] The invention relates to a stream processing TCP session recombination and statistical data extraction method. Background technique [0002] Current session data extraction tools often rely on open source tools such as libnids, netflow, etc., and rely on the computing resources and storage resources of sensors, which can easily reach computing bottlenecks, lack reliability, redundancy and other mechanisms, and have low fault tolerance, hindering the overall performance and reliability of the system. sex, ultimately leading to more waste of physical and human resources. Tools such as Netflow rely on special network equipment and are not universal. Moreover, the lack of flexibility in tools such as libnids is difficult to meet the needs of customized output, which brings more labor costs for subsequent data processing. In the actual production environment, the transmission of data streams often uses the original text to flow in the data pipeline. The pr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L29/08H04L29/06
CPCH04L67/1044H04L67/141H04L67/142H04L69/14H04L67/568
Inventor 高英李若鹏靳亚洽刘煜
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products