Method for TCP session reassembly and statistical data extraction based on stream processing

A stream processing and statistical data technology, applied in transmission systems, electrical components, etc., that addresses problems such as insufficient flexibility for customized output, missing reliability and redundancy mechanisms, and the inability to deliver data promptly.

Active Publication Date: 2021-05-28
SOUTH CHINA UNIV OF TECH

AI Technical Summary

Problems solved by technology

[0002] Current session data extraction tools often rely on open source tools such as libnids and NetFlow, and depend on the computing and storage resources of the sensors. They easily hit computing bottlenecks, lack reliability and redundancy mechanisms, and have low fault tolerance, which hinders the overall performance and reliability of the system and ultimately wastes more physical and human resources.
Tools such as NetFlow depend on special network devices and are not universal, while tools such as libnids lack the flexibility to produce customized output, which raises the labor cost of subsequent data processing.
In actual production environments, data streams are often carried through the data pipeline as raw text. Processing flexibility is low, network resources are wasted, and the network easily becomes a bottleneck, which in turn stalls the entire system.
Other statistical data extraction methods often use offline computation. This introduces high data latency and cannot deliver data promptly, which slows down the entire system.


Embodiment Construction

[0013] A method for TCP session reassembly and statistical data extraction based on stream processing, comprising the following steps:

[0014] (1) As shown in Figure 1, a data pipeline layer and a real-time computing layer are built between the data collection layer and the data storage layer. The data collection layer collects network data packets and sends them to the data pipeline layer for caching. The real-time computing layer extracts data from the data pipeline layer and processes it, and the processing results are stored in the data storage layer;

[0015] (2) Build three Kafka distributed message queues in the data pipeline layer as the data pipeline service;

[0016] (3) Build three Flink stream processing engines in the real-time computing layer as the stream computing cluster;

[0017] One of the three is the master node, and all three simultaneously act as worker nodes. When a stream processing task is running, the master node automatically assigns the tas...
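The layering in step (1) can be sketched in a few lines of plain Python. This is a minimal illustration only, not the patented implementation: the `PipelineLayer` class is a hypothetical in-memory stand-in for the Kafka queues, the `compute` function stands in for the Flink cluster, and JSON is an assumed wire format (the source does not name one here). The packet fields `src`, `dst`, and `length` are likewise made up for the example.

```python
import json
from collections import deque


class PipelineLayer:
    """Hypothetical in-memory stand-in for the data pipeline layer (Kafka in the source)."""

    def __init__(self):
        self._buffer = deque()

    def publish(self, payload: bytes) -> None:
        # Cache a serialized packet until the computing layer pulls it.
        self._buffer.append(payload)

    def poll(self):
        # Hand out cached payloads in arrival order.
        while self._buffer:
            yield self._buffer.popleft()


def collect(packets, pipeline):
    """Collection layer: serialize each captured packet and send it to the pipeline."""
    for pkt in packets:
        pipeline.publish(json.dumps(pkt).encode("utf-8"))  # JSON is an assumption


def compute(pipeline, storage):
    """Real-time computing layer: pull, deserialize, process, and store the result."""
    for payload in pipeline.poll():
        pkt = json.loads(payload.decode("utf-8"))
        storage.append({"src": pkt["src"], "dst": pkt["dst"], "bytes": pkt["length"]})


def run_demo():
    pipeline, storage = PipelineLayer(), []  # the list stands in for the storage layer
    packets = [
        {"src": "10.0.0.1", "dst": "10.0.0.2", "length": 60},
        {"src": "10.0.0.2", "dst": "10.0.0.1", "length": 1500},
    ]
    collect(packets, pipeline)
    compute(pipeline, storage)
    return storage
```

Placing a buffer between collection and computation decouples the two: the collector only needs to serialize and publish, and the computing layer can fall behind or restart without the sensors having to hold the data themselves.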


Abstract

A method for TCP session reassembly and statistical data extraction based on stream processing, comprising the following steps: a data pipeline layer and a real-time computing layer are built between the data collection layer and the data storage layer; the data collection layer collects network data packets and sends them to the data pipeline layer for caching; the real-time computing layer extracts data from the data pipeline layer and processes it, and the processing results are stored in the data storage layer. The real-time computing layer deserializes the data extracted from the data pipeline into objects; these objects serve as the data elements of the streaming computation, and data operations on them output the TCP session data and the statistical data. The invention can effectively reassemble the TCP sessions of a network and derive session statistics from the extracted session data, supporting traffic information mining and abnormal behavior analysis, providing efficient and reliable computation of session data and session statistics, and ensuring system efficiency and stability.
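The deserialize, group, and aggregate flow described in the abstract can be illustrated with a small batch sketch in plain Python. It is not the Flink job from the patent. The JSON message layout, the field names, and the helper names (`encode`, `deserialize`, `session_key`, `reassemble`, `summarize`) are all assumptions made for the example. The sketch also ignores sequence-number wraparound and partially overlapping segments, and it treats a repeated sequence number as a retransmission.

```python
import json
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    src: tuple      # (ip, port) of the sender
    dst: tuple      # (ip, port) of the receiver
    seq: int        # TCP sequence number
    ts: float       # capture timestamp in seconds
    payload: bytes


def encode(src_ip, src_port, dst_ip, dst_port, seq, ts, data: bytes) -> bytes:
    """Serialize one segment the way the (assumed) collection layer would."""
    return json.dumps({
        "src_ip": src_ip, "src_port": src_port,
        "dst_ip": dst_ip, "dst_port": dst_port,
        "seq": seq, "ts": ts, "payload": data.hex(),
    }).encode("utf-8")


def deserialize(raw: bytes) -> Segment:
    """Turn a pipeline message back into an object for the computation."""
    d = json.loads(raw)
    return Segment((d["src_ip"], d["src_port"]), (d["dst_ip"], d["dst_port"]),
                   d["seq"], d["ts"], bytes.fromhex(d["payload"]))


def session_key(seg: Segment) -> tuple:
    """Direction-independent key so both halves of a connection share one session."""
    return tuple(sorted([seg.src, seg.dst]))


def reassemble(segments) -> bytes:
    """Order one direction's segments by sequence number, skipping repeated ones."""
    seen, parts = set(), []
    for seg in sorted(segments, key=lambda s: s.seq):
        if seg.seq not in seen:
            seen.add(seg.seq)
            parts.append(seg.payload)
    return b"".join(parts)


def summarize(raw_messages) -> dict:
    """Group segments into sessions, then emit reassembled data and statistics."""
    sessions = defaultdict(list)
    for raw in raw_messages:
        seg = deserialize(raw)
        sessions[session_key(seg)].append(seg)

    results = {}
    for key, segs in sessions.items():
        by_direction = defaultdict(list)
        for seg in segs:
            by_direction[seg.src].append(seg)
        results[key] = {
            "packets": len(segs),
            "bytes": sum(len(s.payload) for s in segs),
            "duration": max(s.ts for s in segs) - min(s.ts for s in segs),
            "streams": {src: reassemble(v) for src, v in by_direction.items()},
        }
    return results
```

In a streaming engine the grouping step would typically be a keyed stream and the per-session lists would be keyed state; the batch dictionary here only shows the shape of the computation.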

Description

Technical field

[0001] The invention relates to a method for TCP session reassembly and statistical data extraction based on stream processing.

Background

[0002] Current session data extraction tools often rely on open source tools such as libnids and NetFlow, and depend on the computing and storage resources of the sensors. They easily hit computing bottlenecks, lack reliability and redundancy mechanisms, and have low fault tolerance, which hinders the overall performance and reliability of the system and ultimately wastes more physical and human resources. Tools such as NetFlow depend on special network equipment and are not universal. Moreover, tools such as libnids lack the flexibility to produce customized output, which raises the labor cost of subsequent data processing. In actual production environments, data streams are often carried through the data pipeline as raw text. The pr...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): H04L29/08, H04L29/06
CPC: H04L67/1044, H04L67/141, H04L67/142, H04L69/14, H04L67/568
Inventor: 高英, 李若鹏, 靳亚洽, 刘煜
Owner: SOUTH CHINA UNIV OF TECH