Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Multi-data-stream anomaly detection method based on context

A multi-data stream, anomaly detection technology, applied in the field of anomaly detection of historical behavior information, can solve the problems of difficult threshold setting, no consideration, impractical real-time detection, etc., to achieve low time complexity, reduce difficulty, and improve detection accuracy Effect

Active Publication Date: 2014-10-29
NANJING UNIV OF SCI & TECH
View PDF4 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

(Park N H, Lee W S. Statistical grid-based clustering over data streams[J]. ACM SIGMOD Record, 2004, 33(1): 32-37.) The above anomaly detection methods either use the top-p method to quantify the abnormal value The highest p data streams are regarded as anomalies, or the data streams whose abnormal quantization value exceeds a predefined threshold are regarded as anomalies. These methods have problems in practical application: 1) The threshold is difficult to set
However, the LOF algorithm does not consider the difference in the value range of different dimensions, which may lead to the influence of some dimensions being significantly greater than others; in addition, its time complexity is acceptable for offline detection, but not for real-time detection. practical

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-data-stream anomaly detection method based on context
  • Multi-data-stream anomaly detection method based on context
  • Multi-data-stream anomaly detection method based on context

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] combine figure 1 , a context-based multi-stream anomaly detection method, comprising the following steps:

[0024] Step 1, multi-stream acquisition and snapshot generation, the process is as follows:

[0025] Step 1.1, given a distributed computing system composed of n isomorphic computing nodes;

[0026] Step 1.2, synchronously record the observation value of the i-th computing node at time t in Indicates the component of the i-th computing node on the m-th observation dimension of the observed value at time t, 1≤i≤n, each component represents a node metric of interest, and the metrics of isomorphic computing nodes are exactly the same , the value of m is determined by the total number of metrics;

[0027] Step 1.3, constitute the data flow S corresponding to the computing node i i ={s i1 ,s i2 ,s i3 ,...,s it}, S i is an ordered but infinite sequence of data observations;

[0028] Step 1.4, the data streams of n computing nodes form a data stream set S={...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a multi-data-stream anomaly detection method based on the context. The method comprises the following steps that 1, multiple data streams are obtained, and snapshots are generated; 2, the anomaly of the snapshots of the multiple data streams is quantified; 3, the anomaly of the data streams is quantified; 4, the anomaly of the data streams is recognized. The detection method aims to adopt the node anomaly detection of an isomorphism distributed computation system as the study background and adopt the data streams monitored by computational nodes as the study object, the anomaly detection method comprehensively considers the context information of the multiple data streams and the historical behavior information of a single data stream, and the detection rate is high.

Description

technical field [0001] The invention belongs to anomaly detection technology, in particular to an anomaly detection method that integrates context information of multiple data streams and historical behavior information of a single data stream. Background technique [0002] Data stream anomaly detection is an important direction in data stream mining research. Anomalies refer to data that are out of the ordinary in a data set, that are not due to random bias, but are generated by an entirely different mechanism. Since finding anomalies in data streams has a wide range of applications in the fields of network attack monitoring, credit card fraud, and computing system performance analysis, data stream anomaly detection methods are one of the current research hotspots. The detection and mining of abnormal behaviors on data streams The research has attracted the attention of academia and industry. [0003] In real-world applications such as distributed computing, the data stre...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/00
Inventor 徐建李涛张宏张琨朱保平衷宜陈龙
Owner NANJING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products