Anomaly point detection method based on data structure

A detection method and data structure technology, applied in the direction of instruments, character and pattern recognition, computer components, etc., can solve problems such as difficulties, time-consuming, unbalanced data distribution, etc., to improve performance, robustness, and scope of application Wide range and good stability performance

Inactive Publication Date: 2018-11-30
CHENGDU UNIV OF INFORMATION TECH
View PDF3 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

There are following deficiencies in this method: 1, need to determine threshold value (d in this detection method) min ), and the determination of this value is more difficult, different data sets d min There is a large gap, for a given different d min The obtained anomaly detection results are usually very unstable
However, simply calculating the Euclidean distance cannot effectively and accurately identify all abnormal points, and the calculation process is cumbersome, time-consuming, and computationally complex
According to whether the detection method needs a label, the detection method can be divided into supervised, semi-supervised and unsupervised. Among them, the unsupervised method is the most challenging method because of the lack of effective label training, and coupled with The imbalance of data distribution exacerbates the inaccuracy of detection results
[0007] To sum up, the existing outlier detection algorithms work better under certain conditions or in specific fields, or have a better effect on outlier detection in lower-dimensional spaces. When the dimensionality of the data is higher, the effect of these algorithms is not ideal. The generalization ability of the algorithm is weak, and for high-dimensional spatial data, the computational complexity is high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Anomaly point detection method based on data structure
  • Anomaly point detection method based on data structure
  • Anomaly point detection method based on data structure

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in combination with specific embodiments and with reference to the accompanying drawings. It should be understood that these descriptions are exemplary only, and are not intended to limit the scope of the present invention. Also, in the following description, descriptions of well-known structures and techniques are omitted to avoid unnecessarily obscuring the concept of the present invention.

[0028] It should be noted that the p threshold value in the present invention refers to: a determined critical constant, which is used to evaluate the degree to which the current observation point belongs to an abnormal point, and it is defined as the number of global points related to the current observation point distance function. Special points in the present invention include outliers.

[0029] like figure 1 As sh...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an anomaly point detection method based on a data structure. The anomaly point detection method comprises the steps that a data set is input; a multidimensional binary tree isconstructed according to the data set, and k neighbors closest to each node in the tree are searched through a binary tree search algorithm; the Euclidean distances between the data points are calculated based on a data structure diagram of constructed data points of the multidimensional binary tree and by combining the neighbor relations of all the nodes in the tree; and in consideration of thesimilarity between the data points and the neighbor relationship of the data points in the tree, anomaly points are automatically determined by sorting the calculated Euclidean distances and setting the threshold value p. According to the anomaly point detection method, the performance of anomaly point detection is improved, and the structural characteristics of the data set is better reflected; in addition, the anomaly point detection method is weakly affected by data distribution and data dimension and is wider in application range during practical application, and the problems that in the prior art, the detection accuracy of special points and the detection performance of high dimensional data are poor are solved.

Description

technical field [0001] The invention belongs to the field of data detection, in particular to a data structure-based abnormal point detection method. Background technique [0002] Outlier detection is the most important task in the process of identifying outliers. Due to the unbalanced distribution of outliers and other reasons, traditional outlier detection methods will lead to inaccurate or even wrong recognition results. Outlier detection technology can effectively improve the performance of outlier detection. Traditional outlier detection technologies are mainly clustering, classification, and pattern recognition. These traditional technologies are to find a common pattern to identify meaningful patterns in data, while outlier detection technology only needs to identify outliers, not Normal points are identified. For example, in a system for detecting life disorders, normal people are regarded as normal points, and patients with disordered vital signs or patients with ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62
CPCG06F18/24147G06F18/24323
Inventor 李孝杰郭峰史沧红娄苗苗王录涛吕建成吴锡
Owner CHENGDU UNIV OF INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products