A high-efficiency abnormal time-series data extraction method based on secondary screening

A secondary screening and time-series data technology, applied in the computer field, can solve problems such as difficult to obtain satisfactory results, high time complexity, and time-consuming

Active Publication Date: 2021-11-26
北京中科慧云科技有限公司
View PDF4 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the high time complexity of DTW itself, it is too time-consuming to calculate the nearest neighbor distance nearest_neighbor_dist through an inner loop in a large amount of time series data, so it is impossible to directly replace the Euclidean distance with the DTW distance to measure the difference between two subsequences OK, it is difficult to achieve satisfactory results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A high-efficiency abnormal time-series data extraction method based on secondary screening
  • A high-efficiency abnormal time-series data extraction method based on secondary screening
  • A high-efficiency abnormal time-series data extraction method based on secondary screening

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] Features and exemplary embodiments of various aspects of the invention will be described in detail below. The following description covers numerous specific details in order to provide a thorough understanding of the present invention. It will be apparent, however, to one skilled in the art that the present invention may be practiced without some of these specific details. The following description of the embodiments is only to provide a clearer understanding of the present invention by showing examples of the present invention. The present invention is by no means limited to any specific configuration and algorithm presented below, but covers any modification, replacement and improvement of related elements, components and algorithms without departing from the spirit of the present invention.

[0054] The embodiment of the present invention provides a method that can accurately search for abnormal timing in ultra-large-scale timing data. On the basis of the existing i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a high-efficiency time series data abnormal extraction method, which is used for finding abnormalities from electrocardiogram data (ECG) and detecting heart diseases. The method includes: the distance function in the system uses the DTW distance instead of the traditional Euclidean distance to reduce the phase shift error; the original time series data (ECG) is mapped into a series of string sequences through SAX technology and stored in the data structure Array array and Trie ternary tree ;Find the most likely abnormal sequence as a candidate abnormality through the Array array and Trie ternary tree; find out the nearest neighbor distance of the first candidate abnormality in the ECG data through secondary screening, as the first threshold distance; through nested The inner and outer loops verify that the candidate anomaly is the one we are looking for, otherwise, the candidate anomaly is updated; after the inner and outer loops are executed, the abnormal timing in the ECG data is finally obtained. The technical scheme of the present invention solves the problem that it is difficult to quickly and accurately find abnormalities in a large amount of ECG data due to excessively high DTW distance redundancy.

Description

technical field [0001] The invention belongs to the field of computers, and in particular relates to a method for realizing high-precision and rapid extraction of abnormalities in massive time-series data, which is applied to abnormal detection in ECG (electrocardiogram, electrocardiogram) data, so as to realize the detection of heart disease. Background technique [0002] In the past ten years, hundreds of thousands of articles have been studying how to find the subsequence most similar to a given timing in a large amount of time series data (time series data refers to data recorded in chronological order), and this patent studies how to Find the subsequence with the greatest difference from other time series in a large amount of time series data, which is called time series data anomaly. [0003] Abnormal time series data, simply put, means that in a very large time series data, there are some time series fragments that are very different from other time series data. Time...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G16H80/00
CPCG06F19/3418
Inventor 许泽文李建强莫豪文田猛刘璐孙靖超
Owner 北京中科慧云科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products