Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Multivariate time sequence similarity search method based on variable correlation

A multivariate time series and similarity search technology, applied in the information field, can solve the problems of inability to balance efficiency and accuracy, lack of interpretability of dimensionality reduction methods, etc. Effect

Pending Publication Date: 2022-02-01
HOHAI UNIV
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, most of the existing multivariate time series similarity search methods cannot balance efficiency and accuracy. At the same time, some dimensionality reduction methods still lack interpretability. It is necessary to have a multivariate time series similarity search method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multivariate time sequence similarity search method based on variable correlation
  • Multivariate time sequence similarity search method based on variable correlation
  • Multivariate time sequence similarity search method based on variable correlation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0040] The present invention provides a multivariate time series similarity search method based on variable correlation, such as figure 1 shown, including the following steps:

[0041] Step 1: Obtain query sequence Q and time series dataset S in advance, and process multivariate time series into a unified format. The query sequence Q and the time series data set S are normalized by using the max-min normalization method.

[0042] Using the maximum and minimum normalization, each variable of the multivariate time series is normalized separately, and the original data is mapped to [01] through the normalization formula. For the multivariate time series:

[0043] x m×l ={x 1i , x 2i ,...,x mi}, i=1, 2, ..., l

[0044] The normalization formula is as follows:

[0045]

[0046] Among them, x' ij is the converted value, x ii is the original value, repr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multivariate time sequence similarity search method based on variable correlation. The method comprises the following steps of firstly, normalizing a pre-acquired multivariate time sequence; secondly, calculating a Pearson correlation coefficient among query sequence variables; on the basis of Pearson correlation, clustering by using OPTICS, and clustering variables with similar trends into one class; then, extracting a central sequence of each class after clustering, and representing the class by using the central sequence; and finally carrying out PAA feature extraction on all the center sequences, and obtaining a feature representation sequence of the multivariate time sequence; and calculating the DTW distance between the multivariate time sequences after feature representation, and finding out similar sequences. According to the method, the feature sequences which are concise and reserve original data features are obtained through variable clustering, and the distance between the feature sequences is calculated by using the multi-dimensional DTW, so that the retrieval efficiency and the query accuracy are improved.

Description

technical field [0001] The invention belongs to the field of information technology, in particular to a multivariate time series similarity search method based on variable correlation. Background technique [0002] Time series is a collection of observations arranged in chronological order, which widely exist in various fields of the real world, including finance, meteorology, medical treatment, engineering, etc. With the continuous development of science and technology, time series data is becoming more and more abundant, and time series similarity search is of great significance to time series prediction, classification, clustering, knowledge discovery, etc. Therefore, time series similarity search is more and more researched readers' attention. [0003] At present, there are many studies on univariate time series similarity measurement, and the research results are relatively rich, but there are not many studies on multivariate time series similarity measurement. The mai...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/2458G06K9/62
CPCG06F16/2465G06F18/23G06F18/22
Inventor 王继民张晨楠余祖愿张新华
Owner HOHAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products