A time series similarity searching method based on segmentation weight

A similarity search and time series technology, applied in the field of information processing, can solve the problem that it is difficult for users to effectively participate in the process of mining similarity patterns of time series data, and achieve the effect of improving accuracy and accuracy

Active Publication Date: 2019-02-19
HOHAI UNIV
View PDF5 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The current time series data similarity pattern mining algorithm lacks effective means to further obtain satisfactory results for users, and in the above algorithms, it is difficult for users to effectively participate in the mining process of time series data similarity patterns

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A time series similarity searching method based on segmentation weight
  • A time series similarity searching method based on segmentation weight
  • A time series similarity searching method based on segmentation weight

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0062] In this embodiment, the Adiac data set in the UCR data set is used as an example to introduce detailed steps. The segmentation threshold is only selected as 0.5 for demonstration. In practical applications, the segmentation threshold can be determined through experimental debugging or empirically. The calculation of segment distance takes Euclidean distance as an example.

[0063] (1) Select a piece of data in the Adiac dataset as the query sequence, such as figure 2 as shown in (a);

[0064] (2) Choose a threshold and segment the query sequence using important turning points, such as figure 2 As shown in (b), 6 segmentation points are extracted when the segmentation threshold is 0.5.

[0065] (3) Initialize the weight of each segment to 1 / 6, and use the similarity measurement method of segmental weight to measure, calculate the segmental Euclidean distance between each sequence to be searched and the query sequence, and use the topK method to obtain the first three...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a time series similarity searching method based on segment weight, which comprises the following steps: (1) segmenting a query sequence q by adopting an important turning pointof the time series; (2) establishing piecewise Euclidean distance with weights as similarity measure of time series; (3) performing k-nearest neighbor similarity querying on the q, searching for thek most similar sequences of the query sequence q; (4) ending the query if the user is satisfied with the query result obtained in the step (3); If the user is not satisfied with the query result obtained in the step (3), marking the result and entering the step (5); (5) allowing the system to update the weights of the segments by using the sequence of user tags and returning to step (2). The invention automatically updates the weights through user feedback, adaptively learns the attention degree of the user to different segments, can improve the accuracy of similarity measurement, and furtheroptimizes the search result.

Description

technical field [0001] The invention belongs to the technical field of information processing, in particular to a time series similarity search method based on segmentation weights. Background technique [0002] Data mining is to extract people's interested, implicit and previously unknown knowledge from the data of large databases. In many real-world databases, the data is often a series of observations recorded in chronological order, and the attribute values ​​of objects may change over time. Therefore, mining potentially useful knowledge from time series data has important theoretical and practical significance. The research on pattern mining of time series data mainly focuses on the similarity pattern mining of time series data. Time series data similarity pattern mining is to find time series sequences that are very similar to a given time series data pattern in the database. It has a wide range of practical values, for example: the similarity analysis of the time s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/2458
Inventor 王继民刘琪张鹏程
Owner HOHAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products