Time series data clustering method based on noise reduction encoder and attention mechanism

A time-series data and clustering method technology, applied in neural learning methods, instruments, computer components, etc., can solve problems affecting clustering accuracy, distraction, noise data sensitivity, etc., to improve clustering accuracy, Effects that are easy to classify

Pending Publication Date: 2021-02-09
SOUTHEAST UNIV
View PDF7 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, most of the existing time series clustering methods based on deep learning are implemented based on the deep autoencoder framework, which is sensitive to noisy data
Moreover, the existing temporal clustering algorithms have the problem of distraction, because the weights of features at different moments are ignored in the feature extraction process.
In short, the existing time series clustering methods cannot extract robust and discriminative time series features well, which greatly affects the subsequent clustering accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Time series data clustering method based on noise reduction encoder and attention mechanism
  • Time series data clustering method based on noise reduction encoder and attention mechanism
  • Time series data clustering method based on noise reduction encoder and attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0040] This embodiment is implemented on a Linux platform, written in Python, running on the Tensorflow framework, and the hardware is a NVIDIA GTX 1080Ti GPU processor. The basic configuration of the software is shown in Table 1:

[0041] Table 1 Software configuration information

[0042]

[0043] As shown in Table 2 to Table 5, the experimental part mainly evaluates the two algorithms of DTSC and DATC from the following three indicators: average error rate MER, RI, and NMI. In addition, the introduction of noise reduction autoencoder and attention mechanism The effectiveness of the experiment, the default settings of the parameters in the experiment are shown in Table 2 below.

[0044] Table 2 Experiment default parameter configuration

[0045]

[0046] Analyze the public time series data set: UCR data set, which contains 36 sub time series data sets, each data set has different categories and sample numbers.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a time series data clustering method based on a noise reduction encoder and an attention mechanism, which achieves clustering of label-free data in an unsupervised manner, and sequentially comprises the following steps of: 1, adding noise data into an original time sequence; step 2, adding an attention mechanism into the LSTM; 3, initializing a model by adopting a K-means method; 4, jointly optimizing the reconstruction loss and the KL divergence loss of the noise reduction auto-encoder by adopting an Adam optimization method; and iteratively optimizing the reconstruction loss and the clustering loss of the encoder to obtain a trained optimal model. Based on the deep noise reduction auto-encoder, the problem that time series data clustering is sensitive to noise is solved, and the clustering effect is improved by introducing an attention mechanism.

Description

technical field [0001] The invention belongs to the technical field of data mining, and relates to a time series clustering method, in particular to a time series data clustering technology based on a noise reduction encoder and an attention mechanism. Background technique [0002] Time series clustering research is an important research content in the field of data mining and machine learning. It is a research that digs out hidden patterns in time series data and divides time series into different categories. However, time series are usually high-dimensional, so the clustering research of time series is full of challenges. In recent years, deep learning has promoted the development of natural language processing, image processing and other fields. It can extract low-dimensional feature vectors from high-dimensional data, and this low-dimensional feature representation is needed in time series research. However, most of the existing time series clustering methods based on d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/049G06N3/08G06N3/045G06F18/23213
Inventor 何洁月徐周
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products