Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Spectral clustering method based on Kendall Tau distance fused measurement

A technology of distance measurement and spectral clustering, applied in the field of spectral clustering, can solve the problem of ignoring the information of other samples around, and achieve the effect of improving the clustering accuracy

Inactive Publication Date: 2016-12-07
NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In order to solve the problem that Euclidean distance is commonly used in traditional spectral clustering algorithms, generally only the distance information between these two samples is considered and the information of other surrounding samples is ignored. This invention proposes a new method based on fusion Kendall Tau distance Spectral clustering method, the specific steps are as follows:

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Spectral clustering method based on Kendall Tau distance fused measurement
  • Spectral clustering method based on Kendall Tau distance fused measurement
  • Spectral clustering method based on Kendall Tau distance fused measurement

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] The technical content of the present invention will be further described below in conjunction with the accompanying drawings. The experimental data in this specific embodiment are all from real data sets in the UCI standard database.

[0021] attached figure 1 The specific flowchart of the spectral clustering method based on fusion Kendall Tau distance mentioned in the present invention is shown, including the following steps:

[0022] In the first step, the Euclidean distance and Kendall Tau distance between samples are calculated.

[0023] Given sample X={x 1 , x 2 ,..,x n}∈R D , then sample x i and x j The Euclidean distance between is:

[0024]

[0025] x in formula (2) im Indicates the m-th attribute of the i-th sample. The distance matrix E is a symmetric matrix. the ith column of the matrix except for E i,i and E j,i Sort the incoming elements to get a sequence: List i =(List 1i , List 2i ,...,List mi ,...,List ni ) m≠i;m≠j , where List mi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a spectral clustering method based on Kendall Tau distance fused measurement, and can be widely applied in a clustering analysis problem. In an existing spectral clustering algorithm, a traditional Euclidean distance is generally used for measuring a similarity between samples, but useful neighborhood information between the samples is neglected. Therefore, the invention provides a new spectral clustering method based on the Kendall Tau distance fused measurement. Firstly, the invention provides a new distance measurement method, the method fuses the Euclidean distance with the Kendall Tau distance in a nonlinear way, and a goal that underlying structure information between the samples is comprehensively mined from multiple perspectives to obtain a strengthened similarity measurement result between the samples. Then, the new similarity measurement result is applied to the spectral clustering algorithm to carry out clustering analysis. By use of the method provided by the invention, structural information between the samples can be comprehensively reflected, and the clustering accuracy of the spectral clustering algorithm is improved.

Description

1. Technical field [0001] The present invention relates to a spectral clustering method based on the fusion of Kendall Tau distance measures, involving distance measures, similarity fusion, spectral clustering analysis, etc. The clustering accuracy of the algorithm is mostly used in the field of data mining. 2. Background technology [0002] With the rapid development of the information age, big data and its related data analysis have received more and more attention. In order to dig out useful information from massive data, one of the most commonly used techniques in data analysis and exploration---cluster analysis has once again become the focus of discussions in various circles. At present, cluster analysis has been successfully applied in fields such as business intelligence, image processing, web search, biology and security. In recent years, spectral clustering has become one of the most popular clustering algorithms and a new research hotspot in the field of machine...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/23G06F18/25
Inventor 张道强光俊叶
Owner NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products