Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Score clustering analysis method based on t-SNE

A technology of cluster analysis and performance, applied in the field of performance analysis of clustering algorithms

Active Publication Date: 2020-09-04
NORTHWESTERN POLYTECHNICAL UNIV
View PDF5 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In order to solve the problems of multi-subject performance analysis and multi-dimensional data visualization and overcome the deficiencies in the prior art, the present invention proposes a t-SNE-based performance clustering analysis method, which first reduces the dimensionality of students' grades through the t-SNE algorithm and then passes K-Means algorithm clustering to improve the efficiency and effectiveness of multi-subject score analysis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Score clustering analysis method based on t-SNE
  • Score clustering analysis method based on t-SNE
  • Score clustering analysis method based on t-SNE

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0060] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0061] The technical solution adopted by the present invention to solve its technical problems is:

[0062] Step 1: Import raw data;

[0063] Step 2: Perform t-SNE dimensionality reduction on high-dimensional score data;

[0064] (2-1) Calculate the Euclidean distance of different student grades;

[0065] Assuming that all student grades are an m×n matrix, That is, there are m students, n subject scores, row vector x i =[x i1 x i2 ... x in ] indicates the results of each subject of the i-th student, x j Similarly; use the formula ||x i -x j || 2 , calculate the Euclidean distance between every two row vectors, and get an m×m matrix:

[0066]

[0067] d ij Indicates the Euclidean distance between the score row vectors of the i-th student and the j-th student, and the other elements in the matrix are the same;

[0068] (2-2) Calculate t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a score clustering analysis method based on t-SNE. The method comprises the steps: importing original data, carrying out the t-SNE dimension reduction of high-dimensional scoredata, carrying out the K-Means clustering of the score data after the t-SNE dimension reduction, and obtaining a clustering result. According to the method, after the original data is preprocessed, the data in the high-dimensional space is subjected to dimensionality reduction by using the t-SNE algorithm, and then the original data is clustered by using the K-Means algorithm, so that the problemof non-ideal clustering effect caused by over-high data dimensionality is effectively solved. Due to the fact that the distribution characteristics of the high-dimensional data are completely reservedthrough the t-sne dimension reduction method, the clustering result of the high-dimensional data is obtained through reduction of the clustering result of the data after dimension reduction. The superiority of the dimensionality reduction algorithm in the student score analysis algorithm can be seen by comparing the results of dimensionality reduction before clustering and direct clustering of the high-dimensional data.

Description

technical field [0001] The invention relates to the field of score analysis, in particular to a score analysis method of a clustering algorithm. Background technique [0002] Performance analysis is one of the important means for schools to evaluate the learning situation of students in school. Schools generally use indicators such as pass rate, average score, and total score ranking for analysis, hoping to use these indicators to reflect the recent learning status of students in various subjects. However, there is a lack of correlation between the contents reflected by these indicators, and it is difficult to obtain the comprehensive performance of students. Nowadays, with the rapid development of big data analysis technology, schools can use data mining technology and data visualization technology to analyze students' learning more efficiently and accurately. [0003] There are some performance analysis methods based on data mining. The patent "Rough Set-Based Decision-...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/2458G06K9/62G06Q10/06G06Q50/20
CPCG06F16/2465G06Q10/06393G06Q50/205G06F18/21355G06F18/23213Y02D30/70
Inventor 李波白双霞翟玉媛何瑞寅
Owner NORTHWESTERN POLYTECHNICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products