Sample clustering method and device, equipment and storage medium
A clustering method and sample technology, applied in the field of data processing, can solve the problem that the sample set cannot be reasonably clustered, and achieve the effect of ensuring the rationality of the clustering and reducing the workload.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0075] figure 2 It is a flowchart of a sample clustering method provided by Embodiment 1 of the present invention. The sample clustering method provided in the embodiment can be executed by a sample clustering device, which can be realized by software and / or hardware, and the sample clustering device can be composed of two or more physical entities, or Can be a physical entity. For example, the sample clustering device can be a computer, mobile phone, tablet or interactive smart tablet and other smart devices with data computing and analysis capabilities.
[0076] Specifically, refer to figure 2 , the sample clustering method specifically includes:
[0077] Step 110: Statistically calculate the first sample distance corresponding to each sample in the sample set, where the first sample distance is the distance between the sample and the Sth nearest neighbor sample of the sample.
[0078] Exemplarily, the sample set includes multiple samples, and each sample has the same ...
Embodiment 2
[0101] Figure 4 It is a flowchart of a sample clustering method provided by Embodiment 2 of the present invention. This embodiment is embodied on the basis of the above-mentioned embodiments. refer to Figure 4 , the sample clustering method provided in this embodiment includes:
[0102] Step 201. Construct a K-nearest neighbor graph for each sample in the sample set, and the weight of each edge in the K-nearest neighbor graph is the distance between corresponding samples.
[0103] Specifically, after calculating the distance between each sample and other samples, when drawing the K-nearest neighbor graph of a certain sample, use the sample as a vertex, and obtain the K samples closest to the sample according to the distance between the samples and corresponding distance. Afterwards, the connection lines between the vertices and the K samples are respectively drawn, and the distance between the vertices and the corresponding samples is used as the weight of the connection...
Embodiment 3
[0176] Figure 9 It is a schematic structural diagram of a sample clustering device provided in Embodiment 3 of the present invention. refer to Figure 9 , the sample clustering apparatus includes: a distance statistics module 301 , a distance acquisition module 302 , an average calculation module 303 , a connection determination module 304 and a sample clustering module 305 .
[0177] Wherein, the distance statistics module 301 is used to count the first sample distance corresponding to each sample in the sample set, and the first sample distance is the distance between the sample and the Sth neighbor sample of the sample; distance acquisition Module 302, used to obtain the first sample distance within the set distance range among all the first sample distances; mean calculation module 303, used to obtain the first sample distance based on the set distance range Calculate the mean value of the distance; the connection determination module 304 is used to determine all connec...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com