Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Clustering analysis method and system for multi-omics data

A technology of cluster analysis and omics data, applied in the field of data analysis, can solve problems such as difficult to achieve good results, failure to use node path information, and affecting clustering accuracy

Pending Publication Date: 2021-09-14
瓴域影诺北京科技有限公司
View PDF0 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The similarity-based method uses the similarity between samples to cluster data. Since the number of samples is far smaller than the number of features in the current multi-omics data, the similarity-based method is very difficult when the sample size is insufficient. Hard to have a better effect
Algorithms based on spectral clustering do not use node path information, which affects the accuracy of clustering

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Clustering analysis method and system for multi-omics data
  • Clustering analysis method and system for multi-omics data
  • Clustering analysis method and system for multi-omics data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053]The implementation mode of the embodiment of the present application is illustrated by specific specific examples below. Those who are familiar with this technology can easily understand other advantages and effects of the embodiment of the present application from the content disclosed in this specification. Obviously, the described embodiment is the embodiment of the present application. The application embodiments are part of the embodiments, but not all of the embodiments. Based on the embodiments in the embodiments of the present application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the embodiments of the present application.

[0054] With the increasingly extensive research on graph neural networks, the network contains rich relational information, can fit multi-source heterogeneous data, and realize efficient clustering algorithms based on artificial intelligence, which ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a clustering analysis method and system for multi-omics data, and the method comprises the steps: segmenting MR image information through employing a neural network, and extracting a high-throughput image hyper-parameter according to the segmentation information of each part; processing the clinical data, the demographic data and the laboratory inspection data to generate vector representations of different dimensions; performing multi-source data fusion on the high-throughput image data and the vector representations of different dimensions to obtain fused multi-source heterogeneous data; constructing a multi-source heterogeneous data set, and training and testing a multi-source graph clustering model to obtain an optimal model; inputting MR image information into the optimal model, and analyzing differences of different categories and similarities of the same category. A graph structure mode is adopted, the correlation condition between data is visually expressed, different features are captured, the model is more robust, the efficient clustering algorithm based on the graph neural network model is achieved, and the practical value is very high.

Description

technical field [0001] The embodiments of the present application relate to the technical field of data analysis, and in particular to a method and system for cluster analysis of multi-omics data. Background technique [0002] In recent years, multi-omics analysis has been widely used in medical subtype analysis, and high-throughput data fusion to accurately define subtypes has become an important topic in precision medicine. At present, the integration of multi-omics is facing many problems, such as complex data, sparse data, and heterogeneous data. Data fusion methods can be divided into similarity methods, dimensionality reduction methods, and statistics-based methods. [0003] The similarity-based method uses the similarity between samples to cluster data. Since the number of samples is far smaller than the number of features in the current multi-omics data, the similarity-based method is very difficult when the sample size is insufficient. It is difficult to have a be...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G06N3/08
CPCG06N3/08G06F18/23213G06F18/22G06F18/25
Inventor 怀晓晨穆红章
Owner 瓴域影诺北京科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products