Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Cancer subtype classification method for performing multi-omics data integration based on D-S evidence theory

A technology of evidence theory and omics data, applied in the field of cancer subtype classification, can solve the problems of harsh selection of integration methods and lack of interpretability of integration methods, and achieve good results

Inactive Publication Date: 2019-10-15
DALIAN UNIV OF TECH
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the effect of the similarity matrix method is very harsh on the selection of integration methods, and the existing integration methods also lack acceptable interpretability.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cancer subtype classification method for performing multi-omics data integration based on D-S evidence theory
  • Cancer subtype classification method for performing multi-omics data integration based on D-S evidence theory
  • Cancer subtype classification method for performing multi-omics data integration based on D-S evidence theory

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] The specific implementation of the method will be further described below in combination with the technical solution on the public data set. The public data set is only limited to explaining the present invention for easy understanding, rather than limiting the present invention.

[0039] From the TCGA database ( https: / / portal.gdc.cancer.gov / ) to obtain the invasive breast cancer (BIC) data set, which is composed of gene expression data, methylation data and miRNA data of 105 patients, and the feature numbers of each omics data are 17814, 23094 and 354 respectively. The following 10 patients TCGA-A1-A0SD, TCGA-A2-A04N, TCGA-A2-A04U, TCGA-A2-A04W, TCGA-A2-A0CL, TCGA-A2-A0CS, TCGA-A2-A0CV, TCGA- Take A2-A0D3, TCGA-A2-A0ES, and TCGA-A2-A0EW as examples for specific description.

[0040] (1) Use formula (1) to calculate the similarity s between patients in each omics data ij :

[0041] Table 1, Table 2, and Table 3 are the results calculated by formula (1) on the gene e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a cancer subtype classification method for performing multi-omics data integration based on a D-S evidence theory, and belongs to the technical field of multi-omics data analysis. The cancer subtype classification method comprises the steps: firstly obtaining similarity matrixes of patients under different omics backgrounds, then integrating the similarity matrixes based ona D-S evidence theory to obtain final similarity measurement, and finally performing clustering on the similarity matrixes to obtain a clustering result which is divided into different subtypes. Thecancer subtype classification method is used for dividing cancer subtypes, uses various different omics data, and is more stable and credible and has biological significance when compared with a single omics method. In addition, the cancer subtype classification method applies an integration mode based on a D-S evidence theory, so that a reasonable fusion means for data from different sources is provided. The cancer subtype classification method has a good comprehensive analysis effect on multi-omics data, and is of great significance to division of cancer subtypes.

Description

technical field [0001] The invention belongs to the technical field of multi-omics data analysis, and relates to a data integration method based on D-S (Dempster-Shafer) evidence theory. A data-integrated approach to cancer subtype classification. Background technique [0002] Cancer is a heterogeneous disease. Generally speaking, it can be divided into different subtypes according to different pathogenesis, clinical manifestations, etc. For example, among the different subtypes of breast cancer, the estrogen receptor (ER)-positive subtype responds well to hormone therapy, while the human epidermal growth factor receptor 2 (HER2)-positive subtype responds well to chemotherapy. Therefore, corresponding prognosis and treatment methods should be adopted for different cancer subtypes, and the classification of cancer subtypes is very important for improving the accuracy of cancer diagnosis and assisting doctors in choosing treatment methods. However, at this stage we have very...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G16H50/20G16H70/20
CPCG16H50/20G16H70/20G06F18/232G06F18/22
Inventor 高振博姚卫红林晓慧
Owner DALIAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products