Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for expressing quantitative character and CNV association based on gene interaction network clustering and group sparse learning

A quantitative trait and gene expression technology, applied in the field of data mining in bioinformatics, it can solve the problems of insufficient in-depth and comprehensive research on CNVs, simple statistical models, and no consideration of multi-factor interactions.

Pending Publication Date: 2022-02-11
HUNAN UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] There are two main shortcomings in the existing calculation methods for studying the regulation mechanism of gene expression: first, the statistical models of the existing methods are relatively simple, and most of them are univariate correlation analysis methods, which do not consider the interaction between multiple factors
Second, the existing research methods are not deep and comprehensive enough to study the role of CNVs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for expressing quantitative character and CNV association based on gene interaction network clustering and group sparse learning
  • Method for expressing quantitative character and CNV association based on gene interaction network clustering and group sparse learning
  • Method for expressing quantitative character and CNV association based on gene interaction network clustering and group sparse learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047]The invention relates to the field of data mining in bioinformatics, in particular to a method for associating expression quantitative traits with CNVs based on gene interaction network clustering and group sparse learning. Specific embodiments of the present invention are described below. Those skilled in the art should understand that these embodiments are only used to explain the technical principles of the present invention, and are not intended to limit the scope of evidence collection of the present invention.

[0048] Step 1: Download the experimentally verified breast cancer FPKM RNA-seq information data from the TCGA database, including 1208 samples and 22682 genes. The CNV data of breast cancer were downloaded from the TCGA database, with a total of 1106 samples and 19729 genes. For the above data, only the overlapping samples in the CNV and mRNA datasets were kept, with a total of 1077 samples. Collect the top 50 high confidence risk genes for breast cancer l...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the field of data mining in bioinformatics, in particular to a synthetic lethal interaction prediction method based on a heterogeneous graph convolutional neural network. The method mainly comprises the steps of (1) collecting breast cancer FPKMRNA-seq information, breast cancer CNV data and breast cancer high-confidence risk gene information; (2) preprocessing breast cancer copy number variation (CNV) and mRNA data by a rank-based method; (3) establishing a gene-gene interaction network based on protein interaction knowledge and a signal channel, and generating a high-density sub-network by using a network clustering algorithm; and (4) constructing a group sparse learning-based model to describe the incidence relation between the subnet and the mRNA expression of the target gene, measuring the ability of the CNVs to predict the expression change of the target gene by using a root mean square error (RMSE), carrying out cross validation by adopting a k-fold cross validation algorithm, and carrying out correlation analysis on the gene expression and the CNV by using a Sperman correlation research method in combination with pathway enrichment analysis.

Description

technical field [0001] The invention relates to the field of data mining in bioinformatics, in particular to a method for associating expression quantitative traits with CNVs based on gene interaction network clustering and group sparse learning. Background technique [0002] Next-generation sequencing (NGS) technologies allow researchers to map the mutational landscape in a specific tumor or even in a pan-cancer fashion. During the development of tumors, different types of mutations have accumulated in the genome, including single nucleotide polymorphisms (SNPs), chromosome fragments, copy number variations (CNVs), gene fusions, etc. [0003] Since different tumor subtypes may adopt different immune evasion pathways, different subtypes have different drug resistance, pathology, and drug responses. The expression of disease-causing and immune genes is regulated at different stages or levels, such as expression quantitative trait loci (eQTLs), CNVs, epigenetic modifications,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16B20/50G16B30/00G16B40/00
CPCG16B20/50G16B30/00G16B40/00
Inventor 陈浩文张翔宁斌屈强
Owner HUNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products