Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Subgroup specific co-expression network identification method

A technology of co-expression and co-expression of genes, which is applied in the field of subgroup-specific co-expression network identification, can solve the problems of simple and crude, screening, and failure to pass the multiple of difference

Active Publication Date: 2020-01-17
BGI GENOMICS CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] 1. It is impossible to guarantee that all cells / samples are in the same state at the same time in one sampling, and the gene expression level is not necessarily consistent. Moreover, due to the low initial amount of single-cell data, there will often be some loss of expressed genes. It is simple, crude and inaccurate to take the average of gene expression in subgroups for analysis;
[0006] 2. By selecting genes with significant differences, only target genes with particularly obvious differences can be found, but regulatory genes with less drastic changes cannot be identified, which has limited effect on understanding the function of subgroups or explaining the reasons for phenotypic differences
[0007] 3. The regulation of gene expression in organisms is real-time and dynamic, and the realization of most functions is not achieved by drastic changes in one or several independent genes, but is regulated by a group of genes synergistically. The degree of change of the gene alone may not be significant, and it cannot pass the screening of the multiple of difference and p-value
[0009] 1. This method is only effective when the number of samples is small (a dozen to one hundred). When the number of samples increases, the consumption of memory and time increases sharply; in addition, the number of samples (cells) of 10x single-cell data reaches thousands level, sometimes even exceeding the number of expressed genes, at this time, the WGCNA algorithm is completely inapplicable

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Subgroup specific co-expression network identification method
  • Subgroup specific co-expression network identification method
  • Subgroup specific co-expression network identification method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0058] Example 1. Establishment of Subgroup-Specific Co-expression Network Identification Method

[0059] 1. Cells are divided into subpopulations based on gene expression

[0060] Obtain all gene expression amounts in all cells to be tested from database or detect (the present invention measures expression amount with UMI quantity); According to different expression amounts (UMI quantity) use Monocle2 (this method is recorded in following document: Trapnell C, Cacchiarelli D, Grimsby J, et al.The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells[J].Naturebiotechnology,2014,32(4):381.) clustered all the tested cells into different cell subpopulations .

[0061] 2. Establishment of identification method for cell subgroup-specific co-expression network

[0062] 1. Normalize the original UMI quantity of the gene

[0063] The original UMI (Uniquemolecular identifiers, UMI is the measurement method of expression) of each gene ...

Embodiment 2

[0101] Example 2, Application of Subgroup-Specific Co-expression Network Identification Method

[0102] 1. Cells are divided into subpopulations based on gene expression

[0103] The source is 4,000 cells of healthy human peripheral blood mononuclear cells (PBMC) published on the official website of 10x genomics as the cells to be tested, and the genetic data comes from:

[0104] https: / / support.10xgenomics.com / single-cell-gene-expression / datasets / 2.1.0 / pbmc4k.

[0105] After clustering the gene expression data of the 4000 peripheral blood mononuclear cells (PBMC) of the above-mentioned healthy people with the Monocle2 method, they were divided into six cell subgroups according to the gene expression level in the cells, as shown in figure 1 shown.

[0106] 2. Establishment of identification method for cell subgroup-specific co-expression network

[0107] According to the second step in the method of Example 1, six cell subpopulation-specific co-expression networks were obtain...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a subgroup specific co-expression network identification method. The invention provides a method for identifying a specific co-expression network of a subgroup of a sample to be detected. The method comprises the following steps of: 1) carrying out homogenization treatment on an original UMI number value of each gene of each sample in the subgroup of the sample to be detected; 2) filtering an original UMI homogenization result; 3) calculating the weighted median correlation bicor of every two filtered genes; 4) filtering the weighted median correlation result to obtaina strong co-expression gene relationship pair; 5) subtracting the shared strong co-expression gene relationship pair appearing in all cells in the subgroup from the filtered strong co-expression generelationship pair of the subgroup of the cells to be detected to obtain a specific co-expressed gene pair of the subgroup of the cells; and constructing a specific gene co-expression network of the subgroup of the sample to be detected by using the specific co-expression gene pair of the subgroup of the cells. The method for the subgroups is more targeted and more sensitive.

Description

technical field [0001] The invention belongs to the field of biotechnology, and in particular relates to a subgroup-specific co-expression network identification method. Background technique [0002] With the development of sequencing technology and the decline of sequencing costs, the sample size for gene expression research is increasing, especially the introduction of 10x single-cell RNA-Seq technology, which makes the number of samples for gene expression research jump to a thousand. For expression The study of differential changes has changed from simple sample-to-sample and group-to-group comparisons to clustering of subgroups of samples based on gene expression changes, and then looking for specifically expressed genes based on the results of the subgroups , or differentially expressed genes, to explain the functions performed by different subgroups of cells, or the reasons why individuals of different subgroups have differences. [0003] The current method used to f...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16B20/20
Inventor 袁永娴唐冲陈伟填李丛
Owner BGI GENOMICS CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products