Gene expression profile distance measurement method based on deep learning

A gene expression profile distance measurement technology based on deep learning. It addresses the high time cost and poor performance of traditional methods for calculating the distance between gene expression profiles, and measures that distance quickly and effectively.

Active Publication Date: 2019-07-19
HUNAN UNIV

AI Technical Summary

Problems solved by technology

[0006] The technical problem to be solved by the present invention is the poor performance and high time overhead of traditional methods for calculating the distance between gene expression profiles. The invention addresses this by exploiting the strengths of deep metric learning, which can accurately capture the characteristics of data and can quickly and effectively calculate the distance between data.

Examples


Embodiment Construction

[0033] According to the flow shown in Figure 1, the implementation comprises the following four steps:

[0034] Step 1: data conversion processing, which includes the following sub-steps.

[0035] 1.1. Convert the gene expression profile data into a square data matrix whose side length is calculated from the dimension of the expression profile data. Specifically, a sample whose data dimension is N is converted into an x*x square matrix, where x is obtained from a formula (not reproduced in this text), and the extra positions are filled with 0.

[0036] 1.2. Preprocess the square matrix by normalization and mean subtraction.

[0037] 1.3. Assign a distinct category label to the expression profile matrices of each category, and divide the samples into training, validation and test sets.
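
Step 1 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patent's implementation: the side length is taken as x = ceil(sqrt(N)), the smallest square that holds N values; "normalization" is taken to be per-sample min-max scaling; and the 70/15/15 split is arbitrary. The function names `to_square`, `normalize` and `split` are hypothetical.

```python
import math
import random


def to_square(profile):
    """Zero-pad a length-N profile and reshape it into an x-by-x matrix.

    Assumption: x = ceil(sqrt(N)); the source's formula is not reproduced.
    """
    n = len(profile)
    x = math.isqrt(n - 1) + 1 if n > 0 else 0  # integer ceil(sqrt(n))
    padded = list(profile) + [0.0] * (x * x - n)
    return [padded[r * x:(r + 1) * x] for r in range(x)]


def normalize(matrix):
    """Min-max scale one matrix to [0, 1], then subtract its mean.

    Assumption: scaling is done per sample and includes the padding zeros.
    """
    flat = [v for row in matrix for v in row]
    lo, hi = min(flat), max(flat)
    span = (hi - lo) or 1.0  # avoid dividing by zero on a constant matrix
    scaled = [(v - lo) / span for v in flat]
    mean = sum(scaled) / len(scaled)
    centered = [v - mean for v in scaled]
    x = len(matrix)
    return [centered[r * x:(r + 1) * x] for r in range(x)]


def split(samples, labels, fractions=(0.7, 0.15, 0.15), seed=0):
    """Shuffle (sample, label) pairs and cut them into train/validation/test."""
    idx = list(range(len(samples)))
    random.Random(seed).shuffle(idx)
    n_train = round(fractions[0] * len(idx))
    n_val = round(fractions[1] * len(idx))
    parts = (idx[:n_train],
             idx[n_train:n_train + n_val],
             idx[n_train + n_val:])
    return [[(samples[i], labels[i]) for i in part] for part in parts]
```

For example, a profile with N = 5 values becomes a 3*3 matrix whose last four entries are the zero padding. Whether padding values should take part in the normalization statistics is a design choice the source does not settle.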

[0038] Step 2: extract high-level features from the training sample data, which includes the following sub-steps.

[0039] 2.1. Pass the trainin...

Abstract

The invention belongs to the field of gene expression profile classification. It discloses a gene expression profile distance measurement method based on deep learning, and concerns the mining and application of deep learning on biological big data. First, a convolutional neural network model suited to metric learning of gene features is designed to extract data features. Then the distance between data is calculated with an improved cosine distance. Finally, the performance of the method is assessed through the classification results of a classification algorithm. The method can quickly and efficiently measure the similarity between different gene expression profiles, and provides data for subsequent research such as gene classification, clustering, differential expression analysis and compound screening. Compared with the traditional gene enrichment method, the method clearly improves the distance measurement between data, effectively reduces manual intervention during gene expression profile analysis, avoids the overfitting that conventional deep networks are prone to, and has relatively high transferability.
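
The last two stages of the abstract, measuring distance between extracted features and judging the metric by classification, can be illustrated as follows. This is a hedged sketch: it uses the standard cosine distance because the "improved" variant is not specified in this text, it assumes the feature vectors have already been produced by some network, and it uses a 1-nearest-neighbour rule as one possible classifier. The names `cosine_distance` and `nearest_label` are hypothetical.

```python
import math


def cosine_distance(u, v):
    """Standard cosine distance: 1 - cos(angle between u and v)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 1.0  # convention chosen here for zero vectors
    return 1.0 - dot / (norm_u * norm_v)


def nearest_label(query, references, labels):
    """Return the label of the reference feature vector closest to query."""
    best = min(range(len(references)),
               key=lambda i: cosine_distance(query, references[i]))
    return labels[best]
```

Classification accuracy of `nearest_label` on held-out samples then serves as an indirect measure of how well the distance separates categories.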

Description

[0001] Technical field:

[0002] The present invention belongs to the field of gene expression profile classification. More specifically, it relates to the mining and application of deep learning on gene expression profile data, and in particular to a method for measuring the distance between gene expression profiles based on deep learning.

[0003] Background art:

[0004] With the rapid development of biotechnology, experimental and research methods in the field of biomedicine have changed tremendously and show a trend toward "big data". Similarity comparison of expression profile data can be used to compare the expression levels of genes in normal and abnormal cells, help identify disease-related genes and drug targets, and analyze the pathogenic mechanisms of complex diseases. Research on the similarity of gene expression profiles has therefore gradually become a hotspot. At present, the calculation method of gene ex...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/62, G06N3/04, G06N3/08
CPC: G06N3/08, G06N3/045, G06F18/213, G06F18/22
Inventor: 彭绍亮, 刘伟, 李非, 杨亚宁, 李肯立, 卢新国, 张磊, 毕夏安
Owner: HUNAN UNIV