Parallel drug-target correlation prediction method based on sorting learning

A sorting learning and prediction method technology, applied in the field of bioinformatics, can solve the problems of further exploration of the degree of drug-protein correlation, not convincing, etc., to achieve the benefits of redirection, performance improvement, and redundancy removal Effect

Active Publication Date: 2020-08-28
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF9 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The introduction of machine learning has indeed made great progress in terms of speed, but both feature-based methods and similarity-based methods have certain shortcomings: on the one hand, similarity-based methods only rely on unilateral (drug or target), the second is that when the number of known ligands (or targets) that can act on the target (or ligand) is small, the similarity between the analyte and only a few samples can be drawn It is obviously not convincing enough; when using feature-based methods, drug information and protein sequence information may not be well represented in digital form due to the algorithm used
[0004] In addition, when using machine learning to predict drug-protein correlation, many researchers simply predict whether the drug is related to the protein, and classify the research into two Classification problem, no further exploration of the degree of drug-protein correlation, that is, no further exploration of which protein (drug) has the strongest correlation with a given drug (protein)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Parallel drug-target correlation prediction method based on sorting learning
  • Parallel drug-target correlation prediction method based on sorting learning
  • Parallel drug-target correlation prediction method based on sorting learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0048] refer to figure 1 , a preferred embodiment of the present invention provides a parallel drug-target correlation prediction based on a ranking learning algorithm, including:

[0049] S1. Obtain the chemical structure sample set of the drug and the sequence sample set of the target;

[0050] S2. Based on the above sample set, multiple feature extraction algorithms are used to extract the drug, target, and drug-target in parallel to extract its chemical space features, gene space features, similarity and correlation features;

[0051] S3. Combine all the data features, and use principal component analysis (PCA) to perform dimensionality reduction processing on the feature set;

[0052] S4. Taking the feature set obtained by dimensionality reduction as input, using various sorting learning methods to sort the proteins or ligands that are relatively related to each query (drug or target), and calculate the correlation between each protein or ligand involved and the query S...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a parallel drug-target correlation prediction method based on sorting learning, and belongs to the field of bioinformatics. According to the method, multiple types of similarity, correlation characteristics, chemical spatial characteristics and gene spatial characteristics are extracted through multiple characteristic extraction methods; then, due to the fact that a characteristic set with high dimensionality can be obtained through multi-angle characteristic extraction and samples do not have conventional positive and negative example labels, dimensionality reduction processing is conducted through a principal component analysis method, then, the dimensionality reduced characteristic set is input into a sorting learning algorithm, and finally the drug-target correlation degree under each query can be predicted and output. By utilizing sorting learning, the drug-target correlation is no longer simply divided into correlation or uncorrelation, and sorting is performed according to the correlation degree of the drug and the target, so that research and development of a new drug are facilitated, and redirection of the drug is also facilitated.

Description

technical field [0001] The invention belongs to the field of biological information systems, and in particular relates to a parallel drug-target correlation prediction method based on ranking learning. Background technique [0002] There are many methods and techniques for predicting drug-protein correlations. Traditional prediction methods are divided into two types: ligand-based and target-based: the ligand-based method requires the correlation information of the known ligand of the target protein, and uses this to define a pharmacophore model to describe the binding of the ligand. shared features, which means that this type of method is not suitable for situations where there is little known ligand information; target-based methods need to obtain the 3-dimensional structure of the target in advance, but the 3-dimensional structure of some protein sequences is unknown and difficult Obtain. [0003] Although traditional forecasting methods can guarantee high accuracy, the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B15/30G16B40/30
CPCG16B15/30G16B40/30Y02A90/10
Inventor 邹权茹晓青
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products