Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Deep learning Residue2vec-based protein structure prediction method

A protein structure and prediction method technology, which is applied in the field of protein structure prediction based on deep learning Residue2vec, can solve the problems of poor conformation space sampling ability and insufficient conformation update accuracy, and achieve high matching degree and high prediction accuracy.

Active Publication Date: 2017-02-01
ZHEJIANG UNIV OF TECH
View PDF6 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But so far there is no perfect method to predict the three-dimensional structure of proteins. Even if good prediction results are obtained, it is only for some proteins. At present, the main technical bottlenecks lie in two aspects. First, On the one hand, it lies in the sampling method. The existing technology is not strong in sampling the conformation space;

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep learning Residue2vec-based protein structure prediction method
  • Deep learning Residue2vec-based protein structure prediction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The present invention will be further described below in conjunction with the accompanying drawings.

[0030] refer to figure 1 and figure 2 , a protein structure prediction method based on deep learning Residue2vec, the conformational space optimization method includes the following steps:

[0031] 1) given input sequence information;

[0032] 2) Build the residue vector in the template library:

[0033] 2.1) Download the resolution less than from the protein database (PDB) website high-precision protein, where is the distance unit, m; removing redundant polypeptide chains whose similarity is greater than a preset threshold (for example, 30%), to obtain a non-redundant protein template library;

[0034] 2.2) Segment the non-redundant protein template into residues of length n by a sliding window;

[0035] 2.3) Through the CBOW model combined with Huffman coding, the residue model is modeled in the neural network, and the representation of the residue in the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a deep learning Residue2vec-based protein structure prediction method. The method comprises the following steps of: giving input sequence information, regarding a known protein structure on a PDB website as a corpus to train, partitioning the proteins with known structures into residues with lengths of n, obtaining the expression of each residue in a vector space through a CBOW model and a Huffman code, and judging the similarities between the residues through calculating the distances between residue vectors, so as to obtain the front N fragment structures on each residue position of a query sequence and then form a fragment library of Residue2vec; carrying out random folding on the query sequence to form an initial conformation; randomly selecting a residue with the length of n, and carrying out dihedral angle replacement on the residue and fragments in the fragment library; and comparing the energy, if the energy is decreased, receiving the conformation, and if the energy is increased, receiving the conformation via a Metropolis criterion and finally obtaining a metastable-state conformation through continuous iteration. According to the method disclosed by the invention, the matching degree and prediction precision in the query sequence are relatively high.

Description

technical field [0001] The present invention relates to the fields of bioinformatics and computer applications, in particular to a protein structure prediction method based on deep learning Residue2vec. Background technique [0002] Protein molecules play a vital role in the process of biological and cellular chemical reactions. Their structural models and bioactive states have important implications for our understanding and cure of many diseases. Only when proteins are folded into a specific three-dimensional structure can they produce their unique biological functions. Therefore, to understand the function of a protein, it is necessary to obtain its three-dimensional structure. [0003] The ab initio prediction method of protein structure needs to solve two basic problems: (1) construct an appropriate energy model to calculate the interaction between different atoms in the protein; (2) develop an effective algorithm to find the global minimum of conformational space ene...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F19/16G06N3/04
CPCG16B15/00G06N3/045
Inventor 张贵军俞旭锋周晓根郝小虎王柳静
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products