Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

BP neural network based protein secondary structure prediction method

A BP neural network, secondary structure technology, applied in the field of biological information, can solve the problem of low accuracy of protein secondary structure prediction

Inactive Publication Date: 2016-07-06
HUNAN UNIV OF TECH +1
View PDF0 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present invention aims at the problem that the prediction accuracy of protein secondary structure is low and the defects of BP neural network, and the learning process of the network is improved, and provides a BP neural network training and prediction method for protein secondary structure prediction

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • BP neural network based protein secondary structure prediction method
  • BP neural network based protein secondary structure prediction method
  • BP neural network based protein secondary structure prediction method

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment approach 1

[0024] Specific implementation mode one: the following combination figure 1 , figure 2 , image 3 This embodiment will be specifically described.

[0025] Step 1. Select a set of protein structure data from the PDB in which the α-helix, β-sheet and coil structures account for a normal proportion to form a training sample set. The proportions of the three types of structures are all between 20% and 40%, and the coil structure is more;

[0026] Step 2.1: Input the amino acid sequence information of the encoded protein by using six digits. Among them, a five-dimensional binary vector is used to represent the amino acid type, and the remaining one is used to represent the offset of the residue relative to the center position of the window. The closer to the center position, the greater the impact on the structure of the residue to be predicted, and the greater the value of this bit; the farther the distance, the smaller the value of this bit. Assuming an offset of n from the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the field of protein secondary structure prediction methods, relates to a BP neural network training and prediction method used for protein secondary structure prediction and solves the problem of bad prediction effect of the protein secondary structure. The BP neural network training and prediction method comprises the steps of firstly selecting a group of training sample sets with [alpha]-helix, [beta]-sheet and coiling structures accounting for normal proportions from PDB, coding an amino acid sequence of a protein and regarding the coded amino acid sequence of the protein as a network input, and regarding a secondary structure of the corresponding amino acid as a network output; optimizing based on a gradient method, introducing a learning rule which is attached with a momentum item and a self-adaptive learning rate to avoid an oscillation phenomenon and prevent from being trapped in a local minimum value; adopting a six-bit input coding way and a sliding window technology in an input layer, setting a hidden layer structure based on an experience formula and the size of a sliding window; and outputting and predicting classification of the protein secondary structure by an output layer based on a DSSP algorithm.

Description

technical field [0001] The invention relates to the field of biological information, in particular to a method for predicting protein secondary structures. Background technique [0002] The high-level structure of a protein determines its biological function, and the secondary structure of a protein is determined by its amino acid sequence characteristics, which is the basis for predicting the high-level structure. The protein secondary structure mainly uses amino acid sequence information to predict the secondary structure of the protein through experimental analysis or statistical methods. When the prediction accuracy reaches more than 80%, the spatial structure of the protein can be described more accurately. With the development of DNA analysis and sequencing technology, a large amount of protein sequence information has been obtained through the derivation and analysis of DNA, but the protein structure obtained through traditional experimental and statistical methods is...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F19/16G06N3/08
CPCG06N3/084G16B15/00
Inventor 傅娟汤达祺汤德佑
Owner HUNAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products