Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Method and system for constructing models for predicting protein-RNA interaction binding sites

A binding site and protein technology, applied in the field of RNA-protein interaction prediction, can solve the problems of model prediction accuracy, increased calculation time, overfitting, etc.

Active Publication Date: 2020-05-22
INST OF COMPUTING TECH CHINESE ACAD OF SCI
View PDF17 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although its accuracy rate is relatively improved, it greatly increases the complexity of the training model, which easily leads to overfitting, and also greatly increases the calculation time for the model to be trained on the computer.
[0008] In addition, the RNA structure used by the Deepnet-rbp method is the predicted in vitro structure, not from the real in vivo data in the experiment, so it is not enough to capture the real RNA structure information under in vivo conditions, so the prediction accuracy of the model is also affected accordingly

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for constructing models for predicting protein-RNA interaction binding sites
  • Method and system for constructing models for predicting protein-RNA interaction binding sites
  • Method and system for constructing models for predicting protein-RNA interaction binding sites

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] According to one aspect of the present invention (RBPnet), the original data used for training the model not only includes the sequence data of the RNA bound to the protein, but also introduces the structural omics measurement data generated by the RNA structural omics measurement experimental technology, such as DMS-seq structural omics determination data were used as input data for the model. Compared with the existing technologies (such as the models such as Deepnet-rbp mentioned above), the RNA structural omics data based on DMS-seq technology can provide the secondary structure information of RNA in the real cell state in vivo, so the present invention uses it In the study of RNA-protein interaction, it is used to solve the problem that the prediction of RNA structure by software is not accurate and cannot reflect the real RNA structure information in vivo.

[0051] According to one aspect of the present invention (RBPnet), two modules are designed in the processin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method and a system for constructing a model for predicting protein-RNA interaction binding sites. Correspondingly, the invention also provides a method and a system for predicting the protein-RNA interaction binding sites by using the method. A deep learning model is trained using sequence characteristics at and upstream and downstream of an RNA and protein binding site and measured RNA structure characteristics, and the model is used to predict a protein-RNA interaction binding site. In the feature extraction process, a motif acquisition module constructed based on aconvolutional neural network and a context semantic acquisition module constructed based on a recurrent neural network are used. Compared with the prior art, the trained model has remarkable progressin the aspects of judgment accuracy, calculation time and universality of an application platform.

Description

technical field [0001] The present invention relates to RNA-protein interaction prediction technology, in particular to a method and system for constructing a model for predicting protein-RNA interaction binding sites, and corresponding methods for predicting protein-RNA interaction binding sites using the method methods and systems. Background technique [0002] At present, the methods for predicting RNA-protein interactions based on deep learning technology mainly include DeepBind (see Alipanahi, B et al., (2015). Nature Biotechnology 33, 831– [0003] 838.), Deepnet-rbp (see Zhang, S et al., (2016). Nucleic Acids Res 44, e32– [0004] e32) and iDeepE (see Pan, X et al., (2018). Bioinformatics 34, 3427–3436), etc. [0005] In the prior art DeepBind method, the model structure uses a convolutional neural network and is trained based on RNA sequence data. Due to the earlier time and the simple structure of the model, the modeling ability is insufficient and the accuracy r...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16B20/00G16B5/00G06N3/04G06N3/08
CPCG16B20/00G16B5/00G06N3/08G06N3/045Y02A90/10
Inventor 吴杨杨瑞赵屹
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products