Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Antibacterial peptide prediction method and device based on protein pre-training representation learning

A prediction method and pre-training technology, applied in the field of computer identification of antimicrobial peptide components, can solve the problem that the model cannot be used universally

Pending Publication Date: 2021-04-06
XIAMEN UNIV
View PDF0 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] In view of the above-mentioned defects (deficiencies) of the prior art, the object of the present invention is to provide a method for predicting antimicrobial peptides based on natural language processing, to further improve the accuracy and speed of antimicrobial peptide identification and prediction, and to solve the problem that the built model cannot Common questions across datasets

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Antibacterial peptide prediction method and device based on protein pre-training representation learning
  • Antibacterial peptide prediction method and device based on protein pre-training representation learning
  • Antibacterial peptide prediction method and device based on protein pre-training representation learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0033] The antimicrobial peptide prediction method based on protein pre-training characterization learning of the present invention applies the method of pre-training + fine-tuning in the deep learning of natural language processing, including the following steps: first train a language model on a large amount of unmarked corpus, and then Fine-tuning the model using a specific data set with a small amount of data, such as a text classification data set, a named entity recognition data set, etc., usually takes only a short time. Therefore, after pre-training a model, the model can be quickly migrated to any natural language processing task, which helps the model save training time and computing resources. The number of experimentally determined antimicrobial peptides is far less than the number of proteins known to exist and sequenced. Using the antimicrobial peptide sequence to fine-tune the pre-trained model obtained from a large amount of protein data will help to mine riche...

Embodiment 2

[0104] Based on the same inventive concept as the antimicrobial peptide prediction method based on protein pre-training characterization learning in the foregoing embodiment 1, the present invention also provides a computing device, including one or more processors and memories, on which the There is a computer program, and when the program is executed by a processor, it realizes the steps of any one of the aforementioned methods for predicting antimicrobial peptides based on protein pre-training representation learning.

[0105] The computing device in this embodiment may be a general computer, a dedicated computer, a server or cloud computing, all of which are well known in the art.

[0106] Those skilled in the art should understand that the embodiments of the present invention may be provided as methods, systems, or computer program products. Accordingly, the present invention can take the form of an entirely hardware embodiment, an entirely software embodiment, or an embo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an antibacterial peptide prediction method and device based on protein pre-training representation learning; the method comprises the following steps: S1, employing a pre-training strategy to carry out the word segmentation and covering of a label-free protein sequence from a protein database, and obtaining a pre-training representation learning model, carrying out pre-training of two tasks of covering a language model and sentence continuity prediction, capturing expressions of a word level and a sentence level, and helping the model to learn general structural features of a protein sequence; S2, for the antibacterial peptide pre-recognition and prediction task, changing an output layer of a pre-training model, and performing fine adjustment on the model by using an antibacterial peptide data set with a label to generate an antibacterial peptide prediction model; and S3, according to the antibacterial peptide pre-identification and prediction task, adopting an antibacterial peptide prediction model for identification, and outputting a prediction result. Pre-training is applied to the field of antibacterial peptide recognition and prediction, and an efficient antibacterial peptide prediction model is established based on a known antibacterial peptide sequence with small data volume and unbalanced distribution.

Description

technical field [0001] The invention relates to the technical field of computer recognition of antimicrobial peptide components, in particular to a method and device for antimicrobial peptide prediction based on protein pre-training representation learning. Background technique [0002] Due to the abuse of antibiotics, the problem of drug resistance of pathogenic bacteria is becoming more and more serious, which has become a huge threat to human health. Finding new raw materials for antibiotics is an effective way to protect human health. As an important part of the natural immune system of organisms, antimicrobial peptides (AMPs) are a class of small molecule polypeptides that widely exist in natural organisms. They kill target bacteria by destroying cell membranes and interfering with DNA replication and transcription processes. , viruses, fungi, parasites, tumor cells have a certain inhibitory effect. Antimicrobial peptides are regarded as the best substitutes for antib...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16B5/00G16B30/10G16B35/00G16B40/00G06N3/08
CPCG16B30/10G16B35/00G16B5/00G16B40/00G06N3/084
Inventor 刘向荣张悦曾湘祥林剑远赵连敏
Owner XIAMEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products