Method for identifying DNA enhancer element based on sequence frequency information

A method that combines sequence information and frequency information, applied in the field of functional element prediction algorithms, to achieve high prediction accuracy and good prediction performance

Pending Publication Date: 2021-11-02
TIANJIN UNIV

AI Technical Summary

Problems solved by technology

[0004] The present invention solves the problem of identifying DNA enhancer elements and predicting their strength, and comprises the following steps in turn:



Examples


Embodiment Construction

[0039] The present invention is described in detail below in conjunction with the accompanying drawings.

[0040] The enhancer involved in the present invention is a short DNA fragment that regulates the transcription level by recruiting transcription factors, forming a transcription complex, and binding to the promoter site during transcription. Predicting the enhancers in a DNA sequence helps researchers in the biological field find the causes of abnormally elevated transcription levels, and enhancers of different strengths make it possible to regulate transcription levels programmably. At present, enhancer identification relies mainly on biological experiments, which are often time-consuming and laborious; by contrast, predicting enhancers with machine learning methods is easier and faster.

[0041] The basic idea of the invention is to extract the position-specific information of the enhancer seque...
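The paragraph above is cut off before the details, so the following is only a minimal sketch of position-specific trinucleotide propensity (PSTNP) as it is commonly defined: for every start position, the frequency of each trinucleotide in the positive set minus its frequency in the negative set. The function names (`position_frequencies`, `pstnp_table`, `pstnp_encode`) and the toy sequences are illustrative assumptions, not taken from the patent, and all sequences are assumed to have equal length.

```python
from itertools import product

# All 64 trinucleotides over the DNA alphabet.
TRINUCLEOTIDES = ["".join(p) for p in product("ACGT", repeat=3)]


def position_frequencies(seqs):
    """freq[j][t] = fraction of sequences whose trinucleotide at start position j is t."""
    n_pos = len(seqs[0]) - 2  # number of trinucleotide start positions
    freq = [dict.fromkeys(TRINUCLEOTIDES, 0.0) for _ in range(n_pos)]
    for seq in seqs:
        for j in range(n_pos):
            tri = seq[j:j + 3]
            if tri in freq[j]:  # ignore 3-mers containing N or other symbols
                freq[j][tri] += 1.0 / len(seqs)
    return freq


def pstnp_table(positives, negatives):
    """Per-position difference between positive-set and negative-set frequencies."""
    f_pos = position_frequencies(positives)
    f_neg = position_frequencies(negatives)
    return [
        {t: fp[t] - fn[t] for t in TRINUCLEOTIDES}
        for fp, fn in zip(f_pos, f_neg)
    ]


def pstnp_encode(seq, table):
    """Encode one sequence as a vector with one propensity value per position."""
    return [table[j].get(seq[j:j + 3], 0.0) for j in range(len(table))]


# Toy data (hypothetical): two "enhancer" and two "non-enhancer" sequences.
positives = ["ACGTA", "ACGTT"]
negatives = ["TTTTT", "ACGAA"]
table = pstnp_table(positives, negatives)
features = pstnp_encode("ACGTA", table)
```

Each sequence of length L becomes a vector of L - 2 values, so the feature dimension grows with sequence length, which is one reason a later feature-screening step is useful.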



Abstract

The invention discloses a method for identifying DNA enhancers based on sequence frequency information. The method relies on a two-layer DNA enhancer prediction model built with a support vector machine, and the prediction model is generated by the following steps: step (1), constructing a DNA enhancer sequence data set from the chromatin database information of a cell line; step (2), processing the DNA enhancer sequence data set with the PSTNP algorithm to obtain position-specific trinucleotide sequence information for the DNA enhancers; step (3), optimizing this trinucleotide sequence information with a Kullback-Leibler divergence algorithm; step (4), reducing the dimensionality of the trinucleotide sequence feature data with the LASSO algorithm. The invention addresses the prediction of DNA enhancers and of their strength; the extracted sequence frequency information is improved through feature optimization and feature screening, and the prediction accuracy is markedly improved.
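As a rough illustration of how the later steps could fit together, the sketch below uses scikit-learn to chain LASSO-based feature screening with a support vector machine, and stacks two such classifiers: one for enhancer versus non-enhancer, one for strong versus weak. Everything here is an assumption made for illustration: the class name `TwoLayerEnhancerModel`, the `alpha` value, the RBF kernel, and the reading of "two-layer" as identification followed by strength prediction. The abstract does not say how the Kullback-Leibler divergence is applied to the features, so `kl_divergence` is shown only as the standard formula for two discrete distributions and is not wired into the pipeline.

```python
import numpy as np
from sklearn.feature_selection import SelectFromModel
from sklearn.linear_model import Lasso
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


def kl_divergence(p, q, eps=1e-9):
    """D_KL(p || q) for two discrete distributions; eps avoids log(0)."""
    p = np.asarray(p, dtype=float) + eps
    q = np.asarray(q, dtype=float) + eps
    p, q = p / p.sum(), q / q.sum()
    return float(np.sum(p * np.log(p / q)))


def make_layer(alpha=0.01):
    """One layer: scale, keep features with non-zero LASSO weight, then SVM."""
    return make_pipeline(
        StandardScaler(),
        SelectFromModel(Lasso(alpha=alpha)),
        SVC(kernel="rbf"),
    )


class TwoLayerEnhancerModel:
    """Hypothetical two-layer model: layer 1 detects enhancers, layer 2 rates strength."""

    def __init__(self, alpha=0.01):
        self.layer1 = make_layer(alpha)
        self.layer2 = make_layer(alpha)

    def fit(self, X, is_enhancer, is_strong):
        X = np.asarray(X, dtype=float)
        is_enhancer = np.asarray(is_enhancer).astype(bool)
        is_strong = np.asarray(is_strong).astype(bool)
        self.layer1.fit(X, is_enhancer.astype(int))
        # Layer 2 is trained on enhancer samples only.
        self.layer2.fit(X[is_enhancer], is_strong[is_enhancer].astype(int))
        return self

    def predict(self, X):
        X = np.asarray(X, dtype=float)
        labels = np.full(len(X), "non-enhancer", dtype=object)
        hit = self.layer1.predict(X).astype(bool)
        if hit.any():
            strong = self.layer2.predict(X[hit]).astype(bool)
            labels[hit] = np.where(strong, "strong enhancer", "weak enhancer")
        return labels
```

In practice the input matrix `X` would hold one PSTNP feature vector per sequence, and `alpha` would be tuned by cross-validation, since a value that is too large makes LASSO discard every feature.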

Description

Technical field

[0001] The invention belongs to the field of functional element prediction algorithms in bioinformatics, and in particular relates to a method for identifying DNA enhancer elements based on sequence frequency information.

Background

[0002] Transcription is the first and a crucial step in gene expression, and it is controlled by regulatory elements such as promoters and enhancers. An enhancer is a short DNA sequence (50-1500 bp) that can recruit transcription factors and their complexes, thereby increasing the likelihood that a given gene is transcribed. Predicting the enhancers in a DNA sequence helps researchers in the biological field find the causes of abnormally elevated transcription levels, and enhancers of different strengths make it possible to regulate transcription levels programmably. The predictive classification of enhancers therefore has important practical significance. However, because enhancers act in cis...


Application Information

IPC(8): G16B30/00, G16B40/00, G16B50/00, G06K9/62
CPC: G16B30/00, G16B40/00, G16B50/00, G06F18/213, G06F18/2411
Inventor: 郭菲, 吕一诺, 何文颖, 唐继军, 曹晶
Owner TIANJIN UNIV