Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and system for quickly comparing whole genome annotation intervals

A whole-genome, annotation technology, applied in bioinformatics, instruments, etc., can solve the problems of processing speed limit of reading and writing, complicated operation, long time-consuming, etc., and achieve the effect of small calculation, simple logic and accurate judgment

Active Publication Date: 2021-09-07
HAINAN UNIVERSITY
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] At present, in the field of pests and diseases or rapid detection, it is often necessary to calculate the relationship between a large number of different annotation fragments in the whole genome. Since the number of genomes of organisms is usually very large, the number of annotation fragments involved is huge. Traditional calculation methods mainly include two , one is to establish a database index and compare it through a specific data structure. For example, in the paper "Building a Populus trichocarpa Gene Database and Realization of a Visualization Platform", it is disclosed that an index is established in a database visualization platform to search for genome fragments. The operation of the first method is relatively complicated, and for very large-scale data, its processing speed will also be limited by the read and write speed of the hard disk; the other method is to search by comparing the intervals of genome annotations. For example, in the paper "Massive Genome Annotation Parallel Optimization and Implementation of Algorithms" discloses that most of the current gene function annotation software is written in scripting language, which analyzes the input sequence one by one, which is inefficient and takes a long time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for quickly comparing whole genome annotation intervals
  • Method and system for quickly comparing whole genome annotation intervals
  • Method and system for quickly comparing whole genome annotation intervals

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] The principles and features of the present invention will be described below in conjunction with the accompanying drawings, and the enumerated embodiments are only used to explain the present invention, and are not intended to limit the scope of the present invention.

[0065] refer to figure 1 , this embodiment provides a method for quickly performing genome-wide annotation interval comparison, and the method includes the following steps:

[0066] S1. Obtain the coordinate set of the segment A of the gene annotation interval and the coordinate set of the segment B of the gene annotation interval respectively.

[0067] Exemplarily, the gene annotation interval A fragment coordinate set and the gene annotation interval B fragment coordinate set are respectively used to store different types of genome annotation interval fragments, each set stores multiple genome annotation interval fragments, and each annotation interval fragment Both include start and end coordinates. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method and a system for quickly comparing whole genome annotation intervals. The method comprises the following steps: S1, respectively obtaining a gene annotation interval A fragment coordinate set and a gene annotation interval B fragment coordinate set; s2, combining the coordinates of the fragment set A and the coordinates of the fragment set B to obtain a combined fragment set; s3, performing redundancy elimination operation and sorting operation on the combined fragment set; s4, calling a preset interval intersection function to calculate the intersection of all the annotation intervals in the merged fragment set in a single cycle; and S5, outputting a calculation result of the preset interval intersection function. According to the method, the intersection of different types of gene annotation intervals can be rapidly calculated, and compared with a traditional comparison method, the method is simple in implementation logic, small in calculation amount, accurate in judgment and beneficial to improvement of comparison efficiency.

Description

technical field [0001] The invention relates to the technical field of genome annotation, in particular to a method for rapidly comparing annotation intervals of the whole genome. Background technique [0002] At present, in the field of pests and diseases or rapid detection, it is often necessary to calculate the relationship between a large number of different annotation fragments in the whole genome. Since the number of genomes of organisms is usually very large, the number of annotation fragments involved is huge. Traditional calculation methods mainly include two , one is to establish a database index and compare it through a specific data structure. For example, in the paper "Building a Populus trichocarpa Gene Database and Realization of a Visualization Platform", it is disclosed that an index is established in a database visualization platform to search for genome fragments. The operation of the first method is relatively complicated, and for very large-scale data, i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G16B50/10G16B50/30
CPCG16B50/10G16B50/30Y02A90/10
Inventor 李志刚缪卫国纪晓贝
Owner HAINAN UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products