Animal mitochondrial genome assembling method based on second-generation data

A technique for assembling animal mitochondrial genomes, applicable to bioinformatics and related fields. It addresses the incomplete assembly results, time-consuming secondary assembly, and limited applicability of existing methods, and offers broad applicability, improved work efficiency, and strong data compatibility.

Pending Publication Date: 2021-05-14
CHENGDU INST OF BIOLOGY CHINESE ACAD OF S

AI Technical Summary

Problems solved by technology

Manually provided seed sequences must be retrieved from a database (for example, NCBI) for the specific species. Because the data in the database are uploaded by researchers themselves, the number of species it covers is limited, so for many species a search may return no result.
Therefore, the requirement to provide a seed sequence manually limits the application of this type of algorithm.
[0006] Algorithms that do not need a seed sequence (MitoZ and Norgal) exploit the higher sequencing depth of mitochondrial data in second-generation data and filter out the low-depth data before assembly. However, this leads to incomplete assembly results and can be very time-consuming because a secondary assembly may be needed.
[0007] In addition to the above problems, the aforementioned algorithms also have data compatibility problems.
In earlier work, the inventors tried software based on the above algorithms on second-generation data containing animal mitochondrial genes, but because of data incompatibility the complete mitochondrial genome could never be extracted.
In addition, the aforementioned algorithms struggle to achieve assembly speed and completeness at the same time.
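
The depth-based filtering described in [0006] can be illustrated with a small sketch. This is not the code of MitoZ or Norgal; it is a minimal toy version that assumes depth can be approximated by k-mer counts, and the function names, `k`, and `min_depth` are hypothetical choices made for the illustration.

```python
from collections import Counter


def kmer_counts(reads, k):
    """Count every k-mer across all reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def filter_high_depth(reads, k=5, min_depth=3):
    """Keep reads whose median k-mer count is at least min_depth.

    Assumption for this sketch: a read's depth is approximated by the
    median count of its k-mers across the whole read set.
    """
    counts = kmer_counts(reads, k)
    kept = []
    for read in reads:
        depths = sorted(counts[read[i:i + k]]
                        for i in range(len(read) - k + 1))
        if depths and depths[len(depths) // 2] >= min_depth:
            kept.append(read)
    return kept


# Four copies of a "high-depth" read and one "low-depth" read.
reads = ["ACGTACGTAC"] * 4 + ["TTGGCCAATG"]
print(filter_high_depth(reads))
```

In this toy input only the four repeated reads survive. A mitochondrial read from a poorly covered region would be discarded in the same way, which is one plausible route to the incomplete assemblies noted above.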

Examples

Embodiment

[0032] This embodiment provides software that extracts animal mitochondrial genomes from second-generation data without requiring a seed sequence. It comprises the following modules:

[0033] 1. A mitochondrial DNA conserved-domain database (profile HMM) built with hidden Markov models, as constructed by the MitoZ authors (A toolkit for animal mitochondrial genome assembly, annotation and visualization. Nucleic Acids Research, 2019, 47(11), e63);

[0034] 2. The SSAKE linear assembly module: assembly code written in C++ and based on a linear algorithm, used to assemble both the mitochondrial seed sequences and the mitochondrial genome. Linear assembly is itself existing technology, and those skilled in the art can implement the module code on that basis.

[0035] The algorithm flow of the animal mitochondrial genome assembly method based on the second generation data in this embodiment is as ...

Abstract

The invention belongs to the technical field of genome sequencing and relates in particular to a method for assembling animal mitochondrial genomes from second-generation data. The method comprises the following steps: first, the full dataset is rapidly screened with a profile HMM to obtain potential mitochondrial coding DNA fragments; next, these fragments are assembled with a linear iterative algorithm and scored with the profile HMM to obtain a seed sequence; finally, mitochondrial DNA reads from the sequencing data are rapidly assembled onto the seed sequence with the linear iterative algorithm until the whole mitochondrial genome is assembled. The method does not require a manually provided seed sequence, assembles the mitochondrial genome rapidly, accurately, and completely, and is compatible with second-generation data across different species, data sizes, and read lengths.
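
The final step, extending a seed with reads until no further read fits, can be sketched as a greedy overlap extension. This is only an illustration of the general idea, not the patented implementation or SSAKE's code. It assumes exact suffix-prefix matches, extends only the 3' end, and ignores reverse complements, sequencing errors, and circularity; `extend_seed`, `min_overlap`, and `max_rounds` are hypothetical names.

```python
def extend_seed(seed, reads, min_overlap=4, max_rounds=1000):
    """Greedily extend `seed` at its 3' end with overlapping reads.

    In each round, pick the unused read whose prefix matches the longest
    suffix of the current contig (at least `min_overlap` bases), append
    the non-overlapping tail of that read, and repeat until no read fits.
    """
    contig = seed
    unused = list(reads)
    for _ in range(max_rounds):
        best = None  # (overlap_length, index into unused)
        for idx, read in enumerate(unused):
            # The read must add at least one new base, hence len(read) - 1.
            max_len = min(len(contig), len(read) - 1)
            for olen in range(max_len, min_overlap - 1, -1):
                if contig.endswith(read[:olen]):
                    if best is None or olen > best[0]:
                        best = (olen, idx)
                    break  # longest overlap for this read found
        if best is None:
            break  # no read extends the contig any further
        olen, idx = best
        contig += unused.pop(idx)[olen:]
    return contig


seed = "ACGTTGCA"
reads = ["TGCAAGGCTT", "GCTTACCGAT", "CGATAG"]
print(extend_seed(seed, reads))
```

Each read is consumed at most once, so the loop stops when the reads run out or when no remaining read overlaps the contig end. A practical tool would also need a stopping rule for a complete genome, which this sketch does not attempt.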

Description

Technical field

[0001] The invention belongs to the technical field of genome sequencing, and in particular relates to an animal mitochondrial genome assembly method based on second-generation data.

Background

[0002] In animals, mitochondrial DNA is an important marker in biological research because of its maternal inheritance, fast evolutionary rate, and lack of recombination. It is widely used in research areas such as phylogeny, evolutionary biology, biodiversity detection, and population genetics. Compared with mitochondrial fragments, the whole mitochondrial genome is a more objective and reliable marker because it is complete and less susceptible to convergence.

[0003] With the continuous updating of next-generation sequencing technology, biological research has entered an era of data explosion, and a considerable amount of second-generation data can be obtained for both model and non-model animals. Mitochondrial...

Application Information

IPC(8): G16B30/20, G16B50/30
CPC: G16B30/20, G16B50/30
Inventor: 李家堂, 宋梦洹, 严超超
Owner: CHENGDU INST OF BIOLOGY CHINESE ACAD OF S