Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for Determining Sequencing Digestion Combination in Sequencing Genotyping Technology

A genotyping and genome technology, applied in the direction of biochemical equipment and methods, microbial measurement/inspection, etc., can solve the problems of affecting the efficiency of enzyme digestion, reducing the density of SNP mining, increasing the cost of experiments, etc.

Active Publication Date: 2018-11-20
CHINA AGRI UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, different simplified genome sequencing methods are quite different in terms of library construction strategy, single-enzyme digestion / double-enzyme digestion combination selection, and sequencing platform selection, which will significantly affect the efficiency and cost of subsequent typing
For example, in the RAD sequencing method, the library construction strategy is complicated, and too many steps will interfere with the results of subsequent experiments; the frequency and distribution of different restriction endonucleases on the genomes of different species are quite different. Species, which enzyme to use for experiments becomes the decisive factor in determining the number and cost of SNPs obtained in experiments; 2b-RAD technology uses type IIB restriction endonucleases, although 2b-RAD technology can obtain enzyme-digested fragments at the whole genome level, but The size of this enzyme-digested fragment is only 25-35bp. According to the average frequency of genome-wide variation, too short enzyme-digested fragments are difficult to enrich SNP sites, which will cause a large loss of sequencing data; When comparing the repetitive regions of the genome, a large number of alignment errors will be brought about, and it will also strongly interfere with the typing accuracy of SNPs, thereby seriously interfering with downstream applications
When analyzing SNP marker sites, single enzyme digestion is not conducive to the screening of enzyme fragments in subsequent experiments, while some double enzyme digestion combinations have the disadvantage of too many or too few enzyme fragments, and too many enzyme fragments will increase the efficiency of the experiment. Cost, too few enzyme fragments will reduce the density of SNP mining, which will affect the subsequent biological analysis, and some enzyme digestion combinations will affect the efficiency of enzyme digestion due to genome methylation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for Determining Sequencing Digestion Combination in Sequencing Genotyping Technology
  • Method for Determining Sequencing Digestion Combination in Sequencing Genotyping Technology
  • Method for Determining Sequencing Digestion Combination in Sequencing Genotyping Technology

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0099] Example 1 uses the bovine genome to illustrate the analysis method provided by the present invention.

[0100] 1. Prediction of the number of digested fragments

[0101]Write the perl script Site_predict.pl as follows, and the input files are the chromosome name and sequence of the bovine genome and the cleavage site sequence of the predicted enzyme. The running command is: perl Site_predict.pl. The genome sequence of cattle is downloaded from Ensembl, the version number is:

[0102]

[0103]

[0104] After obtaining the result of single restriction enzyme digestion, it is necessary to process the simulation results of restriction enzymes combined in pairs to obtain the simulation results of the simultaneous action of two enzymes. Taking EcoR I and Msp I as examples, the commands are as follows:

[0105] less-S ecor1.pos|awk'OFS="\t"{print$1,$2,"1"}' less-S>ecor

[0106] less-S msp1.pos|awk'OFS="\t"{print$1,$2,"2"}'|less-S>msp

[0107] cat ecor msp|sort -k1,1...

Embodiment 2

[0186] The inventor has carried out experiments on sheep, an important agricultural economic animal. The blood samples of three Dorper sheep were collected, and the experimental procedure was the same as in Example 1, and the enzyme digestion and SNP of the sheep genome were analyzed. The actual digestion results of each enzyme digestion combination are shown in Table 6.

[0187] Table 6

[0188] Sheep Digestion Combination

[0189] From the analysis in the above table, it can be seen that the number of SNPs obtained by the enzyme digestion combination of EcoR I-Msp I is relatively moderate (this experiment is 3 individuals, and the number of SNPs for large-scale group experiments will increase), and the enzyme digestion will not be affected by formazan. Due to the influence of modification such as kylation, the restriction fragments were evenly distributed, so the combination of EcoR I-Msp I was selected as the restriction combination used in the analysis of sheep ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a determining method for sequencing enzyme digestion combination in a sequencing genotyping technology. The determining method comprises the following steps that 1, restriction enzyme digestion site predicting is performed on a target genome, and the number of enzyme digestion segments obtained through different enzyme digestion modes is counted; 2, a joint sequence and a PCR amplification primer sequence at the two ends of each enzyme digestion segment are designed according to the predicted enzyme digestion segments in the various enzyme digestion modes in the step 1; 3, sequencing libraries are constructed through a GBS technology for the different enzyme digestion modes; 4, sequencing is performed through the sequencing libraries constructed in the step 3; 5, SNP marker sites are obtained according to sequencing results; 6, the specific enzyme digestion combination for the target genome is determined according to the number of the SNP marker sites and the enzyme digestion segment sizes which are obtained through different enzyme digestion combinations.

Description

technical field [0001] The invention relates to the field of biotechnology, in particular to a method for determining a combination of sequencing restriction enzymes in sequencing genotyping technology. Background technique [0002] Genetic molecular markers (measurable and inheritable polymorphisms among different individuals within one or more populations) firmly occupy the core position of modern genetics and are also important for disciplines such as population genetics, ecology and developmental biology. research direction. The current mainstream genetic markers have been developed to the third generation, namely single nucleotide polymorphisms (Single Nucleotide Polymorphisms, SNP) molecular markers. This kind of genetic marker is characterized by the substitution of a single base, and generally consists of only two bases. It is a dimorphic marker, and the difference in length between the first-generation RFLP and the second-generation STR is used as a genetic marker....

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): C12Q1/6858
CPCC12Q1/6869C12Q2525/191C12Q2563/143C12Q2521/301
Inventor 胡晓湘王宇哲曹学敏李宁
Owner CHINA AGRI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products