
Double-ended library tag composition and application thereof in MGI sequencing platform

A tag composition and tag-group technology, applied in the field of plasma DNA library construction, that addresses problems such as sample crosstalk

Active Publication Date: 2020-11-10
NANODIGMBIO (NANJING) BIOTECHNOLOGY CO LTD

AI Technical Summary

Problems solved by technology

[0005] The main purpose of the present invention is to provide a paired-end tag amplification primer composition and its application on the MGI sequencing platform, to solve the problem that the single-end-tagged libraries used on the existing MGI sequencing platform are prone to sample crosstalk.



Examples


Embodiment 1

[0081] Example 1: Library construction scheme 1 and scheme 2

[0082] Specific steps: follow the manual of the NadPrep™ DNA Library Construction Kit (for MGI) (201909 Version 2.0); the only differences are the bubble adapter sequences and the amplification primer sequences.

[0083] (1) Scheme 1:

[0084] Bubble adapter sequences:

[0085] Adapter sequence 1 shown in SEQ ID NO: 769 and adapter sequence 2 shown in SEQ ID NO: 770:

[0086] SEQ ID NO: 769 (31 bp): /phos/agtcggaggccaagcggtcttaggaagacaa;

[0087] SEQ ID NO: 770 (40 bp): ttgtcttcctaacaggaacgacatggctacgatccgact*t.

[0088] Amplification primer 1 shown in SEQ ID NO: 771 and amplification primer 2 shown in SEQ ID NO: 772:

[0089] SEQ ID NO: 771 (64 bp):

[0090] /phos/ctctcagtacgtcagcagttnnnnnnnnnncaactccttggctcacagaacgacatggctacga; wherein the sequence before nnnnnnnnnn (/phos/ctctcagtacgtcagcagtt) is recorded as SEQ ID NO: 793, and the sequence after nnnnnnnnnn (caactccttggctcacagaac gac...
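The stated lengths (31 bp, 40 bp, 64 bp) can be checked against the sequences as printed in [0086] to [0090]. The following Python sketch is only a sanity check, not part of the patent: the helper names `base_count` and `split_at_index` are hypothetical, and the sketch assumes that the `/phos/` prefix and the `*` marker are modifications that do not count as bases.

```python
# Sequences copied from paragraphs [0086]-[0090], with the /phos/ prefix removed.
# The '*' in SEQ ID NO: 770 is kept as printed and ignored when counting
# (assumption: it marks a modification, not a base).
ADAPTER_1 = "agtcggaggccaagcggtcttaggaagacaa"            # SEQ ID NO: 769
ADAPTER_2 = "ttgtcttcctaacaggaacgacatggctacgatccgact*t"  # SEQ ID NO: 770
PRIMER_1 = (
    "ctctcagtacgtcagcagtt"                # part before the index placeholder
    "nnnnnnnnnn"                          # 10-base index placeholder
    "caactccttggctcacagaacgacatggctacga"  # part after the index placeholder
)                                         # SEQ ID NO: 771


def base_count(seq: str) -> int:
    """Count nucleotide letters (a, c, g, t, n), skipping any other symbol."""
    return sum(1 for ch in seq.lower() if ch in "acgtn")


def split_at_index(primer: str, placeholder: str = "n"):
    """Split a primer into (prefix, index placeholder run, suffix)."""
    start = primer.index(placeholder)
    end = primer.rindex(placeholder) + 1
    return primer[:start], primer[start:end], primer[end:]


if __name__ == "__main__":
    for name, seq in [("769", ADAPTER_1), ("770", ADAPTER_2), ("771", PRIMER_1)]:
        print(f"SEQ ID NO: {name}: {base_count(seq)} bases")
    prefix, index_run, suffix = split_at_index(PRIMER_1)
    print(len(prefix), len(index_run), len(suffix))
```

Under these assumptions the counts agree with the lengths given in the text, and the primer splits into a 20-base prefix, a 10-base index placeholder, and a 34-base suffix.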

Embodiment 2

[0120] Example 2: Comparison of data splitting for a 12-sample pool using the 4-balance and 8-balance schemes

[0121] The double-ended labeling scheme can effectively remove crosstalk between samples (also known as label skipping). However, a read can be split out as valid sequencing data only when the labels at both ends are correct, so the requirements on label balance during sequencing are stricter than for single-ended labeling. This application optimizes two sets of schemes, 4-balance and 8-balance. In this embodiment, each scheme is used to sequence a pool of 12 libraries, and the effective resolution rate of each sample is measured under both schemes. The specific experimental steps and information are as follows:

[0122] Specific steps: for library construction, refer to the manual of the NadPrep™ DNA Library Construction Kit (for MGI) (201909 Version 2.0); the only difference is that the single-end index adapter is c...
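The requirement in [0121] that both end labels be correct before a read is assigned can be illustrated with a small Python sketch. This is not the splitting software used in the patent. The function `assign_sample`, the sample names, and the tag sequences are all hypothetical, and exact matching with no mismatch tolerance is an assumption made to keep the example short.

```python
from typing import Dict, Optional, Tuple

# Hypothetical sample sheet: each sample is identified by an expected
# (5' tag, 3' tag) pair. The tags below are made up for illustration
# and are not taken from the patent's sequence listing.
SAMPLE_SHEET: Dict[Tuple[str, str], str] = {
    ("ACGTACGTAC", "TTGCAAGGCT"): "sample_A",
    ("GTCAGTCAGT", "CCATTGGAAC"): "sample_B",
}


def assign_sample(
    tag5: str,
    tag3: str,
    sample_sheet: Dict[Tuple[str, str], str] = SAMPLE_SHEET,
) -> Optional[str]:
    """Return the sample whose expected tag pair matches both observed tags.

    Assumption: exact matching only. A read whose two tags belong to
    different samples (label skipping) matches no pair and yields None.
    """
    return sample_sheet.get((tag5.upper(), tag3.upper()))


if __name__ == "__main__":
    print(assign_sample("ACGTACGTAC", "TTGCAAGGCT"))  # both tags from sample_A
    print(assign_sample("ACGTACGTAC", "CCATTGGAAC"))  # mixed pair, discarded
```

With a single-ended label, the second read would have been assigned to a sample on the strength of one tag alone; requiring the pair is what lets a mixed read be discarded, at the cost of losing any read where either tag is misread.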

Embodiment 3

[0133] Regarding the differences between the 8-balance 48-group tag sequences of this application and the 8-balance 12-group tag sequences provided by Huada Manufacturing (MGI): the 48 groups of this application were designed with the 12 groups provided by Huada Manufacturing in mind, so that the two sets remain compatible when used together on the sequencer. Accordingly, any sequence from the 48 groups of this application differs by 3 bases from any sequence in the 12 groups provided by Huada Manufacturing.
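The base-difference criterion in [0133] corresponds to a Hamming distance between equal-length tags. The Python sketch below shows one way such a check could be written. It reads the 3-base difference as a minimum threshold, which is an assumption. The function names and the short tag lists in the usage lines are hypothetical and are not sequences from the patent or from MGI.

```python
from itertools import product
from typing import Iterable


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a.upper(), b.upper()))


def min_cross_distance(set_a: Iterable[str], set_b: Iterable[str]) -> int:
    """Smallest Hamming distance between any tag in set_a and any tag in set_b."""
    return min(hamming(a, b) for a, b in product(list(set_a), list(set_b)))


def sets_are_compatible(set_a, set_b, min_diff: int = 3) -> bool:
    """True if every cross-set pair differs at min_diff or more positions.

    Assumption: the '3 bases' in the text is treated as a lower bound.
    """
    return min_cross_distance(set_a, set_b) >= min_diff


if __name__ == "__main__":
    new_tags = ["AACCGGTTAC", "TGCATGCATG"]   # made-up example tags
    other_tags = ["AACCGGTTGT"]               # made-up example tag
    print(min_cross_distance(new_tags, other_tags))
    print(sets_are_compatible(new_tags, other_tags))
```

In this toy input the first new tag differs from the other tag at only two positions, so the pair of sets would fail the check.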

[0134] Additionally, other major points of difference are:

[0135] 1. The base composition of the tag sequences of the present invention is more balanced, with a GC content of 40%-60%, whereas the GC content of MGI's tag sequences is 20%-80%;

[0136] 2. The tag sequences of the present invention have been calculated for matching with the linker sequences of ...
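The GC-content window mentioned in item 1 above is straightforward to compute. The following Python sketch assumes the window is inclusive at both ends; the function names and the example tags are hypothetical and do not come from the patent's sequence listing.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def within_gc_window(seq: str, low: float = 0.40, high: float = 0.60) -> bool:
    """True if the GC fraction lies in [low, high].

    The default window mirrors the 40%-60% range stated in the text;
    treating both bounds as inclusive is an assumption.
    """
    return low <= gc_fraction(seq) <= high


if __name__ == "__main__":
    for tag in ["ACGTACGTAC", "AAAAAAAATG", "GGGGCCCCAT"]:  # made-up tags
        print(tag, gc_fraction(tag), within_gc_window(tag))
```

Passing `low=0.20, high=0.80` gives the wider window that the text attributes to MGI's tag sequences.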


Abstract

The invention provides a double-ended tag amplification primer composition and its application on the MGI sequencing platform. The double-ended library tag composition comprises a plurality of library tags at the 5' end and a plurality of library tags at the 3' end. The library tags at the 5' end all have the same length, the library tags at the 3' end all have the same length, and within the composition each base occurs with the same frequency at any given position. Using the optimized double-ended library tags for data splitting addresses the crosstalk that arises during synthesis, experimental handling, and sequencing on the instrument. Because the tag lengths at each end are held equal and the per-position base frequencies are equal, multiple libraries whose double-ended tags are well balanced in base composition can be obtained. When these libraries are pooled and sequenced, the double-ended tags of each library are read with high accuracy, which further improves the effective resolution rate of the libraries.
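The balance condition in the abstract (equal tag lengths, and equal occurrence frequency of each base at the same position) can be expressed as a per-position count. The Python sketch below is one possible reading of that condition, namely that A, C, G, and T each make up exactly one quarter of the tags at every position. The function names and the three-base toy tags are hypothetical.

```python
from collections import Counter
from typing import List, Sequence


def position_base_counts(tags: Sequence[str]) -> List[Counter]:
    """Return one Counter per position, counting the bases seen there."""
    if len({len(t) for t in tags}) != 1:
        raise ValueError("all tags must have the same length")
    return [Counter(column) for column in zip(*(t.upper() for t in tags))]


def is_base_balanced(tags: Sequence[str]) -> bool:
    """True if A, C, G and T are equally frequent at every position.

    Assumption: 'balanced' means each base accounts for exactly a quarter
    of the tags at each position, so the group size must be a multiple of 4.
    """
    n = len(tags)
    return all(
        counts[base] * 4 == n
        for counts in position_base_counts(tags)
        for base in "ACGT"
    )


if __name__ == "__main__":
    balanced = ["ACG", "CGT", "GTA", "TAC"]    # made-up toy tags
    unbalanced = ["ACG", "ACT", "GTA", "TAC"]  # two A's at position 0
    print(is_base_balanced(balanced), is_base_balanced(unbalanced))
```

Under this reading, groups of 4 or 8 tags can each satisfy the condition on their own, which fits the 4-balance and 8-balance naming used elsewhere on the page, though the page itself does not spell out that definition.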

Description

Technical field

[0001] The invention relates to the field of plasma DNA library construction, in particular to a double-ended tag amplification primer composition and its application on the MGI sequencing platform.

Background

[0002] When sequencing on an MGI high-throughput sequencer, multiple samples are sequenced together to increase throughput, so each sample must be tagged with a different index sequence (Index) before sequencing and the data split afterwards. However, the MGI sequencing platform currently uses mostly single-end-tagged libraries. Because of the inherent weakness of the single-end tag (Index), crosstalk between samples occurs easily. Contamination of index adapters or primers during synthesis, experimental handling, and sequencing makes crosstalk unavoidable, so the low-frequency crosstalk between samples needs to be addressed. At present, the best way to solve the problem is to use double-ended tags; the double-ended labeling method can ef...


Application Information

IPC(8): C40B70/00, C12Q1/6806, C12N15/10, C12N15/11, C40B50/06
CPC: C40B70/00, C12Q1/6806, C12N15/1093, C40B50/06, C12Q2525/191, C12Q2535/122, C12Q1/6869, C12Q2525/161, C12Q2531/113, C12Q2537/143
Inventor: 汪彪, 胡玉刚, 郑文莉, 吴强
Owner: NANODIGMBIO (NANJING) BIOTECHNOLOGY CO LTD