Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and device for haploid typing and variation detection of diploid genome sequencing fragments

A genome sequencing and diploid technology, applied in genomics, sequence analysis, proteomics, etc., can solve the problems of low sensitivity, short typing region length, poor accuracy, etc., and achieve high accuracy and high sensitivity typing The effect of improving the analysis and mutation detection effect

Active Publication Date: 2021-05-04
SHENZHEN HUADA GENE INST
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Aiming at the problems of poor accuracy, low sensitivity, and short typing region length of the existing diploid genome haploid typing technology, the present invention provides a method and device for haploid typing of diploid genome sequencing fragments , to cluster the sequenced fragments in any region of the diploid genome to distinguish the sequenced fragments corresponding to the two haplotypes to achieve the purpose of haplotype typing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for haploid typing and variation detection of diploid genome sequencing fragments
  • Method and device for haploid typing and variation detection of diploid genome sequencing fragments
  • Method and device for haploid typing and variation detection of diploid genome sequencing fragments

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0089] Capture and sequence the HLA target region on human chromosome 6, and analyze the information of the full-length region of the HLA-DRB1 gene.

[0090] 1) The BGI-YH cell line sample is captured by the HLA chip using the mature and public experimental technology, that is, a DNA library with a length of 10k is constructed, and then sequenced using the PacBio RSII sequencer. And for the same BGI-YH sample, 5 parallel and independent capture, library construction, and sequencing operations were performed. The length distribution of sequenced fragments is as follows figure 2 As shown, the curve is smooth and contains the main peak of the sequenced fragment (subreads) with a length of 2.5k and the more obvious tailing around the length of 5k.

[0091] 2) Use the RS_ReadsOfInsert.xml protocol in SMRT Analysis2.3 to perform CCS correction (circular consensus sequencing) on ​​the bax.h5 file (including polymerase read) of the PacBio offline data to obtain the fastq file. The ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A method and device for haploid typing and variation detection of diploid genome sequencing fragments. The typing method includes: aligning the CCS sequence of the third-generation sequencing fragments to a reference genome to obtain the optimally aligned sequencing fragments, and storing them in a linked list object In the structure; filter each SNP site to obtain the site corresponding to the candidate heterozygous single nucleotide polymorphism, and store it in a dictionary type variable; build a binary tree structure, and each layer of nodes corresponds to the candidate heterozygous single nucleotide polymorphism Assign values ​​to these nodes at the corresponding sites; repeatedly cut out the two sub-nodes connected to the two nodes of each layer until all the SNP information is stored in the binary tree structure, and the hybrid single nucleotide The haplotype information of the acid polymorphism is transferred to the dictionary type variable; look up the SNP information of all objects in the linked list object structure, and distinguish and record the CCS sequences corresponding to the two haplotypes according to the haplotype information. The invention greatly improves the effects of haploid typing and variation detection.

Description

technical field [0001] The invention relates to the technical field of genotyping, in particular to a method and device for haploid typing and variation detection of diploid genome sequencing fragments. Background technique [0002] After Pacific Biosciences released the first commercial third-generation sequencing platform (PacBio RS) in 2011, it successively released PacBio RSⅡ and so on. Third-generation sequencing can generate sequencing fragments (reads) up to 60kb in length. Technical details can be found in the article (Eid, John, et al. Real-time DNAsequencing from single polymerase molecules. Science 323.5910(2009):133-138.). [0003] Taking the human genome as an example, the distance between adjacent heterozygous SNPs is about 1kb-1.5kb. The short sequences obtained by using the second-generation sequencing platform are mostly 100bp, which cannot span most heterozygous single nucleotide polymorphism sites. Statistical analysis and population analysis are require...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G16B30/00G16B20/10
CPCG16B25/00G16B20/00
Inventor 周泽张涛
Owner SHENZHEN HUADA GENE INST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products