Genome structure variation distribution detection method and detection device
A technology for distribution detection and structural variation, applied in genomics, sequence analysis, proteomics, etc., can solve problems such as limited size, data congestion, and inability to distinguish overlapping events in rainfall maps
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0023] Such as figure 1 As shown, in this embodiment, a detection method for the variation distribution of genome structure is proposed, and the method includes the following steps: S1, acquisition and filtering of genome sequencing data; S2, calculation of the distance between adjacent mutations; S3, utilization of analysis The Piecewise Constant Fitting (PCF) algorithm segmented the genome; S4, visualized the distribution of variation along the genome.
[0024] In some specific embodiments, S1, the specific steps of obtaining and filtering genome sequencing data are as follows: Two formats of cancer genome sequencing files (VCF and MAF) can be obtained, including variants '#CHROM', 'POS', The information of 'REF', 'ALT', 'FILTER'; filter the variation according to the column 'FILTER' in the file, and extract the variation data corresponding to the column 'FILTER' as "PASS".
[0025] In some specific embodiments, the specific steps in S2, calculating the distance between adj...
Embodiment 2
[0048] Such as figure 2 As shown, in this embodiment, a detection device for genome structure variation distribution based on high-throughput sequencing technology is proposed, which is characterized in that it has an input module, a calculation module, a genome segmentation module, and a visualization module;
[0049] A: Input module, which contains two file reading units including VCF (Variant Call Format) unit and MAF (Mutation Annotation Format) unit;
[0050] B: Calculation module, which sorts the mutations according to the genome coordinates and calculates the distance between adjacent mutations, and outputs new mutation coordinates;
[0051] C: Genome segmentation module, which uses the Piecewise Constant Fitting (PCF) algorithm to segment the genome, and outputs the position of the segmented segment and the number of variations contained therein;
[0052]D: Visualization module, which shows the distribution of variation along the genome.
[0053] In some specific em...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com