Generic genome construction method and corresponding structural variation mining method thereof
A technology for structural variation and construction methods, applied in the fields of genomics, proteomics, instruments, etc., can solve the problems such as the difficulty of widespread application of pan-genome, the huge requirement of computing resources, and the difficulty of direct processing and analysis, and achieve large-scale accurate structural variation. Effectiveness of analysis, reduction of computational resource requirements, accurate structural variant analysis and identification
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0034] Construction of rice pan-genome:
[0035] Taking the rice Nipponbare genome (IRGSP1.0, downloaded from https: / / rapdb.dna.affrc.go.jp / website, the genome sequence file is Nipponbare.fasta), L32 and P106 are complete assemblies of two rice varieties The genome and genome sequence files are L32.fasta and P106.fasta, respectively.
[0036] The pan-genome is constructed as follows:
[0037] 1) Generate a file named location.lg, the file information is as follows:
[0038] Mummer= / home / lfp / soft / mummer-4.0.0beta2 /
[0039] Lastz= / home / lfp / soft / lastz / src /
[0040] svmu= / home / lfp / soft / svmu /
[0041] bowtie2= / home / lfp / miniconda3 / bin /
[0042] Samtools= / home / lfp / miniconda3 / bin /
[0043] ref=Nipponbare.fasta
[0044] query=L32.fasta, P106.fasta
[0045]This file is used to set the location of executable files of Mummer, Lastz, svmu, bowtie2 and Samtools software, and is used for calling during operation. Set ref (reference genome) to Nipponbare.fasta, query (comparison ge...
Embodiment 2
[0064] A mining method for genome structure variation based on Illumina next-generation sequencing data and pan-genome:
[0065] a) Taking the pan-genome sequence constructed in Example 1 as the reference genome, and using the Illumina sequencing data of rice material R91, the genome structure variation analysis and identification of R91 was carried out. Use the comparison software Bowtie2 to compare the R91 sequencing data to the pan-genome and generate a comparison file; it was found that the data ratio of the original reference genome (Nipponbare) on R91 was only 82.52%, while the data ratio on the pan-genome was compared Reached 93.25%. It proves that the pan-genome constructed in Example 1 is more complete and representative than the original reference genome (Nipponbare), can significantly improve the efficiency of sequencing data comparison, and provide an important data basis for capturing more structural variations;
[0066] b) According to the variation information ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com