Method for evaluating HRD score based on low-depth WGS
A low-depth WGS calculation method, applied in the fields of genomics, instrumentation, and sequence analysis, that addresses problems of accuracy and cost
Examples
Embodiment 1
[0088] Example 1 Preprocessing of low-depth WGS data
[0089] 1. Use fastp to trim adapters from the reads in the raw low-depth WGS sequencing data;
[0090] 2. Align the trimmed reads to the human reference genome with bwa to obtain a first alignment file in BAM format;
[0091] 3. Recalibrate the base quality scores in the first alignment file with GATK;
[0092] 4. Use Picard to remove duplicate reads from the recalibrated first alignment file, obtaining a second alignment file in BAM format that contains no duplicate reads;
[0093] 5. Divide the whole human genome into consecutive windows of 100 Kbp, in genomic order.
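Step 5 can be sketched in Python as follows. This is only an illustration: the function name `make_windows`, the 0-based half-open coordinates, the choice to keep a shorter final window at the end of each chromosome, and the toy chromosome lengths are all assumptions that the text does not specify.

```python
from typing import Dict, List, Tuple

WINDOW_SIZE = 100_000  # 100 Kbp, as stated in step 5


def make_windows(chrom_lengths: Dict[str, int],
                 size: int = WINDOW_SIZE) -> List[Tuple[str, int, int]]:
    """Return (chrom, start, end) windows, 0-based and half-open.

    Chromosomes are walked in the order given, and windows within a
    chromosome are in coordinate order. The last window of a chromosome
    may be shorter than `size` (an assumption, not stated in the text).
    """
    windows = []
    for chrom, length in chrom_lengths.items():
        for start in range(0, length, size):
            windows.append((chrom, start, min(start + size, length)))
    return windows


# Hypothetical toy lengths, not real chromosome sizes.
toy_lengths = {"chrA": 250_000, "chrB": 100_000}
windows = make_windows(toy_lengths)
```

The position of a window in the returned list corresponds to the window order used in the later examples, shifted by one because Python lists are 0-based.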
Embodiment 2
[0094] Example 2 Construction of DR fragments
[0095] 1) Taking the reads in the second alignment file (the deduplicated BAM file from Example 1) as the basic unit, count the number of reads falling in each window defined in Example 1. This is the read count of the window, recorded as $RC_i$, where $i = 1, 2, 3, \dots$ is the index of the window in genomic order;
[0096] 2) Calculate the GC content of each window and merge adjacent windows with the same GC content into one group. The $j$th group is recorded as $W_j$, the number of windows in group $j$ as $M_j$, and the $k$th window in the $j$th group as $W_{kj}$, where $j, k = 1, 2, 3, \dots$;
[0097] 3) Calculate the median RC of the windows in $W_j$, denoted $RC_j$, and the mean RC over the whole sample to be tested, denoted $RC_p$, then correct $RC_i$ using the following formula:
[0098]
[0099] $i = M_1 + M_2 + M_3 + \dots + M_{j-1} + k$;
[0100] 4) Process the low-depth WGS data of 30 healt...
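The quantities defined in steps 1) to 3) can be sketched in Python as follows. This is a minimal illustration on a single toy contig, not the patent's implementation. It assumes a read is assigned to a window by its start position, that "same GC content" means equal after rounding to an integer percent, and that groups are runs of adjacent windows. All function names and data are hypothetical. The correction formula itself is not reproduced in the text, so the sketch stops at $RC_j$ and $RC_p$.

```python
from itertools import groupby
from statistics import mean, median


def count_reads(read_starts, n_windows, window_size=100_000):
    """RC_i: number of reads whose start falls in window i (0-based here)."""
    counts = [0] * n_windows
    for pos in read_starts:
        idx = pos // window_size
        if 0 <= idx < n_windows:
            counts[idx] += 1
    return counts


def group_by_gc(gc_percent):
    """Merge runs of adjacent windows with equal GC value into groups W_j.

    Returns a list of groups, each a list of 0-based window indices.
    """
    groups = []
    for _, run in groupby(enumerate(gc_percent), key=lambda pair: pair[1]):
        groups.append([i for i, _ in run])
    return groups


def global_index(groups, j, k):
    """1-based window index i = M_1 + ... + M_(j-1) + k for 1-based j, k."""
    return sum(len(g) for g in groups[:j - 1]) + k


# Hypothetical toy data: 4 windows, 8 read start positions.
read_starts = [10, 50_000, 120_000, 130_000, 140_000, 250_000, 310_000, 320_000]
gc_percent = [41, 41, 45, 45]  # integer GC percent per window (assumed rounding)

rc = count_reads(read_starts, n_windows=4)          # RC_i per window
groups = group_by_gc(gc_percent)                    # W_j
rc_j = [median(rc[i] for i in g) for g in groups]   # median RC of each group
rc_p = mean(rc)                                     # mean RC of the sample
```

Because groups are built from consecutive windows, `global_index(groups, j, k)` recovers the genomic window index from the group number and the position within the group, which is the relation stated in [0099].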
Embodiment 3
[0103] Example 3 Calculation of copy number
[0104] Take the median of the DR values in each DR segment as the DR value of that segment, recorded as $DR_q$, and calculate the copy number of the DR segment, denoted $C_q$, using the following formula:
[0105] .
[0106] In this embodiment, calculating the copy number $C_q$ gives a preliminary indication of the underlying cause of the patient's cancer.
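The first part of this calculation, taking the median DR value within each DR segment, can be sketched as follows. The sketch assumes that per-window DR values and segment boundaries are already available from Example 2. The function name, the inclusive 0-based segment bounds, and the toy numbers are hypothetical. The copy number formula is not reproduced in the text, so the conversion from $DR_q$ to $C_q$ is left out.

```python
from statistics import median


def segment_dr_values(dr_by_window, segments):
    """DR_q: median of the per-window DR values inside each DR segment.

    `segments` is a list of (first_window, last_window) pairs,
    0-based and inclusive (an assumed representation).
    """
    return [median(dr_by_window[first:last + 1]) for first, last in segments]


# Hypothetical per-window DR values and two DR segments.
dr_by_window = [1.0, 1.1, 0.9, 0.5, 0.6, 0.4, 0.5]
segments = [(0, 2), (3, 6)]
dr_q = segment_dr_values(dr_by_window, segments)
```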