Genetic diagnosis using multiple sequence variant analysis

a gene and multiple sequence technology, applied in the field of nucleic acid-based genetic analysis, can solve the problems of complex structure, shaped, and not simple functions of ld, and achieve the effects of improving the accuracy of results and reducing the difficulty of diagnosis

Inactive Publication Date: 2009-04-23
METHEXIS GENOMICS
View PDF2 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0081]Also provided is an article comprising a machine-accessible medium having stored thereon instructions that, when executed by a machine, cause the machine to: obtain a sufficient number of cluster tag polymorphisms from a genomic region of interest for use in genotyping; assess the cluster tag polymorphisms to identify an association between a trait or phenotype and at least one cluster tag polymorphism, wherein identification of the association identifies the cluster tag polymorphism as a marker for the trait or phenotype. Such an article may further have instructions that, when executed by the machine, cause the machine to correlate a cluster tag polymorphism with a trait or phenotype selected from the group consisting of a genetic disorder, a predisposition to a genetic disorder, susceptibility to a disease, an agronomic or livestock performance trait, a product quality trait. In addition, the article may further have instructions that, when executed by the machine, cause the machine to identify the plurality of polymorphisms in the target nucleic acid sequences based on an assay selected from the group consisting of direct sequence analysis, differential nucleic acid analysis, sequence based genotyping, DNA chip analysis and polymerase chain reaction analysis.

Problems solved by technology

Unfortunately, LD is not a simple function of distance and the patterns of genetic polymorphisms, shaped by the various genomic processes and demographic events, appear complex.
Thus, an important analytical challenge is to identify the minimal set of SNPs with maximum total relevant information and to balance any reduction in the variation that is examined against the potential reduction in utility / efficiency of the genome-wide survey.
Any SNP selection algorithm that is ultimately used should also account for the cost and difficulty of designing an assay for a given SNP on a given platform—a particular SNP may be the most informative in a region but it may also be difficult to measure.
The determination of haplotypes from diploid unrelated individuals, heterozygous at multiple loci, is difficult.
Conventional genotyping techniques do not permit determination of the phase of several different markers.
These probabilistic methods all have limitations in accuracy (dependent on the number of SNPs being handled and the size of the population being examined) and scalability.
It should be noted, however, that for example the haplotype block concept remains to be validated, that not all regions of the human genome may fit the concept and / or that the concept may have limited value in other species.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Genetic diagnosis using multiple sequence variant analysis
  • Genetic diagnosis using multiple sequence variant analysis
  • Genetic diagnosis using multiple sequence variant analysis

Examples

Experimental program
Comparison scheme
Effect test

example 1

Intraspecies SPC Map of the Sh2 Locus of Maize

[0321]The present example provides proof of concept that the methods of the present invention can be used to generate an SPC map of a complete gene locus that has been sequenced in a number of individuals of a particular species. Many studies on the genetic diversity of specific genes have been conducted in a broad range of plant and animal species, and these sequences are publicly available from GenBank (http: / / www.ncbi.nlm.nih.gov). In most of these studies relatively short gene segments, less than 1000 bp, have been sequenced and only in a few studies have complete genes been sequenced. From the available complete or near complete gene sequences available in GenBank, the shrunken2 (sh2) locus from maize was chosen to exemplify the different aspects of the invention. The published shrunken2 locus sequences from 32 maize cultivars (Zea mays subsp. mays) comprise a region of 7050 bp containing the promoter and the coding region of the sh...

example 2

Intraspecies SPC Map of the sh1 Locus of Maize

[0329]The present example provides proof of concept that the methods of the present invention can be used to generate an SPC map of a complete gene in which extensive recombination has occurred. This example presents an analysis of the polymorphic sites in the shrunken1 (sh1) locus from maize to exemplify further aspects of the invention. The published shrunken1 locus sequences from 32 maize cultivars (Zea mays subsp. mays) comprise a region of 6590 bp containing the promoter and the coding region of the sh2 gene [Whitt et al., Proc. Natl. Acad. Sci. USA 99: 12959-12962, 2002].

[0330]The sequences for this analysis were retrieved from GenBank (http: / / www.ncbi.nlm.nih.gov) accession numbers AF544100-AF544131. The sequences were aligned to generate a genetic variation table as described in detail in Example 1. The genetic variation table of the sh1 gene comprises 418 polymorphic sites. Because of this very large number of polymorphic sites,...

example 3

Intraspecies SPC Map of the Y1 Locus of Maize

[0333]The present example provides proof of concept that the method of the present invention can be used to generate an SPC map of a locus in which several historical recombination events have occurred. This example presents an analysis of the polymorphisms in the Y1 phytoene synthase locus of maize to exemplify further aspects of the invention. The Y1 phytoene synthase gene, which is involved in endosperm color, was sequenced in 75 maize inbred lines [Palaisa et al., The Plant cell 15: 1795-1806, 2003], comprising 41 orange / yellow endosperm lines and 32 white endosperm lines.

[0334]The sequences for this analysis were retrieved from GenBank (http: / / www.ncbi.nlm.nih.gov) accession numbers AY296260-AY296483 and AY300233-AY300529. The sequences comprise 7 different segments from a region of 6000 bp containing the promoter and the coding region of the Y1 phytoene synthase gene. The individual sequences were aligned to generate 7 genetic varia...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

PropertyMeasurementUnit
nucleic acid sequenceaaaaaaaaaa
nucleic acidaaaaaaaaaa
differential nucleic acid analysisaaaaaaaaaa
Login to view more

Abstract

The present invention is in the field of nucleic acid-based genetic analysis. More particularly, it discloses novel insights into the overall structure of genetic variation in all living species. The structure can be revealed with the use of any data set of genetic variants from a particular locus. The invention is useful to define the subset of variations that are most suited as genetic markers to search for correlations with certain phenotypic traits. Additionally, the insights are useful for the development of algorithms and computer programs that convert genotype data into the constituent haplotypes that are laborious and costly to derive in an experimental way. The invention is useful in areas such as (i) genome-wide association studies, (ii) clinical in vitro diagnosis, (iii) plant and animal breeding, (iv) the identification of micro-organisms.

Description

[0001]The present application claims the benefit of priority of, and is a continuation of U.S. Application No. U.S. application Ser. No. 11 / 312,088 which was filed on Dec. 19, 2005 as a continuation-in-part application of U.S. application Ser. No. 11 / 077,564, which was filed on Mar. 9, 2005. The benefit of these two priority applications is claimed and each of those two applications is incorporated herein by reference in its entirety.FIELD OF INVENTION[0002]The present invention is in the field of nucleic acid-based genetic analysis. More particularly, it discloses novel insights into the overall structure of genetic variation in all living species.BACKGROUND OF THE INVENTION[0003]Variation in the human genome sequence is an important determinative factor in the etiology of many common medical conditions. Heterozygosity in the human population is attributable to common variants of a given genetic sequence, and those skilled in the art have sought to comprehensively identify common g...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(United States)
IPC IPC(8): C12Q1/68G01N33/48G16B30/00G16B20/00G16B20/20G16B30/10G16B40/00
CPCG06F19/18G06F19/24G06F19/22G16B20/00G16B30/00G16B40/00Y02A90/10G16B30/10G16B20/20
Inventor ZABEAU, MARCSTANSSENS, PATRICKGANSEMANS, YANNICK
Owner METHEXIS GENOMICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products