Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Re-sequencing pathogen microarray

a pathogen and microarray technology, applied in the field of resequencing pathogen microarrays, can solve the problems of pathogen detection, introduction of a new set of problems, and conventional methods for amplification that do not scale well

Inactive Publication Date: 2006-09-21
THE UNITED STATES OF AMERICA AS REPRESENTED BY THE SECRETARY OF THE NAVY +1
View PDF1 Cites 51 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0349] A very important bioinformatics aspect of the disclosed invention involves the assembly, annotation and selection of pathogen diagnostic targets into database(s) for incorporation into microarray design, as well as the concomitant task of relating detection events on the microarray to such database(s). An advantage of the present invention is that the information contained in the publicly available databases is ever increasing, thus further adding to the robust nature of the present invention. The present invention describes a process of manually selecting pathogen target sequences from the published literature (e.g. GenBank) and / or ascertaining an empirically determined diagnostic target sequence from published literature. The described approach has the advantage that a consortium of scientists, each possessing sufficient “domain expertise” for each of a large number of disparate pathogen species, can provide relevant, pathogen diagnostic sequence information that can be incorporated into an automated array design process without specific regard to specific probe, reagent, amplification, and sample preparation methods.
[0350] In one very preferred embodiment, the requisite domain expertise for each of a large number of unrelated pathogens will be maintained in an up-to-date fashion through a web-portal enabled database. Thus, an extended consortium, comprised of individual researchers of specific pathogens, would be able to provide the latest annotated target sequence information via a “pathogen page” formatted web portal, analogous to the “molecular page” model adopted by the Alliance for Cellular Signaling (AfCS). The AfCS database then maintains an otherwise incomprehensible amount of specific information on thousands of molecules involved in intracellular signaling cascades. In this format, individual researchers without specific knowledge about individual signaling molecules can access detailed parameters that can be used in numerical simulations of signaling events. Thus, in another very preferable embodiment, the annotated target sequence data for individual pathogens is organized into an automated data pipeline in which will impose user-defined design constraints (e.g. number of probe features, number of pathogen targets, the levels of sensitivity and specificity required for array performance, etc.) upon the total information content of a pathogen database, allowing automated, optimal target selection and submission of those targets to a vendor in a format necessary for microarray fabrication.
[0351] In yet another very preferred embodiment, the selected target sequences determined by the previous process will be correlated with the data that is collected in actual use of the microarray, such that metrics for probability and quality can readily used for decision-making. Two preferable approaches for performing such automated pipelining of data and algorithms are VIBE (Visual Integrated Bioinformatics Environment) software (Incogen, Inc., Williamsburg, Va.) and iNquiry (BioTeam, Boston, Mass.) which are representative of a class of integrated bioinformatics environments that could be used to equal effect for the intended purpose. Data Acquisition
[0352] Raw sequence data from the resequencing microarray chips is provided by the Genetic Data Analysis Software version 2.0 (GDAS) packaged with the microarray reader from Affymetrix.
[0353] The Affymetrix resequencing array contains a defined number of probe cells or features. During scanning, the software divides each feature into subunit squares or pixels (3×3 μm). Each feature contains many copies of a unique 25-base oligonucleotide probe of defined sequence, while a series of eight features query a specific site in a known reference sequence. Four features interrogate the sense strand and contain probes that are identical except for the central base which is A, C, G, or T and four features interrogate the anti-sense strand and contain probes that are identical except for the central base which is A, C, G, or T.
[0354] GDAS uses the cell intensity data to make base calls for every base position represented on the resequencing array. Under the manufacturer setting for GDAS, the algorithm uses the intensity data from multiple samples to improve its calling accuracy and assigns a quality score for each call.

Problems solved by technology

One technical challenge for pathogen detection with microarrays arises due to the difficulty in obtaining samples with a sufficient quantity of pathogen nucleic acid.
Unfortunately, conventional methods for this amplification do not scale well in comparison to the number of probes that can be placed on a microarray chip.
Although PCR-based assays are sensitive, accurate, and rapid, these methods also introduce a new set of problems.
Hybridizations patterns allow the detection of single point mutations or substitution / deletion events to a resolution of half probe lengths (e.g. 7-10 bp) but does not allow for exact determination of position(s) or the nature of the mutation.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Re-sequencing pathogen microarray
  • Re-sequencing pathogen microarray
  • Re-sequencing pathogen microarray

Examples

Experimental program
Comparison scheme
Effect test

preparation example 1

RPM Version 1 Chip Design

[0433] DNA sequences were provided to Affymetrix for creation of the resequencing microarray chip (RPM Version 1 chip) utilized in the following examples. Submission of the DNA sequence and instruction files to Affymetrix were in accordance with the manufacturer instructions CustomSeq™ Array Protocol and product literature. Probe lengths were nominally 25-nucleotides long and contained a variable (interrogation point) central nucleotide for each of four possible variants (A, C, T or G) in both the sense and antisense directions.

[0434] The target genes selected for the RPMV1 pathogens listed above are described in the version 1 layout shown in Table 8 and the Sequence Listing along with the respective PCR primers used for amplification of the same. The sequences submitted for tiling and chip fabrication were based on the Affymetrix instruction file summarized in Table 7, which corresponds to the sequences appearing as SEQ ID NOs: 1-58. The corresponding “in...

preparation example 2

PCR Primer Design and Amplification Protocols

Degenerate PCR Primers Design

[0442] The objective of primer selection to support conserved (degenerate) multiplex PCR is to design primers that target the conserved regions flanking species-specific variable regions of E1A, fiber, and hexon genes. In general, this method may be applied to any organism, as conserved sequences within a species are a ubiquitous in nature. These target genes were selected based on their function and location within the linear adenoviral genome. E1A is located at the 5′ end of the adenoviruses genome and encodes a trans-acting transcriptional regulatory factor that is necessary for transcriptional activation of early genes. The hexon and fiber genes, which are located in the middle and 3′ end of the adenovirus genome, encode antigenic determinants ε and γ respectively, which determine the viral serotype. Thus, detection and serotyping of ARD-causing adenoviruses can be effectuated by targeting the nucleic a...

preparation example 3

REPI Software

[0447] Raw sequence data from the resequencing microarray chips is provided by the Genetic Data Analysis Software version 2.0 (GDAS) packaged with the microarray reader from Affymetrix. GDAS base calling is based on a previously described base-calling algorithm (Cutler et al., 2001). Each of the FASTA output files containing the base calls obtained from the GDAS software was analyzed using specialized software (REPI) that the present inventors developed.

[0448] In the case of the present invention, the sequence output of GDAS is most often a scattered mixture of contiguous sequence calls (A, T, C or G) that are interspersed with varying amounts of no-calls (n's) where the GDAS software does not make a base call due to lack of amplification, weak hybridization signal on the chip and / or high background hybridization caused by non-specific binding (Cutler et al., 2001). An example output of the GDAS output for the Adenovirus 4 prototype sample for the Ad4FIBER tile region...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

PropertyMeasurementUnit
Lengthaaaaaaaaaa
Timeaaaaaaaaaa
Lengthaaaaaaaaaa
Login to View More

Abstract

The present invention relates to pathogen detection and identification by use of DNA resequencing microarrays. The present invention also provides resequencing microarray chips for differential diagnosis and serotyping of pathogens present in a biological sample. The present invention further provides methods of detecting the presence and identity of pathogens present in a biological sample.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS [0001] The present application claims priority to U.S. provisional Application Ser. No. 60 / 590,931, filed on Jul. 2, 2004, U.S. provisional Application Ser. No. 60 / 609,918 filed on Sep. 15, 2004, U.S. provisional Application Ser. No. 60 / 631,437 filed on Nov. 29, 2004, U.S. provisional Application Ser. No. 60 / 631,460 filed on Nov. 29, 2004 and U.S. provisional Application Ser. No. 60 / 691,768 filed on Jun. 16, 2005. This application is also related to U.S. non-provisional application Ser. No. ______, titled “Computer-Implemented Biological Sequence Identifier System and Method,” filed along with this application on Jul. 2, 2005. The entire contents of these applications are incorporated herein by reference. STATEMENT REGARDING FEDERALLY FUNDED PROJECT [0002] The United States Government owns rights in the present invention pursuant to funding from the Defense Threat Reduction Agency (DTRA; Interagency Cost Reimbursement Order (IACRO #02-4118), M...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): C12Q1/70C12Q1/68C12M1/34G16B30/10
CPCC12Q1/6874C12Q1/6888C12Q1/689C12Q1/6893C12Q1/701G06F19/22G16B30/00G16B30/10
Inventor AGAN, BRIAN K.HANSON, ERIC H.KRUZELOCK, RUSSELL P.LIN, BAOCHUANROWLEY, ROBB K.SETO, DONALDSTENGER, DAVID A.JOHNSON, JENNIFERTIBBETTS, CLARK J.THACH, DZUNG C.VORA, GARY J.WALTER, ELIZABETH A.WANG, ZHENG
Owner THE UNITED STATES OF AMERICA AS REPRESENTED BY THE SECRETARY OF THE NAVY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products