Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2007;8(7):R133.
doi: 10.1186/gb-2007-8-7-r133.

Strategy for encoding and comparison of gene expression signatures

Affiliations
Comparative Study

Strategy for encoding and comparison of gene expression signatures

Yajun Yi et al. Genome Biol. 2007.

Abstract

EXALT (EXpression signature AnaLysis Tool) is a computational system enabling comparisons of microarray data across experimental platforms and different laboratories http://seq.mc.vanderbilt.edu/exalt/. An essential feature of EXALT is a database holding thousands of gene expression signatures extracted from the Gene Expression Omnibus, and encoded in a searchable format. This novel approach to performing global comparisons of shared microarray data may have enormous value when coupled directly with a shared data repository.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Schematic representation of EXALT system. A data flow diagram of computational steps is shown on the left, illustrating input of subject (Gene Expression Omnibus [GEO] dataset [GDS]) and query datasets, extraction of gene expression signatures, comparison with signature database, and generation of reports. Representative EXALT outputs are illustrated on the right, with three report levels including gene alignment, signature matches, and dataset matches. Reports are coded in HTML and include hypertext links (underlined blue text) to publicly accessible data sources or to other report levels. Some special terms were used in the gene alignment report. 'Total score' is the sum of positive identity score (PIS) and negative identity score (NIS). 'Concordance Identity Avg' is the average PIS per signature gene. 'Discordance Identity Avg' is the average NIS per signature gene. 'Concordant Similarity' is the number of concordant genes expressed as a percentage of the total number of genes in the signature. 'Discordant Similarity' is the number of discordant genes expressed as a percentage of the total number of genes in the signature. 'Alignment' refers to the list of query and matched subject triplets. AESP, array expression signature pipeline; EXALT, EXpression signature AnaLysis Tool; SigDB, signature database.
Figure 2
Figure 2
Significance trends among matching datasets. Six representative query datasets from the 124 self-matching results were compared using EXALT with all Gene Expression Omnibus (GEO) records. Corresponding adjusted mean P values for each dataset match are plotted on a log10 scale. The match number reflects the rank order of adjusted mean P values for each query dataset with one representing the best match. Self-matches exhibited the lowest adjusted mean P value for all query datasets and were ranked 1 in each search. EXALT, EXpression signature AnaLysis Tool; GDS, GEO dataset.
Figure 3
Figure 3
Cross-platform dataset matching revealed by EXALT. Heat-map illustrating the statistical significance among NCI-60 gene expression profiling experiments. The datasets include those reported by Staunton and coworkers [13] (dataset A), Butte and colleagues [12] (dataset B), Ross and coworkers [14] (dataset C), and GDS89. Datasets A and B were generated using Affymetrix arrays, whereas datasets C and GDS89 were generated using spotted cDNA arrays. Dataset comparisons exhibited adjusted mean P values below 0.01 (corresponding to values > 2.0 on the -log10 scale). The greatest significance levels were observed for self-matching and matches between studies using the same microarray platform. EXALT, EXpression signature AnaLysis Tool; GDS, Gene Expression Omnibus dataset.

Similar articles

Cited by

References

    1. Ball CA, Awad IA, Demeter J, Gollub J, Hebert JM, Hernandez-Boussard T, Jin H, Matese JC, Nitzberg M, Wymore F, et al. The Stanford Microarray Database accommodates additional microarray platforms and data formats. Nucleic Acids Res. 2005;33:D580–D582. doi: 10.1093/nar/gki006. - DOI - PMC - PubMed
    1. Edgar R, Domrachev M, Lash AE. Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002;30:207–210. doi: 10.1093/nar/30.1.207. - DOI - PMC - PubMed
    1. Barrett T, Suzek TO, Troup DB, Wilhite SE, Ngau WC, Ledoux P, Rudnev D, Lash AE, Fujibuchi W, Edgar R. NCBI GEO: mining millions of expression profiles--database and tools. Nucleic Acids Res. 2005;33:D562–D566. doi: 10.1093/nar/gki022. - DOI - PMC - PubMed
    1. Barrett T, Troup DB, Wilhite SE, Ledoux P, Rudnev D, Evangelista C, Kim IF, Soboleva A, Tomashevsky M, Edgar R. NCBI GEO: mining tens of millions of expression profiles - database and tools update. Nucleic Acids Res. 2007;35:D760–D765. doi: 10.1093/nar/gkl887. - DOI - PMC - PubMed
    1. EXALT web server http://seq.mc.vanderbilt.edu/exalt/

Publication types

LinkOut - more resources

  NODES
twitter 2