PharmaDD Top News: Business, Technology, Strategic Briefings - Tracking leading techniques and approaches in therapeutic drug discovery and development

 

Sponsored Links:
Prescription Drug Addiction

 

 

Pharmaceutical Discovery, Jun 1, 2005 
Compound Management: Integrating Chemistry, Biology and Technology in the Modern Drug Discovery Environment
Michael J. Sofia, Jay M. Stevenson, John Houston

Microarray Probes That Mask Differential Expression
This column discusses types of malperforming probes, illustrates their impact on identifying differentially expressed genes and describes a simple method for handling them when performing data analysis of Affymetrix microarrays.
Wendell Jones
Pharmaceutical Discovery

 

Wendell Jones
Several years ago, when commercial microarrays first were becoming widely accepted as a discovery platform for understanding critical aspects of biology at the molecular level, the actual content of the physical probes on the arrays was not publicly known. Affymetrix (Santa Clara, California, USA) was one of the first major microarray providers to fully disclose probe content. Most recently, in April 2005 Agilent Technologies (Palo Alto, California, USA) announced that they would be disclosing the detailed probe sequence information of their microarrays. In parallel, several articles have been published recently on cross-platform comparisons (1–4) with positive results, and it is hoped that the disclosure of the detailed microarray content by the major microarray providers will improve efforts related to cross-platform reproducibility.

Knowledge of microarray probe content also enables more refined analysis of a specific platform. Unlike many other microarray platforms, Affymetrix GeneChips contain multiple distinct 25-mer oligonucleotide probes per transcript. They are arranged in pairs of perfect match (PM) probes that are complementary to the target sequence and mismatch probes (MM) probes, which contain a single base-pair mismatch and act as negative controls. One expression value is calculated for each transcript associated with a particular probe set using the probe intensity values of the PM and MM probes. The more recent Affymetrix designs, such as the HG-U133 family, typically contain 11 PM and 11 MM probes per probe set. In general, the multi-probe design provides a degree of robustness to probe-specific effects and resistance to minor defects in the hybridization image as these probes are scattered throughout the surface of the chip.

 

Steve Casey
Because each probe in a probeset has a unique sequence and distinct hybridization characteristics, not all probes in the same probeset hybridize equally well to their intended target — a characteristic termed probe affinity. In the differential expression context, the probe affinity effect can be reduced to a great degree with appropriate algorithms, especially ones that attempt to model probe affinity or remove its effect directly. However, studies at Expression Analysis (Durham, North Carolina, USA) and other institutions have discovered several situations where certain probes in a given probeset perform so poorly that they have a negative impact on the analysis of differential expression. We term these probes as malperforming probes.

Malperforming Probes There are several possible reasons for malperforming oligonucleotide probes, including:

  • the probe is based on poor quality sequence data from the respective Unigene build (or some other source is used as a reference for the synthesized sequence)
  • the probe has undesirable hybridization properties, such as a strong affinity to non-specific targets or a tendency to not hybridize to any target, including its intended target
  • the probe is not constructed to full length
  • the probe or its target have a tendency to form secondary structures that hinder hybridization
  • the probe is 5' biased while Affymetrix chip architecture and protocols are heavily 3' biased.

 

Gautier et al. (5) and Zhang et al. (6) have compared Affymetrix's human U95 and U133 probe sequence with reference sequences from RefSeq, a curated sequence database available from the national Center for Biotechnology Information (NCBI, Bethesda, Maryland, USA) as well as other sources. For example, Gautier et al., found that, in one experiment, only 61 out of 163 Affy probe sets that were significantly different between conditions and that were associated with reference sequences still would be intact (that is, composed of all 11 PM probes), given our current knowledge of the transcripts. This implies that many probe sets have one or more probes that are based on earlier, poorer-quality sequences. Since the content of these chips was designed more than four years ago, this finding is not completely surprising. When Expression Analysis examined many of these types of sequences in detail, we frequently found that the reference sequence used as a basis for the probeset had been updated several times in recent years.

In addition to comparing sequence information, Expression Analysis also has analyzed thousands of HG-U133A hybridizations from a large variety of mRNA source materials (multiple human tissues and cell lines) and found that approximately 70,000 of the 240,000+ PM probes on the HG-U133A array may be classified as malperforming, primarily due to reasons a) and b) described earlier. The signal intensity of malperforming probes typically did not vary or varied only slightly under very different conditions when compared to their within-probeset probe cohorts. Probe invariance generally obscures differential expression when different transcript abundance levels are present.

 

Figure 1. Sequence alignment for reference sequence NM_000918 and the sequence basis for Affy probeset 200654_at, where the actual probes are shown in red.
Examples To illustrate malperforming probes with a concrete example, consider the probeset denoted by Affymetrix as 200654_at, which is present on both the HG-U133A and HG-U133 Plus 2.0 array. This probeset is annotated as P4HB, procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), beta polypeptide (protein disulfide isomerase; thyroid hormone binding protein). Figure 1 shows a sequence alignment of the Reference Sequence for the P4HB (NM_000918) with the corresponding Unigene 133 sequence (gb:J02783.1) on which the 200654_at Affymetrix probeset was originally based. Probe locations are indicated by number and the NM_000918 sequence is on the top. As Figure 1 indicates, probes 1–7 from this probeset show excellent alignment with the reference sequence while probes 8–11 show little or no alignment at all. A BLAST on the sequences from probes 8–11 did not match any known human mRNA sequence (although two did match a mouse sequence).

 

Figure 2. A scatterplot matrix of the 11 probes of the 200654_at the probe set over thousands of hybridizations. Each graph inside the matrix is a scatterplot of one probe's intensity values versus another probe's. Probes that tend to agree would form a 45o line. Probes that disagree would show a random pattern or a pattern other than a 45o line.
To validate further that probes 8–11 do not detect the same transcript as probes 1–7, consider the scatterplot matrix of Figure 2. This scatterplot matrix illustrates the correlation in log intensity for every probe pair within the 200654_at probeset. The matrix shows individual scatterplots of each pairing of probes within the 200654_at probeset for thousands of HG-U133A hybridizations performed at Expression Analysis. Probes that generally agree with each other would create a cloud of points that resembles a 45o line. Probes that do not agree would show either a random pattern or a pattern other than a 45o line. We would expect probes in the same probeset to agree since they are targeting the same transcript, just different sections. One can easily see from the scatterplot matrix that the first seven probes show agreement, but the agreement changes dramatically when we examine the bottom or right sides of the scatterplot matrix. The correlation of the first seven probes with the last four often is zero, or can even be negative. Cleary, using the last four probes with the other seven will tend to obscure any differential expression that may be indicated by an experiment for this probe set. The first seven probes will have to prevail over the disinformation provided by the last four.

We have provided a detailed illustration of only one probeset. Many probe sets have much better overall behavior, but many also have much worse. Our analysis shows that almost half of the HG-U133A probe sets have at least four malperforming probes and that these malperforming probes often mask true differential expression no matter what signal algorithm is used to summarize the various probe intensities within a probe set.

Although malperforming probes may be difficult to find or characterize, the solution is surprisingly simple: ignore or mask the malperforming probes when computing probeset signal. There are many ways of doing this within the multiple software packages available for computing signal for Affymetrix chips, some being simpler than others. For some, rebuilding a library file may be required. For others, it is as simple as removing entries in a table. Surprisingly, malperforming probes may provide utility when it performs chip normalization, as these probes tend to be relatively invariant — a desired attribute during normalization.

To understand the impact of these malperforming probes on detecting differentially expressed genes, a particular study conducted by Expression Analysis on human subjects uncovered 110 transcripts that were considered statistically significant using the robust multi-array average (RMA) for signal computation and using SAM significance analysis of microarrays (SAM) to assess significance and estimate false positives (which were estimated to have at least a two-fold difference between experimental groups). This type of analysis is fairly standard when comparing two classes of samples with Affymetrix arrays. When malperforming probes were removed and the same analysis steps were repeated with all other factors the same, our list grew to 251 transcripts. Moreover, all 110 transcripts in our original list were contained in the new list of 251. This implies a large number of transcripts had their estimated fold change compressed below two-fold by malperforming probes. When we repeat the analysis with other signal measures, such as MAS5 or dChip, we see a similar impact.

When examining a wider array of experiments, we (7) have seen that malperforming probes could be obscuring 20%–75% of the differential expression that is detectable in an experiment using techniques that address multiple testing and that use thresholds for fold change and/or significance (i.e., techniques that customarily are used to reduce noise and false positives).

Expression Analysis recently has completed additional studies on malperforming probes for the rat and mouse in addition to our studies of the most current human arrays. Not surprisingly, the results for rat and mouse are very similar to human, both in their nature and magnitude.

Conclusion The good news is that microarrays, given sufficient and accurate information, perform as expected; what, in essence, are random sequences show relatively invariant hybridization intensities across a range of tissues. On the other hand, accurate probe sequences within the same probeset generally have intensities that positively correlate, as expected. For this reason, we feel that, both for understanding your experimental results, and when comparing your results with others, microarray probe content should be made available to users. In addition, when we analyze probe content and compare the content data with what is currently known, we are much more sensitive and efficient in detecting differential expression.

References 1. J.E. Larkin, B.C. Frank, J. Quackenbush et al., Nature Methods 2, 337–344 (2005).

2. B.H. Mechanm, G.T. Klus, Z. Szallasi et al., Nucleic Acids Res. 32, 9e74 (2004).

3. R.A. Irizarry, D. Warren, W. Yu et al., Nature Methods 2, 345–350 (2005).

4. J. Parker, Pharmaceutical Discovery 5(2), 22–26 (2005).

5. L. Gautier, M. Moller, L. Friis-Hansen and S. Knudsen, BMC Bioinformatics 5, 111 (2004).

6. J. Zhang, R.P. Finney, H. Buetow et al., Genomics 85, 297–308 (2005).

7. W.J. Jones, Identification of malperforming short oligonucleotide probes and their quantitative impact on Affymetrix expression algorithms and differential expression experiments. In preparation.

Wendell Jones is senior research statistician and Steve Casey* is the founder and COO for Expression Analysis Inc., a provider of regulatory compliant genomic processing services. Steve Casey can be reached at
.

*To whom all correspondence should be addressed.