Abstract
Introduction
Single-celled organisms such as yeasts regularly face variable and often harsh external environments which threaten its survival or at least prevent it from performing optimally. Environmental changes may be of a physical or chemical nature: temperature, oxidation, osmolarity, acidity, nutrient availability, etc (Hohmann and Mager, 2003). Yeasts have evolved to survive environmental stresses by rapidly responding to changes in environmental conditions. The adaptation mechanisms to stress are highly complex. One aspect of this cellular adaptation is the reorganization of genomic expression. The genomic expression program required for maintenance of the optimal cell physiology in one environment may be far from optimal in a different environment. Thus, when environmental conditions change abruptly, the cell rapidly adjusts its genomic expression program to adapt to the new conditions (Gasch et al. 2000).
The reprogramming of genomic expression can be unveiled using genome-wide DNA microarrays, which measure the relative transcript levels of essentially every gene in the yeast genome at any given moment, providing a snapshot of the genomic expression program (Gasch and Werner-Washburne, 2002). Exploring the dynamic nature of the yeast genome through time-course experiments can illuminate the yeast stress response. For example, Gasch et al. (2000) and Causton et al. (2001) used genome-wide expression analysis to explore how gene expression in yeast is remodeled over time as cells respond to heat shock, oxidative shock, osmotic shock, acidic stress, nitrogen depletion, amino acid starvation, etc. They discovered that more than half of the genome is involved in responding to at least one of the investigated environmental changes. A set of genes (~10% of yeast genes), termed as the environmental stress response (ESR) genes or common environmental response (CER) genes, showed a similar drastic response to almost all of these environmental changes. Other gene expression responses appeared to be specific to particular environmental conditions. Characterizing environmentally triggered gene expression changes provides insights into when each gene is expressed (Gasch and Werner-Washburne, 2002). However, the complete network of the regulators of stress responses and the details of their actions remain to be elucidated (Gasch et al. 2000).
In this study, we try to identify the stress TFs of six kinds of stresses. We used two kinds of data sets. First, genome-wide gene expression time profiles under various stress conditions such as heat shock, oxidative shock, osmotic shock, acidic stress, nitrogen depletion, and amino acid starvation are from Gasch et al. (2000) and Causton et al. (2001). Under each stress condition, samples are collected at multiple time points for all genes in the yeast genome. Each Cy5-labeled sample was compared with a Cy3-labeled reference pool, consisting of an equal mass of all of the RNA samples. Then the data were mathematically “zero transformed” for visualization by dividing the expression ratios for each gene measured on a given array by the corresponding ratios measured for the unshocked, time-zero cells. Therefore, the ratios represent the expression level at each time point relative to the expression level in the unshocked, time-zero sample. Second, genome-wide regulatory targets of a TF are retrieved from YEASTRACT database (Teixeira et al. 2006b). The regulatory associations between genes and TFs are reported if they are supported by experimental results from the literature or genome-scale studies such as motif data (TF-binding sites), mutant data (genome-wide gene expression changes in mutant strains defective in TFs), and ChIP-chip data (genome-wide binding targets of TFs). Therefore, it is a comprehensive data set to include all possible TF-gene associations under various kinds of conditions. Total of 147 TFs, which are collected in YEASTRACT database, are used in this study. We developed Stress Transcription Factor Identification Algorithm (STFIA), which integrates these two kinds of high throughput genome-scale data (gene expression and TF-gene association data) to identify plausible stress TFs that regulate the target genes to confer stress protection.
Methods
Stress Transcription Factor Identification Algorithm (STFIA)
STFIA finds out the genes in the yeast genome that respond to a specific stress. A gene is said to be in response to a specific stress if more than
For each of the 147 TFs, STFIA determines whether the TF under study is involved in responding to a specific stress. The rationale is that a TF is involved in responding to a specific stress if a statistically significant portion of its regulatory targets is in response to that specific stress. The hypergeometric distribution is used to test the statistical significance (Mendenhall and Sincich, 1995). The
For the illustrative purpose, let us take Hsf1, a well-known heat shock TF, as an example. Let
This
Repeat Step 1 and Step 2 for different parameter settings of T, The flowchart of STFIA.
Results
Biologically Significant Stress TFs have been Found
We applied STFIA to find the stress TFs that are involved in responding to heat shock, oxidative shock, osmotic shock, acidic stress, nitrogen depletion, and amino acid starvation, respectively. Table 1 shows the high-confidence stress TFs for each of the above six stresses. The identified stress TFs can be divided into two categories. The first category is the well-known stress TFs with solid literature evidences which could directly indicate involvement of these TFs in response to that specific stress. The second category is the novel stress TFs that have partial (indirect) or no literature supports. We found that 51% (24/47 counting multiplicity) of the predicted stress TFs belongs to the first category, validating the effectiveness of STFIA. Besides, 65% (15/23 counting multiplicity) of the second category has partial (indirect) literature supports, showing the prediction power of STFIA. Therefore, the eight novel stress TFs that have no literature evidences yet are worthy to be experimentally verified in the future.
The high-confidence stress TFs involved in responding to each of the six stresses. The high-confidence stress TFs involved in responding to heat shock, oxidative shock, osmotic shock, acidic stress, nitrogen depletion and amino acid starvation are shown. The TFs for each stress are ranked according to the number of times that they are identified under different parameter settings. For example, among all osmotic shock TFs, Hot1 is the one that is identified with the largest number of times. Besides, the stress TFs are colored blue (red) if there exist solid (partial) literature evidences showing that they are involved in responding to the same stress as we predicted.
Biological Validation of Our Results
Now we discuss in details the evidences from the literature that support our findings.
Heat Shock
As shown in Table 1, Msn2, Msn4 and Hsf1 are well-known heat shock TFs. Msn2 and Msn4 bind DNA at stress response element (STRE) and activate many STRE-regulated genes in response to many stresses such as heat shock, oxidative shock, osmotic shock, etc (Cherry et al. 1998). Hsf1 binds DNA at heat shock element (HSE) and activates multiple genes in response to hyperthermia (Cherry et al. 1998).
Sfp1, Pdr3, Rpn4 and Stp1 are novel heat shock TFs. Two previous studies partially support our findings. First, Sfp1 regulates RP gene expression in response to heat shock. In response to heat shock, Sfp1 is released from RP gene promoters and leaves the nucleus, and RP gene transcription is down-regulated (Marion et al. 2004). Second, Hsf1 activates expression of
Oxidative Shock
As shown in Table 1, Msn2, Msn4, Yap1 and Skn7 are well-known oxidative shock TFs. Msn2 and Msn4 activate many STRE-regulated genes in response to many stresses such as oxidative shock, heat shock, osmotic shock, etc (Cherry et al. 1998). Yap1 and Skn7 are known to regulate genes that respond to oxidative shock. For example, they co-operate to regulate
Hsf1 and Pdr3 are novel oxidative shock TFs. The phosphorylation state of Hsf1 changes during oxidative stress, indicating the activity of Hsf1 is modified under oxidative stress (Güldener et al. 2005).
Osmotic Shock
As shown in Table 1, Hot1, Msn2 and Msn4 are well-known osmotic shock TFs. Hot1 is required for the transient induction of glycerol biosynthetic genes
Pdr3, Hsf1, Ino2 and Ino4 are novel osmotic shock TFs. Two previous studies partially support our findings. First, DNA microarray analysis of Pdr3 mutants indicates novel Pdr3 regulatory targets including those induced by NaCl and/or those conferring salt sensitivity upon deletion. These results reveal an unexpected role for Pdr3 in salt tolerance (Onda et al. 2004). Second, it has been shown that Hsf1 is rapidly activated by either hyper- or hypo-osmotic stress, indicating that Hsf1 may serve some physiological functions during osmotic stress (Caruccio et al. 1997).
Acidic Stress
As shown in Table 1, Msn2 and Msn4 are well-known acidic stress TFs.
Pdr3, Stp1, Ino2 and Ino4 are novel acidic stress TFs. Teixeira et al. (2006a) analyzed the global gene transcription pattern of the yeast in response to sudden aggression with the 2,4-dichlorophenoxyacetic (2,4-D) acid. They found that most of the up-regulated genes in response to 2,4-D acid are known targets of Msn2, Msn4, Pdr3 and Stp1, partially validating our findings.
Nitrogen Depletion
As shown in Table 1, Gln3, Dal80, Dal81 and Gcn4 are well-known nitrogen depletion TFs. Gln3 and Dal80 are known to regulate the expression of nitrogen catabolite pathways in
Stp1, Stp2, Rpn4, Sfp1, Ifh1, Arr1 and Ino4 are novel nitrogen depletion TFs. Several previous studies partially support our findings. First, stp1stp2 double deletion mutants exhibit a number of transcriptional phenotypes, such as the increased expression of genes subject to nitrogen catabolite repression and genes involved in stress response, indicating that Stp1 and Stp2 play roles in pathways for the assimilation of nitrogen (Eckert-Boulet et al. 2004). Second, on solid growth media with limiting nitrogen source, diploid budding-yeast cells differentiate from the yeast form to a filamentous, adhesive, and invasive form. Both low availability of nitrogen and a solid growth substrate are required to induce diploid filamentous-form growth. It is known that Rpn4 regulates filamentous growth, indicating that Rpn4 may be involved in responding to nitrogen depletion (Prinz et al. 2004). Third, ribosomal protein (RP) genes in eukaryotes are coordinately regulated in response to growth stimuli and environmental stress, thereby permitting cells to adjust ribosome number and overall protein synthetic capacity to physiological conditions. Sfp1 and Ifh1 are known to regulates RP gene expression in response to nutrients depletion (Cherry et al. 1998; Marion et al. 2004).
Amino Acid Starvation
As shown in Table 1, Gcn4, Dal80, Gln3, Met4, Met28, Met31, Leu3 and Gat1 are well-known amino acid starvation TFs. Several previous studies support our findings. First, Gcn4 is a well-known transcriptional activator of amino acid biosynthetic genes in response to amino acid starvation (Cherry et al. 1998). Second, the expressions of the genes encoding the general amino acid permease and the ammonium permease are regulated by Gln3 and Dal80. Another group of genes whose expressions are also regulated by Gln3 and Dal80 are some proteases,
Stp1 and Stp2 are novel amino acid starvation TFs. One study partially supports our prediction (Ecker-Boulet et al. 2004).
Discussion
To have the ability to respond rapidly to abrupt and dramatic fluctuation in the external conditions is crucial for cell survival. Sudden changes in the external environment can perturb the internal system of the cells, disrupting cellular functions and preventing growth. Therefore, unicellular organisms such as yeasts have evolved complex adaptation mechanisms to cope with environmental stresses. One aspect of this cellular adaptation is the reorganization of genomic expression (Gasch and Werner-Washburne, 2002). The genome-wide stress-responsive genes in yeast have been discovered by DNA microarrays (Gasch et al. 2000; Causton et al. 2001). However, the complete network of the regulators of stress responses and the details of their actions remain to be elucidated (Gasch et al. 2000).
In this study, we developed STFIA which integrates gene expression and TF-gene association data to identify stress TFs of six kinds of stresses. Some general stress TFs that are in response to various stresses and some specific stress TFs that are in response to one specific stress are identified. For example, STFIA found out the general stress TFs Msn2 and Msn4 (Cherry et al. 1998) and the well-known heat shock TF Hsf1 (Cherry et al. 1998), oxidative shock TFs Skn7 and Yap1 (Güldener et al. 2005), osmotic shock TF Hot1 (Rep et al. 1999), nitrogen depletion TFs Gln3 and Dal80 (Hofman-Bang, 1999), and amino acid starvation TFs Gcn4, Met4, Met28 and Met31 (Cherry et al. 1998). The ability to find out these well-known stress TFs validates the power of STFIA.
STFIA identified 24 distinct TFs (Arr1, Dal80, Dal81, Gat1, Gcn4, Gln3, Hot1, Hsf1, Ifh1, Ino2, Ino4, Leu3, Met28, Met31, Met4, Msn2, Msn4, Pdr3, Rpn4, Sfp1, Skn7, Stp1, Stp2, and Yap1) to be in response to at least one of the six stresses under study (see Fig. 2). That is, a small number of TFs may be sufficient to control a wide variety of expression patterns in yeast under different stresses. Two implications can be inferred from this observation. First, the adaptation mechanisms to different stresses may have a bow-tie structure (Csete and Doyle, 2004). As shown in Figure 2, the core stress TFs make up the ‘knots’ of a bow tie, facilitating the fan in of a large of variety of environmental stresses through signal transduction pathways and fan out of an even larger variety of stress-adapting proteins through activating stress-responsive target genes. Actually, approximately two-thirds of the yeast genome (about 4000 genes) is involved in responding to the changes in environment (Causton et al. 2001). Second, there exists extensive regulatory cross-talk between different stress responses (see Fig. 3). We found that heat shock, oxidative shock, osmotic shock, and acidic stress all can trigger the stress TFs Msn2, Msn4 and Pdr3, indicating these four stresses share a similar stress adaptation mechanism. Moreover, we found that nitrogen depletion and amino acid starvation both can trigger the stress TFs Gcn4, Gln3, Dal80, Stp1 and Stp2, indicating a cross-talk between the cellular responses to these two stresses. This is not surprising because both nitrogen depletion and amino acid starvation belong to the nutrient deprivation stress and could have a similar stress adaptation mechanism. The fact that different stress adaptation mechanisms share some, but not all, of their regulators suggests a higher level of modularity of the yeast stress response network (Segal et al. 2003).

The transcriptional regulatory network of the yeast stress response. Environmental stresses are represented by octagons, stress TFs are represented by ellipses, and stress-responsive genes are represented by rectangles. Solid (Dashed) lines indicate the known (predicted) regulatory relationships among environmental stresses, stress TFs, and stress-responsive genes. Oxidative shock responses are colored red, osmotic shock responses are colored green, heat shock responses are colored blue, acidic stress responses are colored orange, nitrogen depletion responses are colored brown, and amino acid starvation responses are colored purple.

Regulatory cross-talk among different stress responses. The cellular responses to heat shock, oxidative shock, osmotic shock and acidic stress have significant regulatory cross-talk. They all trigger stress TFs Msn2, Msn4 and Pdr3. Besides, nitrogen depletion and amino acid starvation responses have regulatory cross-talk. They both trigger stress TFs Gcn4, Gln3, Dal80, Stp1 and Stp2.
Step 3 of STFIA is very crucial for filtering out the low-confidence stress TFs that may be found only under some specific parameter setting. For example, 15 TFs (Arr1, Fhl1, Gcn4, Hsf1, Ifh1, Msn2, Msn4, Pdr3, Rap1, Rgt1, Rpn4, Rtg1, Rtg3, Sfp1, Stp1) are identified in Step 2 of STFIA as the heat shock TFs with the parameter setting

Statistics of the performance of STFIA using different cutoff threshold. The false positive and false negative rates of STFIA using different cutoff threshold are shown. When the cutoff threshold equals 25%, STFIA has the best performance in terms of the trade-off between the false positive and false negative rates to find out the high-confidence heat shock TFs.
Conclusion
The adaptation mechanisms to stress are highly complex. They require a complex network of sensing and signal transduction leading to adaptations of cell growth and proliferation as well as to adjustments of the gene expression program, metabolic activities, and other features of the cell (Hohmann and Mager, 2003). This study focuses on the regulation of the genomic expression, which is only one part of cellular adaptation mechanisms. Nevertheless, our study proposed a network of the regulators of stress responses and their mechanism of action. Thus, it provides a starting point for understanding the mechanisms that yeast uses to survive some of the environmental conditions that cells experience in the wild. We believe that as more genomic expression data emerge, in combination with data from other whole-organism approaches, novel computational algorithms such as STFIA have potential to construct a dynamic picture of the integrated cellular response of yeast cells to environmental changes.
