|
|
||||||||
1 The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02141
2 Department of Pharmacognosy, University of Mississippi, University, Mississippi 38677
3 College of Marine and Earth Studies, University of Delaware, Lewes, Delaware 19958
* To whom correspondence should be addressed. E-mail: choover{at}broad.mit.edu
| Abstract |
|---|
|
|
|---|
Abbreviations: Hflr, a Shannon-Weaver–type entropy statistic that reflects complexity estimates among different samples on the basis of kinetic rates and maximun fluorescence PCA, principal component analysis
| Introduction |
|---|
|
|
|---|
S. polydactyla is abundant in shallow coral reef habitats in Guam, and synthesizes two terpenoid secondary metabolites, pukalide and 11β-acetoxypukalide (Slattery et al., 2001). The S. polydactyla colonies, predation intensity, and secondary metabolite production at our study sites are well characterized (Slattery et al., 1998, 1999, 2001; Kamel et al., 2004; Kamel and Slattery, 2005). The synthesis pathways for these two compounds are unknown but may involve the mevalonic acid pathway, a common terpene synthesis pathway. Temporal and spatial variation is observed in both metabolites, which are known for having feeding-deterrent and antimicrobial properties (Slattery et al., 2001). Variation in secondary metabolite content or phenotypic plasticity may be an adaptation to several environmental factors. Preliminary evidence suggests that intensity of predation by butterflyfishes, Chaetodon unimaculatus and C. melannotus, may be responsible for differential chemical synthesis (Farmer and Ryan, 1992; Slattery et al., 2001; Van Alstyne et al., 2001; Puyana et al., 2003).
Chemical defenses can be induced through several mechanisms. Activation, or pre-formed defense, is often found when a quick response is necessary. This type of defense involves the storage of nontoxic or mildly toxic compounds that are mixed to form compounds with a greater deterrent effect (either by enzymatic reaction or by synergistic effects of the compounds) immediately after tissue damage and cellular disruption (Karban and Baldwin, 1997; Van Alstyne et al., 2001; Van Alstyne and Houser, 2003). Other pre-formed defenses occur when defensive compounds are quickly transported to the site of feeding damage (Dussourd and Eisner, 1987). Induced responses occurring on longer time scales (days to weeks) are often controlled by synthesis mechanisms (Karban and Baldwin, 1997). For example, methyl jasmonate, a molecule involved in the jasmonate synthesis pathway and a common plant secondary metabolite, induces the production of proteinase inhibitors in plants (Farmer and Ryan, 1990, 1992). Proteinase inhibitors disable endo- and exopeptidases, thus interfering with the digestion processes of many herbivores. Because secondary metabolites produced by marine and terrestrial organisms span many chemical classes, control of chemical induction may employ other biosynthetic pathways. Since synthesis of feeding-deterrent secondary metabolites is likely to be triggered by a change in a gene or group of genes, changes in gene expression after an experimental manipulation can often be profiled (Farmer and Ryan, 1992; Van Alstyne et al., 2001; Puyana et al., 2003).
Here we investigate transcriptome-level changes (i.e., changes in gene expression) in the soft coral Sinularia polydactyla, to assess the affect of experimental predation on both biochemical content and gene expression patterns. We employ a novel mRNA-pool profiling technique using an informatics-based analysis of kinetic profiles (Fielman and Marsh, 2005; Marsh and Fielman, 2005). Although other methods exist for profiling transcriptome changes (e.g., ESTs, SAGE, microarray), none are inexpensive or rapid and all depend upon having substantial genomic sequence available. This fact alone makes transcriptome profiling in non-model organisms rare in the literature. The method used in this study allows for quick, high-throughput analysis of sequence complexity in mRNA pools and has been used successfully to compare transcriptome-level differences in soft corals (Hoover et al., 2007b, c). In conjunction with the transcriptome analyses, chemical extracts from each sample were analyzed by HPLC to investigate changes in secondary metabolite levels. Examining phenotypic plasticity of secondary metabolites in S. polydactyla at a molecular level will allow us to describe the relationship between gene expression and defensive metabolite production. Here we attempt to provide direct evidence for changes in gene expression in response to predation as the primary mechanism influencing biochemical phenotypic plasticity.
| Materials and Methods |
|---|
|
|
|---|
2 m) with a mean annual tidal range of 0.5 m (Slattery et al., 2001). S. polydactyla is a highly branched species. Each colony consists of multiple branches that separate into smaller branches, terminating with multiple fingerlike extensions. In 2004, the experiment ran for 25 days, divided into two periods around a mass spawning event. On days 1–3 of the experiment, five clippings were taken daily from the tips of each treatment coral. Tips are defined as the ends of the multiple extensions of a branch. These tissue pieces were placed in the RNA preservation solution RNAlater (Ambion) immediately after collection in an effort to preserve transcriptome changes that closely follow predation stress. A separate sample of tissue was harvested from a different branch of each treatment coral and frozen at –80 °C for chemical analysis, so as not to confound the biochemical analysis with the RNA analysis. After the first 3 days of the experiment, sample clippings were taken every other day for 7 days (days 5, 7, 9, etc.). Mass spawning occurred on day 13 of the experiment. The treatment corals were subjected to a second round of artificial predation after spawning, starting on day 16. Five clippings were again taken every day for 3 days, and then every other day for the next week. At the end of the experiment, a piece of tissue was harvested from each of the control (non-manipulated) colonies. These samples were frozen at –80 °C for chemical analyses and comparison to treatment samples. In addition, tissues from five randomly selected coral colonies were collected and preserved for gene expression and chemical analyses, serving as baseline reference samples. Using the same experimental design and sampling regime, the experiment was repeated in 2005, this time beginning after the mass spawning event and running for 12 days. Bite scars appear on S. polydactyla as obvious light or white spots between 1 and 3 mm in diameter, and are visible for about 5 days after a predation event. The clippings taken in both sets of experiments simulate natural predation, with the quantity of tissue taken in each clipping mimicking a bite from a butterflyfish. To distinguish our experimental predation from naturally occurring fish predation, we often refer to this as "artificial predation." Care was taken to select healthy treatment and control coral colonies, excluding those with apparent damage or signs of severe predation.
Secondary metabolite analysis
Tissues were freeze-dried and ground into a powder, and secondary metabolites were extracted using an accelerated solvent extractor (ASE; Dionex 200). Each sample was extracted using 50:50 methanol/methylene chloride under high temperature and pressure. The resulting extracts were concentrated using a Buchi R-200 rotovapor, dried, and weighed. A 10-mg sample of crude extract was dissolved in 250 µl of 40:60 hexane/ethyl acetate, and 10 µl of diluted extract was injected into a Waters 2695 separations module with a Waters 2996 photodiode array detector. Each sample was run on a Phenomenex normal phase, silica column with 60:40 hexane/ethyl acetate at 0.5 ml min–1 for 25 min. Pure 11β-acetoxypukalide was used as a standard and was run under the same HPLC conditions as the experimental samples. Because crude extracts from the experimental samples were used, several peaks representing unknown compounds were present. Since 11β-acetoxypukalide is well characterized and known to discourage fish predation, our chemical analysis focused on changes in this compound as well as total differences across the entire chemical "fingerprint."
Statistical comparisons of 11β-acetoxypukalide concentrations were analyzed using Sidák and ANOVA tests in SPSS 13.0.
RNA extraction and precipitation
Soft coral tissues were preserved in RNAlater. Portions of tissue weighing 100 mg were weighed and placed in 1 ml of Trizol reagent (Invitrogen) overnight. Trizol reagent is a monophasic solution of phenol and guanidine isothiocyanate. RNA extraction is rapid and based on the single-step method developed by Chomczynski and Sacchi (1987). To prevent cross-contamination of samples, each individual was homogenized in Trizol, using a sterile microcentrifuge tube pestle. Extractions were carried out as described in the Trizol manual. Each RNA sample was then resuspended in DEPC-treated water and its integrity checked on an agarose gel. Intact samples were then ethanol-precipitated for future cDNA library synthesis.
cDNA library synthesis
Each RNA sample was quantified using a RiboGreen assay (Molecular Probes) run on a Fluoro-Star Optima fluorimeter. One microgram of RNA was ethanol-precipitated and used in library synthesis following the protocol presented in Marsh and Fielman (2005). First-strand synthesis was accomplished using a "switch-prime" method that yields single-stranded cDNA copies with unique oligo adapters on the 3' and 5' ends (Stirewalt et al., 2004). The RNA pellet was resuspended in a 10-µl mixture of 1x reverse transcriptase buffer, 1 µmol l–1 XDNA oligo (5'–GAG AGA CTC GAG AGA GAC CGT CTA CGA ATT CTG CAT C (T)24-3), and RNasin (20 units). The XDNA oligo was annealed to the RNA transcripts by incubating each sample at 45 °C for 5 min, followed by a 5-min incubation at 37 °C. Next, 10 µl of a first-strand reaction mixture, containing 1x reverse transcriptase buffer, 50 mmol l–1 DTT, 2 mmol l–1 dNTPs, 1.0 µmol l–1 Xho-switch oligo (5'-AGA TGT CTC GAG GCG TAC CAG ATG ACG GG–3'), and 350 units of Superscript III reverse transcriptase (Invitrogen), was added to each sample. The entire mixture was then incubated at 37 °C for 12 h.
First-strand cDNAs were ethanol-precipitated after the 12-h incubation. Second-strand cDNA synthesis was accomplished by means of a linear PCR reaction using the Xho-switch nucleotide sequence site on the 3' end of first-strand cDNA molecules. Pellets from the above precipitation were resuspended in a 20-µl PCR reaction containing 0.25 mmol l–1 dNTPs, 1x Ex Taq reaction buffer, 0.5 µmol l–1 ZDNA-F oligo, and 0.5 units of Ex Taq DNA polymerase (Takara). These PCR reactions were amplified on a thermal cycler using 25 cycles of the following: 30 s at 94 °C, 15 s at 58 °C, 6 min at 72 °C. After linear amplification, cDNA libraries were amplified in a 50-µl PCR reaction containing 1 µmol l–1 ZDNA-F, 1 µmol l–1 XDNA-R, 0.5 mmol l–1 dNTPs, 1x Ex Taq reaction buffer, and 0.5 units of Ex Taq. Three microliters of the linear amplification reaction were used as the cDNA template for each reaction. A thermal cycler was used with the following profile for 35 cycles: 30 s at 94 °C, 15 s at 55 °C, 6 min at 72 °C.
Amplified cDNA libraries were ethanol-precipitated and resuspended in 50 µl of distilled water. A PicoGreen (Molecular Probes) assay was used to quantify the nucleic acid concentration, and libraries were checked visually on agarose gels.
Reannealing rate assays
For 2004 samples, reannealing rate assays, described below, were run in a 96-well-plate format on a quantitative PCR machine (ABI 7700); 2005 samples were run in a 384-well-plate format on the similar ABI 7900. All assays were run with 2x SSC, 18% formamide, 5% DMSO, 0.05% Tween-20, and 0.005x PicoGreen with 200 ng of cDNA in a final volume of 20 µl in each well. Four replicate wells were run for each sample. After being placed in the ABI machine, the samples were heat-denatured at 95 °C for 10 min. After denaturation, the temperature profile stepped down by 1 °C increments. During each 30-min temperature plateau, the fluorescence produced by the PicoGreen fluorophore in response to changes in the concentration of double-stranded DNA was measured. For the data presented here, the first temperature plateau occurred at 89 °C and continued down to 55 °C, for a total of 35 temperature plateaus. This type of assay requires 18 h to run and produces a data file containing raw fluorescence traces for each well at each temperature point (Hoover et al., 2007c). Comparisons of data run on both the ABI 7700 and the ABI 7900 showed no significant differences in overall gene expression patterns. Profiles from the ABI 7900 did have higher values of Hflr, a Shannon-Weaver–type entropy statistic that reflects complexity estimates among different samples on the basis of kinetic rates and maximun fluorescence, and appeared to have smoother transitions between each temperature step, but the relative transcriptome profile remained the same.
Data were collected from reannealing rate assays for both experimental and baseline (non-manipulated) samples from the start, midpoint, and end of the experiment for five individuals in each treatment, with four well duplicates for each individual (total n = 80).
Data analysis
Raw fluorescence data files were processed using a C++ program called RotKin, ver. 2.05a (ABI 7700) or 2.08 (ABI 7900), written for MS-Windows (Marsh and Fielman, 2005; Hoover et al., 2007c). This program sequentially separates the continuous data for each well into the experimental temperature plateaus, filters the fluorescence data to increase the signal-to-noise ratio, and normalizes each fluorescence data set. It then iteratively fits the fluorescence versus time data for each well at each temperature to a second-order kinetic regression, and finally transforms each data set into a frequency distribution of observations to calculate an information entropy statistic (Shannon, 1948; Weaver, 1948; Shpak and Churchill, 2000; Marsh et al., 2006).
The reannealing profiles are fit to a second-order kinetic regression, using an equation that describes fluorescence as a function of time. The kinetic-regression statistics are primarily used for quality control to automatically screen more than 3300 reannealing profiles that are produced on a single run with a 96-well plate. The fluorescence profiles for each well at each temperature are subsequently converted into a frequency distribution based on maximum fluorescence for that particular profile. These distributions are on a 0–100% scale, with 5% bin intervals. Information complexity of the reannealing curves can then be calculated from the frequency distribution, using a Shannon-Weaver–type entropy statistic: H'= –1 x
pi x log2 (pi), where pi = the proportion of the ith observation in the total distribution. Thus, the Shannon-Weaver statistic can quantitatively represent the sequence complexity of gene transcripts in a cDNA library, which integrates information about both the distribution and abundance of mRNA sequences in a transcriptome (Marsh and Fielman, 2005; Hoover et al., 2007c). The resulting statistic, Hflr, estimates complexity, distribution, and abundance of mRNA transcripts in the transcriptome among different samples on the basis of kinetic rates and maximum fluorescence. Hflr is calculated using the following equation: Hflr = –Fmax x
pi x log2 (pi), where Fmax is the maximum fluorescence in a fluorescence profile. Using these calculations based on kinetic rates and fluorescence (Hflr or complexity), the different treatment groups were examined for transcriptome-level differences at both the inter-individual and between-treatment groups levels.
For each of the samples, run in replicates of four, a mean Hflr value and standard error were calculated. Mean Hflr values were log-transformed and graphed against temperature (°C) to yield a transcriptome profile for each sample. Thus, differences in Hflr between two samples or groups indicate changes in the distribution and abundance in the pool of mRNA transcripts.
Changes in gene expression among the different treatment groups were further analyzed using a median normalization strategy. Here, a median Hflr value for all individuals being compared was calculated for each temperature. Next, the individual Hflr values were compared to the median at each temperature. These data were analyzed using a hierarchical clustering strategy with a Euclidean distance similarity metric in the program Cluster 3.0 (Eisen et al., 1998). The resulting dendrograms were visualized with Tree View 1.60 (Page, 1996). These dendrograms were based on median normalized data in a temperature-by-individual matrix that uncovered patterns that are not evident when looking at the raw profiles of gene expression.
A better understanding of the relationship between transcriptome profile and secondary metabolite data was achieved by integration of these two types of data using principal component analysis (PCA). PCA aims to find underlying dimensions or factors that can be used to explain complex data sets (Norusis, 1990). PCA searches for a linear combination of variables that extracts the maximum variance from the variables. Next, it removes this variance and seeks a second linear combination that explains the maximum proportion of the remaining variance. This process continues for each component. Data for PCA are in the form of ratios of the maximum value for each variable.
An integrated data matrix was compiled with all transcriptome complexity and metabolite concentrations (known and unknown) for the PCA analysis. The unique aspect of our analytical approach is that we can relate high-throughput data sets for both transcriptome and metabolome constituents.
| Results |
|---|
|
|
|---|
|
In both the 2004 and 2005 experiments, statistically significant differences in 11β-acetoxypukalide concentration were seen between groups exposed to artificial predation and those that were not manipulated (Fig. 2). For both years, 11β-acetoxypukalide concentrations were significantly higher in endpoint samples from treatment groups than in either initial (T0) or control samples. An ANOVA with Sidák tests for multiple comparisons was conducted for each experiment. For 2004 data, the ANOVA showed highly significant differences in 11β-acetoxypukalide concentration, with P
0.008. Multiple comparisons indicated that endpoint 11β-acetoxypukalide concentrations were significantly different from T0 and control samples (P
0.019). Control and T0 samples were not significantly different from each other, suggesting that the increase in secondary metabolite concentration is driven by increased predation pressure. Similarly, for 2005 data, the ANOVA resulted in a value of P
0.001, again indicating significant differences in 11β-acetoxypukalide concentration among different treatment groups. As in the 2004 experiment, multiple comparisons showed statistically significant differences in 11β-acetoxypukalide concentration in mid- and endpoint samples versus controls (P
0.015). Again, control and T0 samples were not significantly different from each other.
|
|
In Figure 4, which illustrates gene expression patterns explored using median normalization plots, the lighter colors indicate values above the median, or upregulation, and darker colors indicate values below the median, or downregulation. Samples from T0 formed a distinct cluster from endpoint samples (T24 and T12 for the 2004 and 2005 experiments, respectively). This may indicate that S. polydactyla responds to artificial predation or continued intense predation by upregulating certain classes of genes.
|
|
|
| Discussion |
|---|
|
|
|---|
The marked increase in transcriptome complexity at 64 °C and the correlation to secondary metabolite production suggest that an inducible gene expression event is occurring in response to increased predation. The sharp transcriptome complexity increase indicates that the gene or genes that produce the transcripts reannealing at 64 °C are being upregulated. Thus changes in the metabolic pathways involved in the stress response (increased metabolite production) seem to be targeted, not general, such that stress activates one specific pathway rather than multiple pathways. Although the exact mechanism responsible for the upregulation of these genes is unknown, one possibility is that tissue loss triggers the production of polypeptide growth factors. These growth factors may bind to receptors in the genes involved in the secondary metabolite pathway and ultimately increase transcription of genes that produce secondary metabolites. Certainly, there may be a relationship between genes controlling growth or wound healing and those controlling secondary metabolite production. In fact, some may argue that the gene upregulation we observed was actually that of repair genes. However, the correlation we found between 11β-acetoxypukalide and Hflr supports the idea of upregulation of transcripts involved in producing defensive metabolites. Furthermore, the transcriptome profile peak at 77 °C is evident in all coral samples, including baseline controls. Considering that these corals are constantly exposed to environmental stressors necessitating cellular repair, it is more plausible that the 77 °C peak is associated with expression of repair genes, whereas the 64 °C peak is associated with metabolite production. Future studies should elucidate which genes are being expressed in Sinularia to produce defensive secondary metabolites.
Additionally, differences in secondary metabolite production in S. polydactyla may be sex-related. A recent study indicated significant differences in defensive diterpene production between male and female clones of the soft coral Sarcophyton glaucum (Fleury et al., 2006). This finding presents interesting questions for future studies of S. polydactyla. Perhaps a clearer picture of secondary metabolite production could be gained by comparing male and female clones of S. polydactyla at both the biochemical and gene expression levels.
Previous attempts to identify the genes responsible for 11β-acetoxypukalide regulation focused on isolating and measuring changes in 3-hydroxy-3-methylglutaryl-CoA (HMG CoA reductase), a central enzyme in common pathways for terpene synthesis (Karban and Baldwin, 1997). These attempts were unsuccessful at isolating or identifying an orthologous gene of HMG CoA reductase. As an alternative method, the correlation of 11β-acetoxypukalide concentration to a class of mRNA transcripts presents another potential avenue for identifying functionally relevant genes. The isolation of cDNA fractions that are kinetically labile at 64–65 °C could produce a small subset of gene targets that may be related to the synthesis of secondary metabolites, all without any a priori knowledge of what specific metabolic functions may be represented in that pool (i.e., without having to generate a list of specific target genes for designing oligo primers or probes). Isolation of these transcripts may provide us with a few highly represented transcripts that could indicate what the most active metabolic pathways might be after the induction of chemical defenses in response to predation.
The reannealing rate assay method used in this study has advantages in that it is rapid, relatively inexpensive, and high-throughput. For work with natural populations of non-model organisms, it is very important to be able to sample at a statistically significant scale. Few studies have looked at gene expression patterns in non-model organisms; those that have, used sample sizes of less than 20 individuals, primarily because of limitations in handling microarray-based measurements of gene expression (Oleksiak et al., 2001, 2002; Hashimoto et al., 2004). For a marine invertebrate like S. polydactyla, few sequence data are available, making microarray work difficult. In this study, we have profiled the transcriptome complexity of more than 50 individual samples, allowing us to describe significant changes in gene expression patterns in response to an artificial predation stress, and providing a detailed assessment of the interindividual variance in transcriptome activities among individuals. This ability to screen mRNA samples for specific characteristics indicative of transcriptional activation is important for subsequent studies to identify specific genes involved in response pathways.
An additional advantage of the method we have developed here is the statistical integration of the resulting transcriptome profile data with other biochemical response data sets. Here, secondary metabolite levels were highly correlated with the transcriptome data at 64–65 °C. Relationships between transcriptome and secondary metabolite data were validated by the linkage that was shown to exist for 11β-acetoxypukalide, a known inducible feeding deterrent in this soft coral. However, this method also revealed other potentially important metabolites worth further characterization because of the apparent linkage to gene expression activities associated with predation stress. Our cross-correlations of molecular and biochemical data in response to an ecological-level process (predation-wound stress) make this transcriptome profiling approach highly applicable for research focusing on the molecular ecology of any non-model organism collected from field populations.
The relatively recent ability to apply genomics-based assays to non-model organisms, such as S. polydactyla, opens enormous potential to better understand the impact of environmental changes on coral reef habitats and marine ecosystems. Given recent interest in marine and plant-derived secondary metabolites for biomedical use and drug development (Fenical, 2006), a better understanding of the relationship between gene expression and the pathways controlling secondary metabolite production is necessary. Therefore, new techniques like the one we describe here offer an inexpensive, high-throughput mechanism to integrate transcriptome and metabolite data (for review, see Hoover et al., 2007a). The application of genomics-based assays to ecological studies, especially those investigating the impacts of environmental stressors, will allow us to identify the direct linkages that exist between molecular/genetic and biochemical/physiological levels of biological organization.
| Acknowledgments |
|---|
| Footnotes |
|---|
| Literature Cited |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |