Biol. Bull. 214: 205-217. (June 2008)
© 2008 Marine Biological Laboratory
Does the High Gene Density in the Sponge NK Homeobox Gene Cluster Reflect Limited Regulatory Capacity?
Bryony Fahey*,
Claire Larroux*,
Ben J. Woodcroft* and
Bernard M. Degnan
School of Integrative Biology, University of Queensland, Brisbane QLD 4072, Australia
To whom correspondence should be addressed. E-mail: b.degnan{at}uq.edu.au
 |
Abstract
|
|---|
A huge discrepancy in morphological diversity exists between poriferans and eumetazoans. The disparate evolutionary outcomes of these two ancient metazoan lineages may be reflected in the composition, architecture, and regulation of genomes of modern representatives. As a case study, we compare the sizes of upstream intergenic regions of genes found within the NK homeobox cluster of the demosponge Amphimedon queenslandica with eumetazoan orthologs. This analysis includes NK genes as well as five structural genes interspersed in the cluster. The upstream intergenic regions of the homeobox genes are significantly smaller in Amphimedon than in eumetazoan orthologs, suggesting that the sponge genes house less cis-regulatory information. In contrast, the upstream intergenic regions of the structural genes are not significantly different. The simple developmental expression patterns of representative NK genes in Amphimedon lends support to the proposition that their regulatory apparatuses, unlike those of bilaterians, do not encode the information for dynamic, pleiotropic gene expression. On the basis of this example, we suggest that the size of the intergenic regions upstream of the transcription start site may act as a proxy for estimating regulatory complexity and reflect the limitations of the sponge genome to direct complex and varied morphogenetic processes.
Abbreviations: GRN, gene regulatory network
 |
Introduction
|
|---|
Our understanding of the evolutionary transition to metazoan multicellularity is currently being transformed by comparative analyses of whole genomes (e.g., Putnam et al., 2007; King et al., 2008). To understand the events leading to the emergence of the most recent common ancestor to all living animals, comparisons need to be made between metazoans and close relatives within the Holozoa, which include choanoflagellates and members of related opisthokont lineages (e.g., Cavalier-Smith et al., 1996; King and Carroll, 2001; Snell et al., 2001; Lang et al., 2002; Burger et al., 2003; Cavalier-Smith and Chao, 2003; King et al., 2003; Steenkamp et al., 2006). The Metazoa consists of at least two ancient lineages of extant animals, the Eumetazoa (cnidarians, placozoans, and bilaterian phyla) and phylum Porifera (sponges) (e.g., Cavalier-Smith et al., 1996; Borchiellini et al., 2001; Medina et al., 2001; Collins, 2002; Wallberg et al., 2004). The recent sequencing of the genomes of the choanoflagellate Monosiga brevicollis (King et al., 2008), the sponge Amphimedon queenslandica, the cnidarians Nematostella vectensis (Putnam et al., 2007) and Hydra magnipapillata, and the placozoan Trichoplax adhaerens is allowing us to decipher the genomic events underpinning the transition to metazoan multicellularity.
It appears that a raft of genomic innovations occurred in the stem lineage leading to the metazoan last common ancestor. Sponges possess metazoan-specific transcription factors and signaling pathways (Larroux et al., 2006; Nichols et al., 2006; Adamska et al., 2007a, b; Larroux et al., 2007, 2008) whose orthologs populate the developmental gene regulatory networks (GRNs) underlying bilaterian embryogenesis (Davidson, 2006, and references therein). In contrast, the Monosiga brevicollis genome contains only a very small subset of these gene classes, despite encoding many cell adhesion and communication protein domains that are otherwise restricted to metazoans (King and Carroll, 2001; King et al., 2003, 2008; King, 2004; Larroux et al., 2008). Amphimedon possesses a large majority of transcription factor gene classes that have previously been found only in eumetazoans, including homeobox genes belonging to ANTP, prd-like, Pax, POU, LIM-HD, Six, and TALE classes, as well as basic helix-loop helix (bHLH), Rel, nuclear receptor, Mef2, Ets, Sox, T-box, and Fox genes (Larroux et al., 2006, 2007, 2008; Simionato et al., 2007; Gauthier and Degnan, 2008). Likewise, genes encoding components of all the major eumetazoan developmental signaling pathways are present in the Amphimedon genome (Adamska et al., 2007a, b; Adamska, Richards, Gauthier, and Degnan, unpubl.). Together, these observations indicate that the repertoire of gene classes comprising the developmental regulatory toolkit evolved before the divergence of sponge and eumetazoan lineages. The genesis of many of these metazoan-specific gene classes may have provided the molecular preadaptations that enabled the evolution of metazoan development and multicellularity. On the basis of the number of developmental gene families represented in the sponge genome, it is not hard to envisage the last common ancestor to all extant metazoans being developmentally and morphologically complex (Adamska et al., 2007a; Degnan and Degnan, 2006; Degnan et al., 2005).
Despite the qualitative conservation of the metazoan developmental "regulome" between sponges and eumetazoans, there must be inherent differences in the genomes of the ancestors that gave rise to these lineages. Why is it that the eumetazoan lineage consists of a wide diversity of body plans while sponge morphologies represent modifications of a unique aquiferous body plan that lacks cellular and morphological features found elsewhere in the animal kingdom (Simpson, 1984)? Why has the sponge body plan remained relatively unchanged since well before the Cambrian (Li et al., 1998)? These fundamental differences in complexity and diversity should be manifested in extant genomes. One obvious difference is that representative eumetazoans have a much larger number of developmental genes compared to Amphimedon, with individual transcription factor and signaling pathway gene classes and families having expanded differentially early in the eumetazoan lineage (Kusserow et al., 2005; Magie et al., 2005; Miller et al., 2005; Chourrout et al., 2006; Kamm et al., 2006; Ryan et al., 2006; Larroux et al., 2007, 2008; Putnam et al., 2007; Yamada et al., 2007). This increase in gene repertoire through duplication and divergence may have allowed for (i) the expansion of ancient GRNs that originally would have underpinned the primary generation and patterning of differentiated cell types in the first metazoan embryos, and (ii) the co-option of duplicates into novel developmental roles. Cnidarians and bilaterians have strikingly similar gene memberships (Kusserow et al., 2005; Magie et al., 2005; Miller et al., 2005; Chourrout et al., 2006; Kamm et al., 2006; Ryan et al., 2006; Larroux et al., 2007; Putnam et al., 2007; Yamada et al., 2007, 2008), suggesting that body plan complexity cannot solely be attributed to the growth in gene class and family size. Another possibility is that individual genes expanded their regulatory systems, allowing for their expression in multiple developmental contexts (i.e., individual genes were co-opted into new developmental roles). This increase in regulatory information is likely to manifest as an increase in the length of DNA sequence responsible for the regulation of a particular gene.
Critical cis-regulatory DNA sequences tend to be located in the vicinity of the transcription start site, often just upstream, although other regulatory information can be localized at a great distance from the coding region (reviewed in Davidson, 2006). These sequences act as binding sites for sequence-specific transcription factors that directly control the activation and repression of transcription. Detailed experimental analyses of putative regulatory regions of particular genes in a handful of animals (i.e., sea urchin, ascidian, Caenorhabditis elegans, Drosophila, and model vertebrates) have defined the role of proximal and distant cis-regulatory elements in controlling gene expression. While such analyses currently are not possible in Amphimedon, we can use expression patterns as a proxy to predict the complexity of the regulatory information for a given gene. In bilaterians, most transcription factor genes are used in a number of developmental contexts, with each context requiring its own cis-regulatory module (Davidson, 2006). Here, we explore this concept using a set of NK homeobox genes that are clustered in the genome of Amphimedon (Larroux et al., 2007). NK genes are used at all levels of the bilaterian developmental program, from early germ layer formation to terminal differentiation, and are involved in cell fate determination and differentiation, patterning and morphogenetic processes (reviewed in Banerjee-Basu and Baxevanis, 2001; Jagla et al., 2001; Carroll et al., 2005; Garcia-Fernandez, 2005; Stanfel et al., 2005; Slack, 2006). They are expressed in all three germ layers, and their roles in mesodermal and nervous system development are well studied.
By comparing the upstream intergenic regions of NK genes in Amphimedon, the cnidarian Nematostella vectensis, and various bilaterians covering a range of genome sizes, we show that the Amphimedon genes have markedly smaller intergenic regions. To test whether this trend is specific to developmental genes, we also performed these analyses on five non-homeobox structural genes, which are clustered with the NK genes in Amphimedon. No significant differences were detected for these genes between different organisms, suggesting that the size difference in regulatory regions between sponges and eumetazoans applies specifically to regulatory genes that may belong to developmental GRNs. The relatively simple expression patterns of a selection of Amphimedon NK genes during embryogenesis support the contention that the small intergenic regions may reflect the limited cis-regulatory information associated with these genes. Genome-wide studies in Drosophila and C. elegans have made a similar correlation between gene regulatory complexity and intergenic sequence size (Nelson et al., 2004). Although this case study is restricted to a very small number of genes and thus may not reflect broad genomic principles, it does present a supposition that can be addressed with genome-wide comparative analyses.
 |
Materials and Methods
|
|---|
Analysis of the Amphimedon NK cluster
The Amphimedon cluster of NK homeobox genes was initially assembled using an in-house assembly and scaffolding pipeline as described in Larroux et al. (2007). A draft genome assembly from the US Department of Energy Joint Genome Institute was later consulted to confirm the in-house assembly. We employed the AUGUSTUS gene prediction program to further systematically evaluate this assembly (Stanke et al., 2006) and manually modified models to incorporate regions of homology suggested by BLASTx alignments to sequences in the National Center for Biotechnology Information database. Where available, expressed sequence tags (ESTs) and 5' and 3' RACE sequences (homeobox genes only) were used to confirm coding sequences and to derive 5' and 3' untranslated regions.
Analysis of the expression of NK-related genes
Specimens of Amphimedon queenslandica Hooper and van Soest, 2006 (Porifera, Demospongiae, Haplosclerida, Niphatidae) were collected on Heron Island Reef, Great Barrier Reef, Australia, as described in Leys and Degnan (2002), and in situ hybridization was performed as described in Larroux et al. (2006). Detailed protocols and probe details are available upon request.
Comparative genomics
Sequence alignments (tBlastn) using homeodomains of proteins belonging to NK2, NK3, NK4, NK5, NK6, NK7, Msx, Hex, and Tlx families were derived from Lottia gigantea, Branchiostoma floridae, and Ciona intestinalis genome contigs available at http://genome.jgi-psf.org/ and from Caenorhabditis elegans and Drosophila melanogaster genomes at WormBase (http://www.wormbase.org/) and FlyBase (http://www.flybase.org/), respectively. Genes belonging to these NK families were previously characterized in the Nematostella vectensis genome by Ryan et al. (2006; see also the phylogenetic analysis in Larroux et al., 2007). Similarly, we searched these genomes (and the Nematostella vectensis genome) for orthologs of kinesin 2 KIF3B/C, tetracyclin resistance, Vacuolar Protein Sorting 8, Inositol Polyphosphate-5-Phosphatase A, and kinesin 5 KIF11. Identified sequences were classified into the different families using BlastP and/or a neighbor-joining phylogenetic tree (data not shown). Upstream intergenic region length and orientation of upstream gene were then determined through visual inspection of gene models in the genome browsers.
 |
Results and Discussion
|
|---|
Structure and composition of the Amphimedon NK cluster
The demosponge Amphimedon queenslandica has 8 NK genes in its genome, of which 6 are clustered within a 71-kb stretch of DNA (Fig. 1; Larroux et al., 2007). As this cluster is similar to that found in bilaterians, we infer that this is a metazoan synapomorphy. There is no evidence of other ANTP class homeobox genes—Hox, ParaHox and EHGbox—in the sponge genome, thus we also hypothesize that the eumetazoan ProtoHox gene originated from within the ancestral NK cluster after the divergence of sponge and eumetazoan lineages. The Amphimedon NK cluster consists of NK2/3/4-, Msx-, Hex-, Tlx-, and two NK5/6/7-related genes, one of which encodes two homeodomains and the other one of which has two different splice forms (Fig. 1). This cluster is markedly smaller than the inferred ancestral bilaterian cluster (Luke et al., 2003; Garcia-Fernandez, 2005; Larroux et al., 2007) and has a number of non-homeobox genes located within it. To illustrate this difference in size, we mapped the genomic regions corresponding to two conserved gene linkages, Tlx and NK5/6/7 (Fig. 2A) and Msx and Hex (Fig. 2B) in Amphimedon and in the chordate Branchiostoma floridae. Between Tlx and NK5/6/7, there are more genes in Amphimedon than in Branchiostoma (5 vs. 2) and the distance between the two genes is shorter in Amphimedon than in Branchiostoma (26 kb vs. 86 kb). Branchiostoma has two Hex-Msx clusters probably resulting from duplication of the genomic region. Not only is the genomic region between the two genes larger in this chordate than in Amphimedon (195–220 kb vs. 25 kb), but it also contains many more genes (11–15 vs. 4). A striking feature of the Amphimedon cluster is the high density of genes and the limited amount of intergenic DNA. In terms of the homeobox genes AmqNK2/3/4, AmqMsx, AmqHex, AmqTlx, AmqNK5/6/7A, and AmqNK5/6/7B, the estimated upstream intergenic regions range in size from 33 to 973 bp (Fig. 1; Appendix Table A1). This size range is similar for the non-homeobox genes in and flanking the Amphimedon NK cluster (227 to 1685 bp; Fig. 1; Appendix Table A2).

View larger version (22K):
[in this window]
[in a new window]
|
Figure 1. The Amphimedon NK cluster. The cluster is drawn linearly, with 10 kb per line. Names of the genes are above the gene location. Arrowheads designate the orientation of the genes. Exons, introns, and 5' and 3' untranslated regions are indicated by thick black arrows, thick black lines, grey boxes, and grey arrowheads, respectively. An alternative version of NK5/6/7a is indicated on the lower line (NK5/6/7a1 splice site derived from expressed sequence tags, NK5/6/7a2 5'untranslated regions and splice site derived from 5'RACE). Graphics drawn using GFF2PS (Abril and Guigo, 2000).
|
|

View larger version (23K):
[in this window]
[in a new window]
|
Figure 2. Comparison of (A) Tlx-NK5/6/7 and (B) Msx-Hex genomic regions in Amphimedon (Amq) and in the chordate Branchiostoma floridae (Bf). Branchiostoma has two Msx-Hex genomic regions. See Figure 1 for further explanation.
|
|
View this table:
[in this window]
[in a new window]
|
Table A1 Orientation and length of upstream intergenic regions of NK genes in Amphimedon (Amq), Nematostella (Nv), Lottia (Lg), Drosophila (Dm), C. elegans (Ce), Branchiostoma (Bf), and Ciona (Ci)
|
|
View this table:
[in this window]
[in a new window]
|
Table A2 Orientation and length of upstream intergenic regions of structural genes in Amphimedon (Amq), Nematostella (Nv), Lottia (Lg), Drosophila (Dm), C. elegans (Ce), Branchiostoma (Bf), and Ciona (Ci)
|
|
A similar high gene density has been observed by Breter et al. (2003) for other clusters of non-homeobox genes in demosponges. These authors deduced that this high gene density was typical of the genome of the sponges they were studying—Suberites domuncula and Geodia cydonium—and inferred that these species genomes consisted of about 300,000 genes. While we estimate that the Amphimedon genome has at least an order of magnitude fewer genes, we do find that in general it consists of clusters of tightly packed genes (unpubl. data), suggesting that this may be a common feature of demosponge genomes.
Representative NK-related genes are expressed in simple patterns during Amphimedon embryogenesis
The small intergenic regions of the Amphimedon NK genes suggest that there might be limited regulatory information available to drive complex patterns of developmental gene expression. We addressed this supposition by analyzing the embryonic expression patterns of three representative genes: AmqNK2/3/4, AmqTlx, and AmqNK5/6/7B (Fig. 3). Although patterning processes appear to be less complex in this sponge than in bilaterians (Adamska et al., 2007a, b), gastrulation, patterning, and differentiation events in Amphimedon give rise to a radially symmetrical parenchymella larva with 3 cell layers, at least 11 differentiated cell types, and a tissue-like structure, the photoreceptor pigment ring (Leys and Degnan, 2002). AmqNK2/3/4 transcripts are first detected during cleavage in a small number of micromeres per embryo (Fig. 3A, B). At gastrulation these localize to the inner cell mass, where they remain (Fig. 3C, D). Throughout development, the number of cells that highly express this gene seems to be limited. AmqTlx expression is not detected before the spot stage (late gastrulation) by whole-mount in situ hybridization (Fig. 3E–H). At this stage, expression is predominantly in cells surrounding the ring (Fig. 3E, F). Later in development, there is no evidence of expression around the ring. Instead, AmqTlx transcripts are restricted to cells of the inner cell mass (Fig. 3G, H). AmqNK5/6/7B appears to be expressed in a slightly more complex pattern, with two populations of micromeres expressing this gene through cleavage, gastrulation, and larva formation (Fig. 3I–M). These cells are fated to the outer layer, which contains a limited number of cell types (Leys and Degnan, 2002). During ring formation, AmqNK5/6/7B is activated in cells underlying the ring (Fig. 3J–M) that are clearly distinct from the micromeric expression. From these in situ hybridization data, we estimate that these three NK genes are expressed in 2–4 cell types during embryogenesis. The expression of AmqNK5/6/7B and AmqTlx in the vicinity of the pigment spot and ring during the formation of the ring suggests that these genes are playing a role in patterning this structure.

View larger version (86K):
[in this window]
[in a new window]
|
Figure 3. Developmental expression of Amphimedon NK genes during embryogenesis as observed by whole-mount in situ hybridization. (A–D) AmqNK2/3/4: (A, B) Cleavage. AmqNK2/3/4 transcripts differentially accumulate in a small number of micromeres (arrow). (C, D) Spot formation/late gastrulation. AmqNK2/3/4-expressing cells are predominantly localized to the inner cell mass; cells with higher levels of expression are localized predominantly on the outer edge of the mass. (E–H) AmqTlx: (E) Spot stage; posterior view. A high level of AmqTlx expression is present in the cells surrounding and adjacent to the pigment spot. (F, G) Early ring formation. AmqTlx transcripts are localized in cells of the inner cell mass. (H) Larva. AmqTlx expression remains restricted to the inner cell mass. (I–M) AmqNK5/6/7B. (I, J, M) Sections; (K, L) whole mounts; (I) cleavage. AmqNK5/6/7B expression is localized to tiny micromeres (arrow) and slightly larger granular micromeres (arrowhead). (J) Spot stage. These two AmqNK5/6/7B-expresssing cell types are localized to the outer cell region. (K, L) Early ring stage and (M) larva (L, M—posterior view). Expression persists in the outer layer but increases markedly in cells adjacent to and underlying the ring (arrow in L).
|
|
Bilaterian NK genes are expressed in all three germ layers and play a major role in mesoderm and nervous system development (reviewed in Banerjee-Basu and Baxevanis, 2001; Jagla et al., 2001; Carroll et al., 2005; Garcia-Fernandez, 2005; Stanfel et al., 2005; Slack, 2006). Given that Amphimedon NK genes are expressed in cells that do not have clear eumetazoan homologs, it is difficult to infer what the role of these homeobox genes in the last common ancestor to sponges and eumetazoans might have been. Nonetheless, it does appear that the sponge NK genes are expressed in a limited number of cell types and developmental stages, in contrast to what has been observed in representative bilaterians. More generally, expression patterns of multiple Amphimedon transcription factors and signaling molecules suggest that these genes do not have a multitude of roles during Amphimedon development (e.g., they are expressed in what appear to be a limited number of cell lineages throughout embryogenesis; Larroux et al., 2006; Adamska et al., 2007a, b; unpubl. data), indicating that regulatory genes may be less pleiotropic in sponges than in bilaterians.
Although we have suggested that whole-mount in situ hybridization data may be used as a proxy to deduce the complexity of the regulatory information controlling the expression of a given gene, a number of important caveats are associated with this approach. First, we only investigated embryogenesis and thus do not know how these genes are expressed during metamorphosis, in the juvenile, or in the adult. New expression patterns at any of these stages of the Amphimedon life cycle will require additional cis-regulatory modules. Second, without accurate cell lineage data for Amphimedon embryos, we cannot exclude the possibility that the cell type we are inferring to be constant between stages is actually changing. Evidence from the developmental expression of another NK-related gene, AmqBsh (formerly RenBsh), indicates that a single cell type does express the gene throughout embryogenesis (Larroux et al., 2006). In this case, AmqBsh is expressed in cells fated to become sclerocytes, which produce spicules and have a distinctive morphology (Leys and Degnan, 2002) and thus can be easily traced.
Comparative genomics of genes in the sponge NK cluster
To test whether there is a significant difference in the sizes of the upstream intergenic region of the eumetazoan orthologs of homeobox and non-homeobox genes in the Amphimedon NK cluster, we analyzed publicly available gene models for the cnidarian Nematostella vectensis and representative bilaterians: the lophotrochozoan Lottia gigantea, the ecdyzosoans Caenorhabditis elegans and Drosophila melanogaster, and the deuterostomes Branchiostoma floridae and Ciona intestinalis (Fig. 4; Appendix Tables A1 and A2). These bilaterian organisms were chosen to span the three major superphyletic lineages: lophotrochozoan; ecdysozoan; and deuterostome. Whereas the genomes of Lottia (360 Mbp), Nematostella (450 Mbp), and Branchiostoma (600 Mbp) are at least twice the size of the Amphimedon genome (
167 Mbp), the Drosophila and Ciona genomes are of a similar size to the sponge genome (170–180 Mbp) and C. elegans has a smaller genome (97 Mbp). We chose these latter three genomes to test whether smaller intergenic regions were merely an effect of smaller genome sizes. Gene duplication in the eumetazoan lineage meant that there were often a number of orthologs in each species for a given Amphimedon NK gene (Appendix Table A1).

View larger version (23K):
[in this window]
[in a new window]
|
Figure 4. Comparison of the lengths of upstream intergenic regions of (A) NK cluster homeobox genes and (B) structural non-homeobox genes found in the Amphimedon NK cluster in Amphimedon queenslandica (Amq), Nematostella vectensis (Nv), Drosophila melanogaster (Dm), Caenorhabditis elegans (Ce), Lottia gigantea (Lg), and Ciona intestinalis (Ci). Analyses are grouped by gene. The Total group represents the average and standard deviation of all genes analyzed. Upstream intergenic region length of all Amphimedon NK cluster genes (A) is significantly different from that of all other species analyzed, while upstream intergenic region length of all Amphimedon structural genes (B) is not significantly different from that of all other species (see Table 1 for paired t-test values).
|
|
We compared the intergenic lengths for each of the NK gene families. Where there is more than one ortholog, averages and standard deviations are shown (Fig. 4A). Comparison of these regions reveals that the Amphimedon NK genes have consistently smaller upstream intergenic regions than the orthologous genes from representative bilaterians. In total, the Amphimedon NK genes have significantly smaller intergenic regions compared to those from eumetazoans (see paired values for the Student's t-test in Table 1). As this is also the case for C. elegans, Drosophila, and Ciona, which have genomes that are smaller than or similar in size to those of Amphimedon, this trend does not appear to be a function of genome size but rather to reflect a fundamental difference between Amphimedon and these eumetazoan representatives. Although the Nematostella NK genes have generally smaller intergenic regions than those of their bilaterian orthologs, in total there is no significant difference (Fig. 4A; see paired t-test values in Table 1).
View this table:
[in this window]
[in a new window]
|
Table 1 Paired t-test values and significance levels: Amphimedon (Amq), Nematostella (Nv), Lottia (Lg), Drosophila (Dm), C. elegans (Ce), Branchiostoma (Bf), and Ciona (Ci)
|
|
On the whole, eumetazoan orthologs of the five structural genes linked to the Amphimedon NK cluster have smaller upstream intergenic distances than the NK ortholog (Fig. 4B; Appendix Table A2). Taken together, the distances of Amphimedon structural genes are not significantly different than those of eumetazoans (Fig. 4; see paired t-test values in Table 1). These data are compatible with a difference in proximal regulatory capacity between sponges and eumetazoans occurring specifically in genes belonging to developmental GRNs rather than being a genome-wide trend. Sponge regulatory genes thus appear to possess simpler proximal cis-regulatory control sequences upstream of their transcript start site than their eumetazoan counterparts. Clearly, the supposition that the Amphimedon genome houses less regulatory information per developmental gene than eumetazoan genomes needs to be investigated on a much larger scale, with the inclusion of data from a wider sample of gene classes and families. At this stage, equally parsimonious interpretations of the data presented here are that (i) there has been a compaction and simplification of upstream regulatory region DNA in the sponge lineage, and (ii) a bulk of cis-regulatory DNA is housed elsewhere in the genome. Whether there has been expansion of intergenic region in the eumetazoan lineage or reduction in the sponge lineage, there appears to be an interesting correlation between the size of intergenic DNA and morphological diversity and complexity. This observation supports genome-wide studies undertaken in Drosophila and C. elegans that reveal that the regulatory complexity of a given gene correlates positively with the amount of surrounding intergenic DNA (Nelson et al., 2004).
 |
Conclusions
|
|---|
The evolution of a developmental gene regulatory network (GRN) may have been one of the prerequisites for the evolution of multicellularity and embryogenesis. Indeed, comparison of early-branching metazoan genomes reveals that the components required to assemble basic GRN—that is, conserved transcription factors and signaling pathways—evolved prior to metazoan cladogenesis. The duplication and divergence of genes encoding signaling pathway components and transcription factors early in eumetazoan evolution apparently allowed for ancient networks to expand and evolve, and for new networks to form. In this study, we have emphasized another mechanism by which simple GRNs may have increased in complexity during the transition from an ancestral metazoan to a eumetazoan grade of organization. Increases in the size and complexity of the cis-regulatory regions required to direct expression of GRN components may have allowed for the evolution of developmental gene pleiotropy by increasing the number of ontogenetic contexts in which a given gene is employed. Using a handful of NK genes clustered in the genome of the sponge Amphimedon, we provide support for the proposal that an increase in proximal, upstream cis-regulatory information of genes belonging to developmental GRNs occurred in the stem leading to the eumetazoan last common ancestor. Because this study provides no measure of cis-regulatory information housed elsewhere in the genome, we recognise that direct measures of intergenic region size may be at best a general indicator of regulatory complexity (cf. Nelson et al., 2004). Nonetheless, the relatively simple expression patterns of the Amphimedon NK genes during embryogenesis are compatible with the idea that these genes have limited regulatory information. Given that this is a single case study that is based on a simple comparison of upstream intergenic sizes, further comparisons and experiments are required before we can include cis-regulatory expansion as a primary force in the early evolution of metazoan morphological complexity.
 |
Acknowledgments
|
|---|
We gratefully acknowledge the significant contribution and support of The US Department of Energy Joint Genome Institute in the production of Amphimedon (Reniera) genomic and EST sequences used in this study through the Community Sequencing Program. The research was supported by grants from the Australian Research Council to B.M.D.
 |
Footnotes
|
|---|
Received 7 January 2008; accepted 11 March 2008.
* These authors contributed equally to this paper. 
 |
Literature Cited
|
|---|
Abril, J. F., and R. Guigó. 2000. gff2ps: visualizing genomic annotations. Bioinformatics 16:743–744.[Abstract/Free Full Text]
Adamska, M., S. M. Degnan, K. M. Green, M. Adamski, A. Craigie, C. Larroux, and B. M. Degnan. 2007a. Wnt and TGF-β expression in the sponge Amphimedon queenslandica and the origin of metazoan embryonic patterning. PLoS ONE 2:e1031.
Adamska, M., D. Q. Matus, M. Adamski, K. M. Green, M. Q. Martindale, and B. M. Degnan. 2007b. The evolutionary origin of hedgehog proteins. Curr. Biol. 17:R836–837.[ISI][Medline]
Banerjee-Basu, S., and A. D. Baxevanis. 2001. Molecular evolution of the homeodomain family of transcription factors. Nucleic Acids Res. 29:3258–3269.[Abstract/Free Full Text]
Borchiellini, C., M. Manuel, E. Alivon, N. Boury-Esnault, J. Vacelet, and Y. Le Parco. 2001. Sponge paraphyly and the origin of Metazoa. J. Evol. Biol. 14:171–179.[ISI]
Breter, H. J., V. A. Grebenjuk, A. Skorokhod, and W. E. G. Muller. 2003. Approaches for a sustainable use of the bioactive potential in sponges: analysis of gene clusters, differential display of mRNA and DNA chips. Pp. 199–230 in Sponges: (Porifera), W. E. G. Muller, ed. Springer, Berlin.
Burger, G., L. Forget, Y. Zhu, M. W. Gray, and B. F. Lang. 2003. Unique mitochondrial genome architecture in unicellular relatives of animals. Proc. Natl. Acad. Sci. USA 100:892–897.[Abstract/Free Full Text]
Carroll, S. B., J. K. Grenier, and S. D. Weatherbee. 2005. From DNA to Diversity: Molecular Genetics and the Evolution of Animal Design. Blackwell Science, Malden, MA.
Cavalier-Smith, T., and E. E. Y. Chao. 2003. Phylogeny of choanozoa, apusozoa, and other protozoa and early eukaryote megaevolution. J. Mol. Evol. 56:540–563.[ISI][Medline]
Cavalier-Smith, T., E. E. Chao, N. Boury-Esnault, and J. Vacelet. 1996. Sponge phylogeny, animal monophyly, and the origin of the nervous system: 18S rRNA evidence. Can. J. Zool. 74:2031–2045.
Chourrout, D., F. Delsuc, P. Chourrout, R. B. Edvardsen, F. Rentzsch, E. Renfer, M. F. Jensen, B. Zhu, P. de Jong, R. E. Steele, and U. Technau. 2006. Minimal ProtoHox cluster inferred from bilaterian and cnidarian Hox complements. Nature 442:684–687.[Medline]
Collins, A. G. 2002. Phylogeny of Medusozoa and the evolution of cnidarian life cycles. J. Evol. Biol. 15:418–432.[ISI]
Davidson, E. H. 2006. The Regulatory Genome: Gene Regulatory Networks in Development and Evolution. Academic Press, New York.
Degnan, S. M., and B. M. Degnan. 2006. The origin of the pelagobenthic metazoan life cycle: what's sex got to do with it? Integr. Comp. Biol. 46:683–690.[Abstract/Free Full Text]
Degnan, B. M., S. P. Leys, and C. Larroux. 2005. Sponge development and antiquity of animal pattern formation. Integr. Comp. Biol. 45:335–341.[Abstract/Free Full Text]
Garcia-Fernandez, J. 2005. The genesis and evolution of homeobox gene clusters. Nat. Rev. Genet. 6:881–892.[ISI][Medline]
Gauthier, M., and B. M. Degnan. 2008. The transcription factor NF-
B in the demosponge Amphimedon queenslandica: insights on the evolutionary origin of the Rel homology domain. Dev. Genes Evol. 218:23–32.[ISI][Medline]
Jagla, K., M. Bellard, and M. Frasch. 2001. A cluster of Drosophila homeobox genes involved in mesoderm differentiation programs. BioEssays 23:125–133.[ISI][Medline]
Kamm, K., B. Schierwater, W. Jakob, S. L. Dellaporta, and D. J. Miller. 2006. Axial patterning and diversification in the Cnidaria predate the Hox system. Curr. Biol. 16:1–7.[ISI][Medline]
King, N. 2004. The unicellular ancestry of animal development. Dev. Cell 7:313–325.[ISI][Medline]
King, N., and S. B. Carroll. 2001. A receptor tyrosine kinase from choanoflagellates: molecular insights into early animal evolution. Proc. Natl. Acad. Sci. USA 98:15032–15037.[Abstract/Free Full Text]
King, N., C. T. Hittinger, and S. B. Carroll. 2003. Evolution of key cell signaling and adhesion protein families predates animal origins. Science 301:361–363.[Abstract/Free Full Text]
King, N., M. J. Westbrook, S. L. Young, A. Kuo, M. Abedin, J. Chapman, S. Fairclough, U. Hellsten, Y. Isogai, I. Letunic, et al. 2008. The genome of the choanoflagellate Monosiga brevicollis and the origin of metazoans. Nature 451:783–788.[Medline]
Kusserow, A., K. Pang, C. Sturm, M. Hrouda, J. Lentfer, H. A. Schmidt, U. Technau, A. von Haeseler, B. Hobmayer, M. Q. Martindale, and T. W. Holstein. 2005. Unexpected complexity of the Wnt gene family in a sea anemone. Nature 433:156–160.[Medline]
Lang, B. F., C. O'Kelly, T. Nerad, M. W. Gray, and G. Burger. 2002. The closest unicellular relatives of animals. Curr. Biol. 12:1773–1778.[ISI][Medline]
Larroux, C., B. Fahey, D. Liubicich, V. F. Hinman, M. Gauthier, M. Gongora, K. Green, G. Worheide, S. P. Leys, and B. M. Degnan. 2006. Developmental expression of transcription factor genes in a demosponge: insights into the origin of metazoan multicellularity. Evol. Dev. 8:150–173.[ISI][Medline]
Larroux, C., B. Fahey, S. M. Degnan, M. Adamski, D. S. Rokhsar, and B. M. Degnan. 2007. The NK homeobox gene cluster predates the origin of Hox genes. Curr. Biol. 17:706–710.[ISI][Medline]
Larroux, C., G. N. Luke, P. Koopman, D. S. Rokhsar, S. M. Shimeld, and B. M. Degnan. 2008. Genesis and expansion of metazoan transcription factor gene classes. Mol. Biol. Evol. 25:980–996.[Abstract/Free Full Text]
Leys, S. P., and B. M. Degnan. 2002. Embryogenesis and metamorphosis in a haplosclerid demosponge: gastrulation and transdifferentiation of larval ciliated cells to choanocytes. Invertebr. Biol. 121:171–189.
Li, C. W., J. Y. Chen, and T. E. Hua. 1998. Precambrian sponges with cellular structures. Science 279:879–882.[Abstract/Free Full Text]
Luke, G. N., L. F. C. Castro, K. McLay, C. Bird, A. Coulson, and P. W. H. Holland. 2003. Dispersal of NK homeobox gene clusters in amphioxus and humans. Proc. Natl. Acad. Sci. USA 100:5292–5295.[Abstract/Free Full Text]
Magie, C. R., K. Pang, and M. Q. Martindale. 2005. Genomic inventory and expression of Sox and Fox genes in the cnidarian Nematostella vectensis. Dev. Genes Evol. 215:618–630.[ISI][Medline]
Medina, M., A. G. Collins, J. D. Silberman, and M. L. Sogin. 2001. Evaluating hypotheses of basal animal phylogeny using complete sequences of large and small subunit rRNA. Proc. Natl. Acad. Sci. USA 98:9707–9712.[Abstract/Free Full Text]
Miller, D. J., E. E. Ball, and U. Technau. 2005. Cnidarians and ancestral genetic complexity in the animal kingdom. Trends Genet. 21:536–539.[ISI][Medline]
Nelson, C. E., B. M. Hersh, and S. B. Carroll. 2004. The regulatory content of intergenic DNA shapes genome architecture. Genome Biol. 5:R25.[Medline]
Nichols, S. A., W. Dirks, J. S. Pearse, and N. King. 2006. Early evolution of animal cell signaling and adhesion genes. Proc. Natl. Acad. Sci. USA 103:12451–12456.[Abstract/Free Full Text]
Putnam, N., M. Srivastava, U. Hellsten, B. Dirks, J. Chapman, A. Salamov, A. Terry, H. Shapiro, E. Lindquist, V. V. Kapitonov, et al. 2007. Sea anemone genome reveals ancestral eumetazoan gene repertoire and genomic organization. Science 317:86–94.[Abstract/Free Full Text]
Ryan, J. F., P. M. Burton, M. E. Mazza, G. K. Kwong, J. C. Mullikin, and J. R. Finnerty. 2006. The cnidarian-bilaterian ancestor possessed at least 56 homeoboxes. Evidence from the starlet sea anemone, Nematostella vectensis. Genome Biol. 7:R64.[Medline]
Simionato, E., V. Ledent, G. Richards, M. Thomas-Chollier, P. Kerner, D. Coornaert, B. M. Degnan, and M. Vervoort. 2007. Origin and diversification of the basic helix-loop-helix gene family in metazoans: insights from comparative genomics. BMC Evol. Biol. 7:33.[Medline]
Simpson, T. L. 1984. The Cell Biology of Sponges. Springer-Verlag, New York.
Slack, J. M. W. 2006. Essential Developmental Biology. Blackwell Publishing, Malden, MA.
Snell, E. A., R. F. Furlong, and P. W. H. Holland. 2001. Hsp70 sequences indicate that choanoflagellates are closely related to animals. Curr. Biol. 11:967–970.[ISI][Medline]
Stanfel, M. N., K. A. Moses, R. J. Schwartz, and W. E. Zimmer. 2005. Regulation of organ development by the Nkx-homeodomain factors: an Nkx code. Cell. Mol. Biol. 51: OL785–OL799.[ISI][Medline]
Stanke, M., O. Schöffmann, B. Morgenstern, and S. Waack. 2006. Gene prediction in eukaryotes with a generalized hidden Markov model that uses hints from external sources. BMC Bioinformatics 7:62.[Medline]
Steenkamp, E. T., J. Wright, and S. L. Baldauf. 2006. The protistan origins of animals and fungi. Mol. Biol. Evol. 23:93–106.[Abstract/Free Full Text]
Wallberg, A., M. Thollesson, J. S. Farris, and U. Jondelius. 2004. The phylogenetic position of the comb jellies (Ctenophora) and the importance of taxonomic sampling. Cladistics 20:558–578.[ISI]
Yamada, A., K. Pang, M. Q. Martindale, and S. Tochinai. 2007. Surprisingly complex T-box gene complement in diploblastic metazoans. Evol. Dev. 9:220–230.[ISI][Medline]
This article has been cited by other articles:

|
 |

|
 |
 
B. M. Degnan, M. Adamska, A. Craigie, S. M. Degnan, B. Fahey, M. Gauthier, J. N.A. Hooper, C. Larroux, S. P. Leys, E. Lovas, et al.
The Demosponge Amphimedon queenslandica: Reconstructing the Ancestral Metazoan Genome and Deciphering the Origin of Animal Multicellularity
CSH Protocols,
December 1, 2008;
2008(13):
pdb.emo108 - pdb.emo108.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
R. A. Cameron and J. P. Rast
Biological Bulletin Virtual Symposium: Genomics of Large Marine Metazoans
Biol. Bull.,
June 1, 2008;
214(3):
203 - 204.
[Full Text]
[PDF]
|
 |
|