Limits...
Diversification of the C-TERMINALLY ENCODED PEPTIDE (CEP) gene family in angiosperms, and evolution of plant-family specific CEP genes.

Ogilvie HA, Imin N, Djordjevic MA - BMC Genomics (2014)

Bottom Line: Using a motif-based system developed for this study to identify canonical CEP peptide domains, a total of 916 CEP genes and 1,223 CEP domains were found in angiosperms and for the first time in gymnosperms.Both CEP genes and domains were found to have diversified in angiosperms, particularly in the Poaceae and Solanaceae plant families.Multispecies orthologous relationships were determined for 22% of identified CEP genes, and further analysis of those groups found selective constraints upon residues within the CEP peptide and within the previously little-characterized variable region.

View Article: PubMed Central - PubMed

Affiliation: Research School of Biology, The Australian National University, Canberra ACT 0200, Australia. huw.ogilvie@anu.edu.au.

ABSTRACT

Background: Small, secreted signaling peptides work in parallel with phytohormones to control important aspects of plant growth and development. Genes from the C-TERMINALLY ENCODED PEPTIDE (CEP) family produce such peptides which negatively regulate plant growth, especially under stress, and affect other important developmental processes. To illuminate how the CEP gene family has evolved within the plant kingdom, including its emergence, diversification and variation between lineages, a comprehensive survey was undertaken to identify and characterize CEP genes in 106 plant genomes.

Results: Using a motif-based system developed for this study to identify canonical CEP peptide domains, a total of 916 CEP genes and 1,223 CEP domains were found in angiosperms and for the first time in gymnosperms. This defines a narrow band for the emergence of CEP genes in plants, from the divergence of lycophytes to the angiosperm/gymnosperm split. Both CEP genes and domains were found to have diversified in angiosperms, particularly in the Poaceae and Solanaceae plant families. Multispecies orthologous relationships were determined for 22% of identified CEP genes, and further analysis of those groups found selective constraints upon residues within the CEP peptide and within the previously little-characterized variable region. An examination of public Oryza sativa RNA-Seq datasets revealed an expression pattern that links OsCEP5 and OsCEP6 to panicle development and flowering, and CEP gene trees reveal these emerged from a duplication event associated with the Poaceae plant family.

Conclusions: The characterization of the plant-family specific CEP genes OsCEP5 and OsCEP6, the association of CEP genes with angiosperm-specific development processes like panicle development, and the diversification of CEP genes in angiosperms provides further support for the hypothesis that CEP genes have been integral to the evolution of novel traits within the angiosperm lineage. Beyond these findings, the comprehensive set of CEP genes and their properties reported here will be a resource for future research on CEP genes and peptides.

Show MeSH
GC content of CEP coding sequences in plant families. Box plots show the median (center line), interquartile range (IQR, white box), range (whiskers) and outliers (circles). Outliers are defined as points more distant than 1.5 times the IQR from the median. Tukey’s test reveals that differences in GC content between all pairs of plant families except Brassicaceae-Fabaceae and Pinaceae-Rosaceae are statistically significant (P < 0.001).
© Copyright Policy - open-access
Related In: Results  -  Collection

License 1 - License 2
getmorefigures.php?uid=PMC4197245&req=5

Fig5: GC content of CEP coding sequences in plant families. Box plots show the median (center line), interquartile range (IQR, white box), range (whiskers) and outliers (circles). Outliers are defined as points more distant than 1.5 times the IQR from the median. Tukey’s test reveals that differences in GC content between all pairs of plant families except Brassicaceae-Fabaceae and Pinaceae-Rosaceae are statistically significant (P < 0.001).

Mentions: To explore a potential basis for the different AA frequencies observed within the CEP domain for different plant families, the proportion of guanine and cytosine nucleotides (GC content) was calculated for all CEP genes identified in this study. Furthermore, the distribution of CEP gene GC content in the six well-represented plant families was calculated (Figure 5), and pairwise distances in GC content between those families statistically tested. The GC content of Poaceae CEP genes was significantly higher (P < 0.001) than any other plant family, and the GC content of Solanaceae CEP genes was significantly lower than any other plant family (P < 0.001).


Diversification of the C-TERMINALLY ENCODED PEPTIDE (CEP) gene family in angiosperms, and evolution of plant-family specific CEP genes.

Ogilvie HA, Imin N, Djordjevic MA - BMC Genomics (2014)

GC content of CEP coding sequences in plant families. Box plots show the median (center line), interquartile range (IQR, white box), range (whiskers) and outliers (circles). Outliers are defined as points more distant than 1.5 times the IQR from the median. Tukey’s test reveals that differences in GC content between all pairs of plant families except Brassicaceae-Fabaceae and Pinaceae-Rosaceae are statistically significant (P < 0.001).
© Copyright Policy - open-access
Related In: Results  -  Collection

License 1 - License 2
Show All Figures
getmorefigures.php?uid=PMC4197245&req=5

Fig5: GC content of CEP coding sequences in plant families. Box plots show the median (center line), interquartile range (IQR, white box), range (whiskers) and outliers (circles). Outliers are defined as points more distant than 1.5 times the IQR from the median. Tukey’s test reveals that differences in GC content between all pairs of plant families except Brassicaceae-Fabaceae and Pinaceae-Rosaceae are statistically significant (P < 0.001).
Mentions: To explore a potential basis for the different AA frequencies observed within the CEP domain for different plant families, the proportion of guanine and cytosine nucleotides (GC content) was calculated for all CEP genes identified in this study. Furthermore, the distribution of CEP gene GC content in the six well-represented plant families was calculated (Figure 5), and pairwise distances in GC content between those families statistically tested. The GC content of Poaceae CEP genes was significantly higher (P < 0.001) than any other plant family, and the GC content of Solanaceae CEP genes was significantly lower than any other plant family (P < 0.001).

Bottom Line: Using a motif-based system developed for this study to identify canonical CEP peptide domains, a total of 916 CEP genes and 1,223 CEP domains were found in angiosperms and for the first time in gymnosperms.Both CEP genes and domains were found to have diversified in angiosperms, particularly in the Poaceae and Solanaceae plant families.Multispecies orthologous relationships were determined for 22% of identified CEP genes, and further analysis of those groups found selective constraints upon residues within the CEP peptide and within the previously little-characterized variable region.

View Article: PubMed Central - PubMed

Affiliation: Research School of Biology, The Australian National University, Canberra ACT 0200, Australia. huw.ogilvie@anu.edu.au.

ABSTRACT

Background: Small, secreted signaling peptides work in parallel with phytohormones to control important aspects of plant growth and development. Genes from the C-TERMINALLY ENCODED PEPTIDE (CEP) family produce such peptides which negatively regulate plant growth, especially under stress, and affect other important developmental processes. To illuminate how the CEP gene family has evolved within the plant kingdom, including its emergence, diversification and variation between lineages, a comprehensive survey was undertaken to identify and characterize CEP genes in 106 plant genomes.

Results: Using a motif-based system developed for this study to identify canonical CEP peptide domains, a total of 916 CEP genes and 1,223 CEP domains were found in angiosperms and for the first time in gymnosperms. This defines a narrow band for the emergence of CEP genes in plants, from the divergence of lycophytes to the angiosperm/gymnosperm split. Both CEP genes and domains were found to have diversified in angiosperms, particularly in the Poaceae and Solanaceae plant families. Multispecies orthologous relationships were determined for 22% of identified CEP genes, and further analysis of those groups found selective constraints upon residues within the CEP peptide and within the previously little-characterized variable region. An examination of public Oryza sativa RNA-Seq datasets revealed an expression pattern that links OsCEP5 and OsCEP6 to panicle development and flowering, and CEP gene trees reveal these emerged from a duplication event associated with the Poaceae plant family.

Conclusions: The characterization of the plant-family specific CEP genes OsCEP5 and OsCEP6, the association of CEP genes with angiosperm-specific development processes like panicle development, and the diversification of CEP genes in angiosperms provides further support for the hypothesis that CEP genes have been integral to the evolution of novel traits within the angiosperm lineage. Beyond these findings, the comprehensive set of CEP genes and their properties reported here will be a resource for future research on CEP genes and peptides.

Show MeSH