Limits...
Divergence of RNA polymerase α subunits in angiosperm plastid genomes is mediated by genomic rearrangement.

Blazier JC, Ruhlman TA, Weng ML, Rehman SK, Sabir JS, Jansen RK - Sci Rep (2016)

Bottom Line: Multiple lines of evidence indicated that the rpoA sequences are likely functional despite retaining as low as 30% nucleotide sequence identity with rpoA genes from outgroups in the same angiosperm order.The plastid genomes containing these divergent rpoA genes have experienced extensive structural rearrangement, including large expansions of the inverted repeat.We propose that illegitimate recombination, not positive selection, has driven the divergence of rpoA.

View Article: PubMed Central - PubMed

Affiliation: Department of Integrative Biology, University of Texas, Austin, TX 78712, USA.

ABSTRACT
Genes for the plastid-encoded RNA polymerase (PEP) persist in the plastid genomes of all photosynthetic angiosperms. However, three unrelated lineages (Annonaceae, Passifloraceae and Geraniaceae) have been identified with unusually divergent open reading frames (ORFs) in the conserved region of rpoA, the gene encoding the PEP α subunit. We used sequence-based approaches to evaluate whether these genes retain function. Both gene sequences and complete plastid genome sequences were assembled and analyzed from each of the three angiosperm families. Multiple lines of evidence indicated that the rpoA sequences are likely functional despite retaining as low as 30% nucleotide sequence identity with rpoA genes from outgroups in the same angiosperm order. The ratio of non-synonymous to synonymous substitutions indicated that these genes are under purifying selection, and bioinformatic prediction of conserved domains indicated that functional domains are preserved. One of the lineages (Pelargonium, Geraniaceae) contains species with multiple rpoA-like ORFs that show evidence of ongoing inter-paralog gene conversion. The plastid genomes containing these divergent rpoA genes have experienced extensive structural rearrangement, including large expansions of the inverted repeat. We propose that illegitimate recombination, not positive selection, has driven the divergence of rpoA.

No MeSH data available.


Related in: MedlinePlus

Maximum likelihood tree generated for all 46 rpoA ORFs from 26 Pelargonium species with likelihood score −21428.281249 lnL.Species in clade C2 contain two (P. spinosum and P. endlicherianum), three (four species from section Ciconium) or six (P. transvaalense) rpoA paralogs. Bootstrap values greater than 50 are shown at the nodes; values of 100 are indicated by asterisks. Scale bar indicates non-synonymous substitutions per codon. The constraint tree (inset) does not contain clade C2 taxa as these species all contain multiple rpoA sequences.
© Copyright Policy - open-access
Related In: Results  -  Collection

License
getmorefigures.php?uid=PMC4834550&req=5

f4: Maximum likelihood tree generated for all 46 rpoA ORFs from 26 Pelargonium species with likelihood score −21428.281249 lnL.Species in clade C2 contain two (P. spinosum and P. endlicherianum), three (four species from section Ciconium) or six (P. transvaalense) rpoA paralogs. Bootstrap values greater than 50 are shown at the nodes; values of 100 are indicated by asterisks. Scale bar indicates non-synonymous substitutions per codon. The constraint tree (inset) does not contain clade C2 taxa as these species all contain multiple rpoA sequences.

Mentions: The dN/dS ratio was calculated for the three different lineages of angiosperms. Seven plastid genes (rpoA, rpoB, rpoC1, rpoC2, ndhF, matK and rbcL) were analyzed in PAML for three datasets to compare the dN/dS ratio of rpoA to the other rpo genes as well as to other non-rpo plastid genes. These same seven genes were used to generate constraint trees for each dataset. Constraint tree topologies were identical to the matK trees for Annonaceae (Fig. 2A) and Passifloraceae (Fig. 3A). The seven gene constraint tree for Geraniaceae is shown as an inset in Fig. 4.


Divergence of RNA polymerase α subunits in angiosperm plastid genomes is mediated by genomic rearrangement.

Blazier JC, Ruhlman TA, Weng ML, Rehman SK, Sabir JS, Jansen RK - Sci Rep (2016)

Maximum likelihood tree generated for all 46 rpoA ORFs from 26 Pelargonium species with likelihood score −21428.281249 lnL.Species in clade C2 contain two (P. spinosum and P. endlicherianum), three (four species from section Ciconium) or six (P. transvaalense) rpoA paralogs. Bootstrap values greater than 50 are shown at the nodes; values of 100 are indicated by asterisks. Scale bar indicates non-synonymous substitutions per codon. The constraint tree (inset) does not contain clade C2 taxa as these species all contain multiple rpoA sequences.
© Copyright Policy - open-access
Related In: Results  -  Collection

License
Show All Figures
getmorefigures.php?uid=PMC4834550&req=5

f4: Maximum likelihood tree generated for all 46 rpoA ORFs from 26 Pelargonium species with likelihood score −21428.281249 lnL.Species in clade C2 contain two (P. spinosum and P. endlicherianum), three (four species from section Ciconium) or six (P. transvaalense) rpoA paralogs. Bootstrap values greater than 50 are shown at the nodes; values of 100 are indicated by asterisks. Scale bar indicates non-synonymous substitutions per codon. The constraint tree (inset) does not contain clade C2 taxa as these species all contain multiple rpoA sequences.
Mentions: The dN/dS ratio was calculated for the three different lineages of angiosperms. Seven plastid genes (rpoA, rpoB, rpoC1, rpoC2, ndhF, matK and rbcL) were analyzed in PAML for three datasets to compare the dN/dS ratio of rpoA to the other rpo genes as well as to other non-rpo plastid genes. These same seven genes were used to generate constraint trees for each dataset. Constraint tree topologies were identical to the matK trees for Annonaceae (Fig. 2A) and Passifloraceae (Fig. 3A). The seven gene constraint tree for Geraniaceae is shown as an inset in Fig. 4.

Bottom Line: Multiple lines of evidence indicated that the rpoA sequences are likely functional despite retaining as low as 30% nucleotide sequence identity with rpoA genes from outgroups in the same angiosperm order.The plastid genomes containing these divergent rpoA genes have experienced extensive structural rearrangement, including large expansions of the inverted repeat.We propose that illegitimate recombination, not positive selection, has driven the divergence of rpoA.

View Article: PubMed Central - PubMed

Affiliation: Department of Integrative Biology, University of Texas, Austin, TX 78712, USA.

ABSTRACT
Genes for the plastid-encoded RNA polymerase (PEP) persist in the plastid genomes of all photosynthetic angiosperms. However, three unrelated lineages (Annonaceae, Passifloraceae and Geraniaceae) have been identified with unusually divergent open reading frames (ORFs) in the conserved region of rpoA, the gene encoding the PEP α subunit. We used sequence-based approaches to evaluate whether these genes retain function. Both gene sequences and complete plastid genome sequences were assembled and analyzed from each of the three angiosperm families. Multiple lines of evidence indicated that the rpoA sequences are likely functional despite retaining as low as 30% nucleotide sequence identity with rpoA genes from outgroups in the same angiosperm order. The ratio of non-synonymous to synonymous substitutions indicated that these genes are under purifying selection, and bioinformatic prediction of conserved domains indicated that functional domains are preserved. One of the lineages (Pelargonium, Geraniaceae) contains species with multiple rpoA-like ORFs that show evidence of ongoing inter-paralog gene conversion. The plastid genomes containing these divergent rpoA genes have experienced extensive structural rearrangement, including large expansions of the inverted repeat. We propose that illegitimate recombination, not positive selection, has driven the divergence of rpoA.

No MeSH data available.


Related in: MedlinePlus