Title: Evaluation of the authenticity of a highly novel environmental sequence from boreal forest soil using ribosomal RNA secondary structure modeling
Author: Glass, D.J.; Takebayashi, N.; Olson, L.; Taylor, D.L.;
Source: Molecular Phylogenetics and Evolution
Publication Series: Scientific Journal (JRNL)
Description: The number of sequences from both formally described taxa and uncultured environmental DNA deposited in the International Nucleotide Sequence Databases has increased substantially over the last two decades. Although the majority of these sequences represent authentic gene copies, there is evidence of DNA artifacts in these databases as well. These include lab artifacts, such as PCR chimeras, and biological artifacts such as pseudogenes or other paralogous sequences. Sequences that fall in basal positions in phylogenetic trees and appear distant from known sequences are particularly suspect. Phylogenetic analyses suggest that a novel sequence type (NS1) found in two boreal forest soil clone libraries belongs to the fungal kingdom but does not fall unambiguously within any known phylum. We have evaluated this sequence type using an array of secondary-structure analyses. To our knowledge, such analyses have never been used on environmental ribosomal sequences. Ribosomal secondary structure was modeled for four rRNA loci (ITS1, 5.8S, ITS2, 5' LSU). These models were analyzed for the presence of conserved domains, conserved nucleotide motifs, and compensatory base changes. Minimal free energy (MFE) foldings and GC contents of sequences representing the major fungal clades, as well as NS1, were also compared. NS1 displays secondary rRNA structures consistent with other fungi and many, but not all, conserved nucleotide motifs found across eukaryotes. However, our analyses show that many other authentic sequences from basal fungi lack more of these conserved motifs than does NS1. Together our findings suggest that NS1 represents an authentic gene copy. The methods described here can be used on any rRNA-coding sequence, not just environmental fungal sequences. As new-generation sequencing methods that yield shorter sequences become more widely implemented, methods that evaluate sequence authenticity should also be more widely implemented. For fungi, the adjacent 5.8S and ITS2 loci should be prioritized. This region is not only suited to distinguishing between closely related species, but it is also more informative in terms of expected secondary structure.
Keywords: novel lineage, ribosomal RNA secondary structure, pseudogene, rDNA, fungi, metagenomic DNA
- We recommend that you also print this page and attach it to the printout of the article, to retain the full citation information.
- This article was written and prepared by U.S. Government employees on official time, and is therefore in the public domain.
XML: View XML
Glass, D.J.; Takebayashi, N.; Olson, L.; Taylor, D.L. 2013. Evaluation of the authenticity of a highly novel environmental sequence from boreal forest soil using ribosomal RNA secondary structure modeling. Molecular Phylogenetics and Evolution. 67: 234-245.
Get the latest version of the Adobe Acrobat reader or Acrobat Reader for Windows with Search and Accessibility