Journal Article
Title | Sample richness and genetic diversity as drivers of chimera formation in nSSU metagenetic analyses. |
Publication Type | Journal Article |
Authors | Fonseca, VG, Nichols, B, Lallias, D, Quince, C, Carvalho, GR, Power, DM, Creer, S |
Year of Publication | 2012 |
Journal | Nucleic Acids Res |
Volume | 40 |
Issue | 9 |
Date Published | 2012 May |
Pagination | e66 |
ISSN | 1362-4962 |
Keywords | Animals, Artifacts, Base Sequence, Genetic Variation, Metagenomics, Molecular Sequence Data, Nematoda, Phylogeny, Polymerase Chain Reaction, Ribosome Subunits, Small, Eukaryotic, RNA, Ribosomal, 18S, Sequence Analysis, DNA |
Abstract | Eukaryotic diversity in environmental samples is often assessed via PCR-based amplification of nSSU genes. However, estimates of diversity derived from pyrosequencing environmental data sets are often inflated, mainly because of the formation of chimeric sequences during PCR amplification. Chimeras are hybrid products composed of distinct parental sequences that can lead to the misinterpretation of diversity estimates. We have analyzed the effect of sample richness, evenness and phylogenetic diversity on the formation of chimeras using a nSSU data set derived from 454 Roche pyrosequencing of replicated, large control pools of closely and distantly related nematode mock communities, of known intragenomic identity and richness. To further investigate how chimeric molecules are formed, the nSSU gene secondary structure was analyzed in several individuals. For the first time in eukaryotes, chimera formation proved to be higher in both richer and more genetically diverse samples, thus providing a novel perspective of chimera formation in pyrosequenced environmental data sets. Findings contribute to a better understanding of the nature and mechanisms involved in chimera formation during PCR amplification of environmentally derived DNA. Moreover, given the similarities between biodiversity analyses using amplicon sequencing and those used to assess genomic variation, our findings have potential broad application for identifying genetic variation in homologous loci or multigene families in general. |
DOI | 10.1093/nar/gks002 |
Sapientia | |
Alternate Journal | Nucleic Acids Res. |
PubMed ID | 22278883 |
PubMed Central ID | PMC3351157 |
Grant List | / / Biotechnology and Biological Sciences Research Council / United Kingdom |