Limits...
Validation of Genotyping-By-Sequencing Analysis in Populations of Tetraploid Alfalfa by 454 Sequencing.

Rocher S, Jean M, Castonguay Y, Belzile F - PLoS ONE (2015)

Bottom Line: A combination of genome complexity reduction and multiplexing with DNA barcoding provides a simple and affordable way to resolve allelic variation between plant samples or populations.About 60% had a significant match on the Medicago truncatula syntenic genome.Our results confirm that analysis of GBS data using UNEAK is a reliable approach for genome-wide discovery of SNP loci in outcrossed polyploids.

View Article: PubMed Central - PubMed

Affiliation: Centre de Recherche et de Développement sur les Sols et les Grandes Cultures, Agriculture et agroalimentaire Canada, Quebec City (QC), Canada.

ABSTRACT
Genotyping-by-sequencing (GBS) is a relatively low-cost high throughput genotyping technology based on next generation sequencing and is applicable to orphan species with no reference genome. A combination of genome complexity reduction and multiplexing with DNA barcoding provides a simple and affordable way to resolve allelic variation between plant samples or populations. GBS was performed on ApeKI libraries using DNA from 48 genotypes each of two heterogeneous populations of tetraploid alfalfa (Medicago sativa spp. sativa): the synthetic cultivar Apica (ATF0) and a derived population (ATF5) obtained after five cycles of recurrent selection for superior tolerance to freezing (TF). Nearly 400 million reads were obtained from two lanes of an Illumina HiSeq 2000 sequencer and analyzed with the Universal Network-Enabled Analysis Kit (UNEAK) pipeline designed for species with no reference genome. Following the application of whole dataset-level filters, 11,694 single nucleotide polymorphism (SNP) loci were obtained. About 60% had a significant match on the Medicago truncatula syntenic genome. The accuracy of allelic ratios and genotype calls based on GBS data was directly assessed using 454 sequencing on a subset of SNP loci scored in eight plant samples. Sequencing depth in this study was not sufficient for accurate tetraploid allelic dosage, but reliable genotype calls based on diploid allelic dosage were obtained when using additional quality filtering. Principal Component Analysis of SNP loci in plant samples revealed that a small proportion (<5%) of the genetic variability assessed by GBS is able to differentiate ATF0 and ATF5. Our results confirm that analysis of GBS data using UNEAK is a reliable approach for genome-wide discovery of SNP loci in outcrossed polyploids.

No MeSH data available.


Related in: MedlinePlus

Number of 454 sequences retained for each of 11 amplicons covering 14 SNP loci identified with UNEAK in eight plant samples.
© Copyright Policy
Related In: Results  -  Collection

License
getmorefigures.php?uid=PMC4482585&req=5

pone.0131918.g002: Number of 454 sequences retained for each of 11 amplicons covering 14 SNP loci identified with UNEAK in eight plant samples.

Mentions: Approximately 80,000 reads were obtained by 454 sequencing of the amplification products of these 11 genomic regions from eight samples. After alignment with the Sanger consensus sequences, 60,684 (~75%) sequences were retained for further analysis. Rejected sequences were mostly short reads (about 90% ≤ 200 bp) (data not shown). Similar sequencing depths were observed between samples (6,329 to 8,567 reads/sample; Fig 2), whereas a large variability in the number of reads was observed between amplicons (362 to 11,172 reads/amplicon; Table 5).


Validation of Genotyping-By-Sequencing Analysis in Populations of Tetraploid Alfalfa by 454 Sequencing.

Rocher S, Jean M, Castonguay Y, Belzile F - PLoS ONE (2015)

Number of 454 sequences retained for each of 11 amplicons covering 14 SNP loci identified with UNEAK in eight plant samples.
© Copyright Policy
Related In: Results  -  Collection

License
Show All Figures
getmorefigures.php?uid=PMC4482585&req=5

pone.0131918.g002: Number of 454 sequences retained for each of 11 amplicons covering 14 SNP loci identified with UNEAK in eight plant samples.
Mentions: Approximately 80,000 reads were obtained by 454 sequencing of the amplification products of these 11 genomic regions from eight samples. After alignment with the Sanger consensus sequences, 60,684 (~75%) sequences were retained for further analysis. Rejected sequences were mostly short reads (about 90% ≤ 200 bp) (data not shown). Similar sequencing depths were observed between samples (6,329 to 8,567 reads/sample; Fig 2), whereas a large variability in the number of reads was observed between amplicons (362 to 11,172 reads/amplicon; Table 5).

Bottom Line: A combination of genome complexity reduction and multiplexing with DNA barcoding provides a simple and affordable way to resolve allelic variation between plant samples or populations.About 60% had a significant match on the Medicago truncatula syntenic genome.Our results confirm that analysis of GBS data using UNEAK is a reliable approach for genome-wide discovery of SNP loci in outcrossed polyploids.

View Article: PubMed Central - PubMed

Affiliation: Centre de Recherche et de Développement sur les Sols et les Grandes Cultures, Agriculture et agroalimentaire Canada, Quebec City (QC), Canada.

ABSTRACT
Genotyping-by-sequencing (GBS) is a relatively low-cost high throughput genotyping technology based on next generation sequencing and is applicable to orphan species with no reference genome. A combination of genome complexity reduction and multiplexing with DNA barcoding provides a simple and affordable way to resolve allelic variation between plant samples or populations. GBS was performed on ApeKI libraries using DNA from 48 genotypes each of two heterogeneous populations of tetraploid alfalfa (Medicago sativa spp. sativa): the synthetic cultivar Apica (ATF0) and a derived population (ATF5) obtained after five cycles of recurrent selection for superior tolerance to freezing (TF). Nearly 400 million reads were obtained from two lanes of an Illumina HiSeq 2000 sequencer and analyzed with the Universal Network-Enabled Analysis Kit (UNEAK) pipeline designed for species with no reference genome. Following the application of whole dataset-level filters, 11,694 single nucleotide polymorphism (SNP) loci were obtained. About 60% had a significant match on the Medicago truncatula syntenic genome. The accuracy of allelic ratios and genotype calls based on GBS data was directly assessed using 454 sequencing on a subset of SNP loci scored in eight plant samples. Sequencing depth in this study was not sufficient for accurate tetraploid allelic dosage, but reliable genotype calls based on diploid allelic dosage were obtained when using additional quality filtering. Principal Component Analysis of SNP loci in plant samples revealed that a small proportion (<5%) of the genetic variability assessed by GBS is able to differentiate ATF0 and ATF5. Our results confirm that analysis of GBS data using UNEAK is a reliable approach for genome-wide discovery of SNP loci in outcrossed polyploids.

No MeSH data available.


Related in: MedlinePlus