Limits...
Transcriptome analysis and SSR/SNP markers information of the blunt snout bream (Megalobrama amblycephala).

Gao Z, Luo W, Liu H, Zeng C, Liu X, Yi S, Wang W - PLoS ONE (2012)

Bottom Line: A total number of 4,952 SSRs were found and 116 polymorphic loci have been characterized.A significant number of SNPs (25,697) and indels (23,287) were identified based on specific filter criteria in the M. amblycephala.The identified SSR and SNP markers will greatly benefit its breeding program and whole genome association studies.

View Article: PubMed Central - PubMed

Affiliation: Key Lab of Freshwater Animal Breeding, College of Fisheries, Ministry of Agriculture, Key Lab of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan, People's Republic of China.

ABSTRACT

Background: Blunt snout bream (Megalobrama amblycephala) is an herbivorous freshwater fish species native to China and has been recognized as a main aquaculture species in the Chinese freshwater polyculture system with high economic value. Right now, only limited EST resources were available for M. amblycephala. Recent advances in large-scale RNA sequencing provide a fast, cost-effective, and reliable approach to generate large expression datasets for functional genomic analysis, which is especially suitable for non-model species with un-sequenced genomes.

Methodology and principal findings: Using 454 pyrosequencing, a total of 1,409,706 high quality reads (total length 577 Mbp) were generated from the normalized cDNA of pooled M. amblycephala individuals. These sequences were assembled into 26,802 contigs and 73,675 singletons. After BLAST searches against the NCBI non-redundant (NR) and UniProt databases with an arbitrary expectation value of E(-10), over 40,000 unigenes were functionally annotated and classified using the FunCat functional annotation scheme. A comparative genomics approach revealed a substantial proportion of genes expressed in M. amblycephala tanscriptome to be shared across the genomes of zebrafish, medaka, tetraodon, fugu, stickleback, human, mouse, and chicken, and identified a substantial number of potentially novel M. amblycephala genes. A total number of 4,952 SSRs were found and 116 polymorphic loci have been characterized. A significant number of SNPs (25,697) and indels (23,287) were identified based on specific filter criteria in the M. amblycephala.

Conclusions: This study is the first comprehensive transcriptome analysis for a fish species belonging to the genus Megalobrama. These large EST resources are expected to be valuable for the development of molecular markers, construction of gene-based linkage map, and large-scale expression analysis of M. amblycephala, as well as comparative genome analysis for the genus Megalobrama fish species. The identified SSR and SNP markers will greatly benefit its breeding program and whole genome association studies.

Show MeSH
Frequency distribution of 454 sequencing read lengths for M. amblycephala.
© Copyright Policy
Related In: Results  -  Collection


getmorefigures.php?uid=PMC3412804&req=5

pone-0042637-g001: Frequency distribution of 454 sequencing read lengths for M. amblycephala.

Mentions: We created a normalized cDNA pool using RNA extracted from different tissues of multiple M. amblycephala individuals. Pyrosequencing of this cDNA pool on a 454 GS XLR70 Titanium platform produced a total of 1,409,706 reads, with an average sequence length of 411 base pairs (SD = 117, range = 40–906; Figure 1), approximately 577 Mbp of sequence data. Assuming a similar number of genes and a similar average coding sequence length of 2,000 bp in M. amblycephala as in Danio rerio (37,855), average transcriptome coverage was estimated at 7.6×. To our knowledge, this is the first comprehensive study on the transcriptome of M. amblycephala.


Transcriptome analysis and SSR/SNP markers information of the blunt snout bream (Megalobrama amblycephala).

Gao Z, Luo W, Liu H, Zeng C, Liu X, Yi S, Wang W - PLoS ONE (2012)

Frequency distribution of 454 sequencing read lengths for M. amblycephala.
© Copyright Policy
Related In: Results  -  Collection

Show All Figures
getmorefigures.php?uid=PMC3412804&req=5

pone-0042637-g001: Frequency distribution of 454 sequencing read lengths for M. amblycephala.
Mentions: We created a normalized cDNA pool using RNA extracted from different tissues of multiple M. amblycephala individuals. Pyrosequencing of this cDNA pool on a 454 GS XLR70 Titanium platform produced a total of 1,409,706 reads, with an average sequence length of 411 base pairs (SD = 117, range = 40–906; Figure 1), approximately 577 Mbp of sequence data. Assuming a similar number of genes and a similar average coding sequence length of 2,000 bp in M. amblycephala as in Danio rerio (37,855), average transcriptome coverage was estimated at 7.6×. To our knowledge, this is the first comprehensive study on the transcriptome of M. amblycephala.

Bottom Line: A total number of 4,952 SSRs were found and 116 polymorphic loci have been characterized.A significant number of SNPs (25,697) and indels (23,287) were identified based on specific filter criteria in the M. amblycephala.The identified SSR and SNP markers will greatly benefit its breeding program and whole genome association studies.

View Article: PubMed Central - PubMed

Affiliation: Key Lab of Freshwater Animal Breeding, College of Fisheries, Ministry of Agriculture, Key Lab of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan, People's Republic of China.

ABSTRACT

Background: Blunt snout bream (Megalobrama amblycephala) is an herbivorous freshwater fish species native to China and has been recognized as a main aquaculture species in the Chinese freshwater polyculture system with high economic value. Right now, only limited EST resources were available for M. amblycephala. Recent advances in large-scale RNA sequencing provide a fast, cost-effective, and reliable approach to generate large expression datasets for functional genomic analysis, which is especially suitable for non-model species with un-sequenced genomes.

Methodology and principal findings: Using 454 pyrosequencing, a total of 1,409,706 high quality reads (total length 577 Mbp) were generated from the normalized cDNA of pooled M. amblycephala individuals. These sequences were assembled into 26,802 contigs and 73,675 singletons. After BLAST searches against the NCBI non-redundant (NR) and UniProt databases with an arbitrary expectation value of E(-10), over 40,000 unigenes were functionally annotated and classified using the FunCat functional annotation scheme. A comparative genomics approach revealed a substantial proportion of genes expressed in M. amblycephala tanscriptome to be shared across the genomes of zebrafish, medaka, tetraodon, fugu, stickleback, human, mouse, and chicken, and identified a substantial number of potentially novel M. amblycephala genes. A total number of 4,952 SSRs were found and 116 polymorphic loci have been characterized. A significant number of SNPs (25,697) and indels (23,287) were identified based on specific filter criteria in the M. amblycephala.

Conclusions: This study is the first comprehensive transcriptome analysis for a fish species belonging to the genus Megalobrama. These large EST resources are expected to be valuable for the development of molecular markers, construction of gene-based linkage map, and large-scale expression analysis of M. amblycephala, as well as comparative genome analysis for the genus Megalobrama fish species. The identified SSR and SNP markers will greatly benefit its breeding program and whole genome association studies.

Show MeSH