Quality control method for RNA-seq using single nucleotide polymorphism allele frequency.
Bottom Line: When we use transcriptome data with whole-genome single nucleotide polymorphism (SNP) variant information, the allele frequency can show the genetic composition of the cell population and/or chromosomal aberrations.Here, I show how SNPs in mRNAs can be used to evaluate RNA-seq experiments by focusing on RNA-seq data based on a recently retracted paper on stimulus-triggered acquisition of pluripotency (STAP) cells.This re-evaluation showed that observing allele frequencies could help in assessing the quality of samples during a study and with retrospective evaluation of experimental quality.
Affiliation: RIKEN Center for Integrative Medical Science (IMS-RIKEN), 1-7-22 Suehiro-Cho, Tsurumi-Ku, Yokohama, Kanagawa, 230-0045, Japan.Show MeSH
Related in: MedlinePlus
Mentions: The most likely cell type to be contaminating the FI-SCs in the original study, TSCs, had many more heterozygous (B6/non-B6) alleles than non-B6 homozygous alleles. SNPs of FI-SCs were counted in alleles where B6 and 129 have different nucleotides and duplicated experiments resulted in 6859 and 7243 heterozygous SNPs and 24 and 14 non-B6 homozygous alleles, respectively. The percentage of contaminating cells can be assumed with genotype observation because the peak allele frequencies were at approximately 95–96%, whereas the population peak is expected to be at approximately 10%. This result was concordant with the expression of TSC marker genes. All of the examined TSC marker genes examined were expressed in approximately 10% of TSCs (Fig.4A).
Affiliation: RIKEN Center for Integrative Medical Science (IMS-RIKEN), 1-7-22 Suehiro-Cho, Tsurumi-Ku, Yokohama, Kanagawa, 230-0045, Japan.