PolyTB: a genomic variation map for Mycobacterium tuberculosis.
Bottom Line: The identification of informative genetic variants such as phylogenetic markers and those associated with drug resistance or virulence will help barcode Mtb in the context of epidemiological, diagnostic and clinical studies.Mtb genomic datasets are increasingly available as raw sequences, which are potentially difficult and computer intensive to process, and compare across studies.Here we have processed the raw sequence data (>1500 isolates, eight studies) to compile a catalogue of SNPs (n = 74,039, 63% non-synonymous, 51.1% in more than one isolate, i.e. non-private), small indels (n = 4810) and larger structural variants (n = 800).
Affiliation: Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, WC1E 7HT London, UK. Electronic address: email@example.com.Show MeSH
Related in: MedlinePlus
Mentions: The phylogenetic view allows the user to construct phylogenies for a subset of isolates using whole-genome spanning SNPs. Spoligotypes are included to investigate whether clustering based on SNPs correlates with a strain-type. Figure 5 shows the resulting SNP-based neighbour-joining phylogenetic tree constructed for 140 isolates belonging to four different locations. Other PHYLYP distance-based methods (Fitch-Margoliash, UPGMA and Least Squares) are available too. Lineages and locations are shown as colour-coded bar charts around the tree to highlight the correlation between lineage and location with phylogenetic clustering. The aim of the phylogenetic view is to assess the genetic relatedness of isolates within and across populations as well as comparing genetic clustering with spoligotype and geographical assignation.
Affiliation: Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, WC1E 7HT London, UK. Electronic address: firstname.lastname@example.org.