The Genomes OnLine Database (GOLD) v.5: a metadata management system based on a four level (meta)genome project classification.
Bottom Line: The database currently hosts information for about 19,200 studies, 56,000 Biosamples, 56,000 sequencing projects and 39,400 analysis projects. The problems encountered in integrating disparate and varying quality data into GOLD are briefly highlighted. GOLD fully supports and follows the Genomic Standards Consortium (GSC) Minimum Information standards.
Affiliation: Prokaryotic Super Program, DOE Joint Genome Institute, Walnut Creek, CA 94598, USA firstname.lastname@example.org.
Mentions: A study represents the highest level of organization. A study includes one or more Biosamples, together with their associated sequencing projects (SPs) and analysis projects (APs), that have been grouped to investigate a related research topic. For example, the HMP (16), GEBA (17,18) and KMG (21) studies represent typical cases where researchers set out to explore a specific topic by sequencing thousands of samples. Studies like GEBA-MDM (22) and FEBA (19) applied several different sequencing strategies (e.g. isolate genomes, single-cell genomes, metagenomes, transcriptomes) as part of a single study. Studies may be composed of one to hundreds of Biosamples from a wide range of ecological settings (Figure 2). Each Biosample may also yield several different SPs, each of which may yield multiple APs (Figure 3 and Table 1). Study IDs are referred to as ‘Gs’ IDs in the new system. A GOLD study is analogous to the NCBI's umbrella BioProject, and may contain one or more NCBI BioSamples.
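The four-level nesting described above (study, Biosample, SP, AP) can be pictured as a simple tree. The following is a minimal sketch in Python, not GOLD's actual schema: the class names, field names, the example ID values and the `count_levels` helper are all assumptions made for illustration. Only the ‘Gs’ prefix for study IDs comes from the text.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class AnalysisProject:
    # Lowest level: an analysis derived from one sequencing project.
    ap_id: str  # hypothetical ID value; the excerpt does not give the format


@dataclass
class SequencingProject:
    # A sequencing project (SP) may yield multiple analysis projects (APs).
    sp_id: str  # hypothetical ID value
    analysis_projects: List[AnalysisProject] = field(default_factory=list)


@dataclass
class Biosample:
    # A Biosample may yield several different SPs.
    biosample_id: str  # hypothetical ID value
    sequencing_projects: List[SequencingProject] = field(default_factory=list)


@dataclass
class Study:
    # Top level: groups Biosamples that address a related research topic.
    study_id: str
    name: str
    biosamples: List[Biosample] = field(default_factory=list)

    def __post_init__(self) -> None:
        # The text states that study IDs are 'Gs' IDs; the check itself is
        # an illustrative assumption, not a documented GOLD rule.
        if not self.study_id.startswith("Gs"):
            raise ValueError("study IDs are expected to start with 'Gs'")

    def count_levels(self) -> Dict[str, int]:
        # Flatten the tree one level at a time and count the entries.
        sps = [sp for b in self.biosamples for sp in b.sequencing_projects]
        aps = [ap for sp in sps for ap in sp.analysis_projects]
        return {
            "biosamples": len(self.biosamples),
            "sequencing_projects": len(sps),
            "analysis_projects": len(aps),
        }


# One study, one Biosample, two SPs; the first SP has two APs, the second one.
example_study = Study(
    study_id="Gs-example",
    name="Example study",
    biosamples=[
        Biosample(
            biosample_id="biosample-1",
            sequencing_projects=[
                SequencingProject(
                    "sp-1", [AnalysisProject("ap-1"), AnalysisProject("ap-2")]
                ),
                SequencingProject("sp-2", [AnalysisProject("ap-3")]),
            ],
        )
    ],
)

counts = example_study.count_levels()
```

Modeling each level as a container of the next one mirrors the one-to-many relationships in the text: a study holds Biosamples, a Biosample holds SPs, and an SP holds APs, so totals per level can be obtained by walking the tree from the study downward.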