Limits...
cDNA2Genome: a tool for mapping and annotating cDNAs.

Del Val C, Glatting KH, Suhai S - BMC Bioinformatics (2003)

Bottom Line: The final result of cDNA2Genome is an XML file containing all relevant information obtained in the process.This XML output can easily be used for further analysis such us program pipelines, or the integration of results into databases.The underlying approach allows sequential or parallel computation of sequences for high-throughput analysis of cDNAs.

View Article: PubMed Central - HTML - PubMed

Affiliation: Department of Molecular Biophysics, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 580, D-69120 Heidelberg, Germany. c.delval@dkfz.de

ABSTRACT

Background: In the last years several high-throughput cDNA sequencing projects have been funded worldwide with the aim of identifying and characterizing the structure of complete novel human transcripts. However some of these cDNAs are error prone due to frameshifts and stop codon errors caused by low sequence quality, or to cloning of truncated inserts, among other reasons. Therefore, accurate CDS prediction from these sequences first require the identification of potentially problematic cDNAs in order to speed up the posterior annotation process.

Results: cDNA2Genome is an application for the automatic high-throughput mapping and characterization of cDNAs. It utilizes current annotation data and the most up to date databases, especially in the case of ESTs and mRNAs in conjunction with a vast number of approaches to gene prediction in order to perform a comprehensive assessment of the cDNA exon-intron structure. The final result of cDNA2Genome is an XML file containing all relevant information obtained in the process. This XML output can easily be used for further analysis such us program pipelines, or the integration of results into databases. The web interface to cDNA2Genome also presents this data in HTML, where the annotation is additionally shown in a graphical form. cDNA2Genome has been implemented under the W3H task framework which allows the combination of bioinformatics tools in tailor-made analysis task flows as well as the sequential or parallel computation of many sequences for large-scale analysis.

Conclusions: cDNA2Genome represents a new versatile and easily extensible approach to the automated mapping and annotation of human cDNAs. The underlying approach allows sequential or parallel computation of sequences for high-throughput analysis of cDNAs.

Show MeSH
Data flow and dependencies of applications in cDNA2Genome. programs used by cDNA2genome during the annotation process.
© Copyright Policy
Related In: Results  -  Collection


getmorefigures.php?uid=PMC239864&req=5

Figure 1: Data flow and dependencies of applications in cDNA2Genome. programs used by cDNA2genome during the annotation process.

Mentions: For the implementation of cDNA2Genome under the task framework it was necessary to describe the applications dependencies, the data flow (Fig. 1) and the merging of the individual outputs into a common output report. The system stores both, the results of the different applications together with newly computed results. The final output of the task is an XML file which contains all relevant information obtained. For the web user this XML output can be transformed by means of W2H's post-processing mechanism into an HTML page using XSLT (Extensible Stylesheet Language Transformations) .


cDNA2Genome: a tool for mapping and annotating cDNAs.

Del Val C, Glatting KH, Suhai S - BMC Bioinformatics (2003)

Data flow and dependencies of applications in cDNA2Genome. programs used by cDNA2genome during the annotation process.
© Copyright Policy
Related In: Results  -  Collection

Show All Figures
getmorefigures.php?uid=PMC239864&req=5

Figure 1: Data flow and dependencies of applications in cDNA2Genome. programs used by cDNA2genome during the annotation process.
Mentions: For the implementation of cDNA2Genome under the task framework it was necessary to describe the applications dependencies, the data flow (Fig. 1) and the merging of the individual outputs into a common output report. The system stores both, the results of the different applications together with newly computed results. The final output of the task is an XML file which contains all relevant information obtained. For the web user this XML output can be transformed by means of W2H's post-processing mechanism into an HTML page using XSLT (Extensible Stylesheet Language Transformations) .

Bottom Line: The final result of cDNA2Genome is an XML file containing all relevant information obtained in the process.This XML output can easily be used for further analysis such us program pipelines, or the integration of results into databases.The underlying approach allows sequential or parallel computation of sequences for high-throughput analysis of cDNAs.

View Article: PubMed Central - HTML - PubMed

Affiliation: Department of Molecular Biophysics, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 580, D-69120 Heidelberg, Germany. c.delval@dkfz.de

ABSTRACT

Background: In the last years several high-throughput cDNA sequencing projects have been funded worldwide with the aim of identifying and characterizing the structure of complete novel human transcripts. However some of these cDNAs are error prone due to frameshifts and stop codon errors caused by low sequence quality, or to cloning of truncated inserts, among other reasons. Therefore, accurate CDS prediction from these sequences first require the identification of potentially problematic cDNAs in order to speed up the posterior annotation process.

Results: cDNA2Genome is an application for the automatic high-throughput mapping and characterization of cDNAs. It utilizes current annotation data and the most up to date databases, especially in the case of ESTs and mRNAs in conjunction with a vast number of approaches to gene prediction in order to perform a comprehensive assessment of the cDNA exon-intron structure. The final result of cDNA2Genome is an XML file containing all relevant information obtained in the process. This XML output can easily be used for further analysis such us program pipelines, or the integration of results into databases. The web interface to cDNA2Genome also presents this data in HTML, where the annotation is additionally shown in a graphical form. cDNA2Genome has been implemented under the W3H task framework which allows the combination of bioinformatics tools in tailor-made analysis task flows as well as the sequential or parallel computation of many sequences for large-scale analysis.

Conclusions: cDNA2Genome represents a new versatile and easily extensible approach to the automated mapping and annotation of human cDNAs. The underlying approach allows sequential or parallel computation of sequences for high-throughput analysis of cDNAs.

Show MeSH