DNABIT Compress - Genome compression algorithm.

Rajarajeswari P, Apparao A - Bioinformation (2011)

Bottom Line: Data compression is concerned with how information is organized in data. Efficient storage means removal of redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules.


ABSTRACT
Data compression is concerned with how information is organized in data. Efficient storage means removing redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules. We present a compression algorithm for DNA sequences, "DNABIT Compress", based on a novel scheme that assigns binary bits to smaller segments of DNA bases in order to compress both repetitive and non-repetitive DNA sequences. The proposed algorithm achieves the best compression ratio for DNA sequences of larger genomes. Its significantly better compression results show that "DNABIT Compress" is the best among the existing compression algorithms. While achieving the best compression ratios for DNA sequences (genomes), DNABIT Compress also significantly improves on the running time of previous DNA compression programs. Assigning binary bits (a unique bit code) to exact-repeat and reverse-repeat fragments of a DNA sequence is also a concept introduced in this algorithm for the first time in DNA compression. The proposed algorithm achieves a compression ratio as low as 1.58 bits/base, whereas the best existing methods could not achieve a ratio below 1.72 bits/base.
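To put the bits/base figures in context, the following is a minimal sketch of the simplest fixed-width encoding, in which each of the four bases is given a 2-bit code. It is not the DNABIT Compress scheme, whose bit assignments are not specified in the abstract; the mapping and the function names `encode`, `decode`, and `bits_per_base` are assumptions made for illustration only.

```python
# Illustrative baseline only: a fixed 2-bit code per base.
# This is NOT the DNABIT Compress bit assignment, which the abstract does not give.
BASE_TO_BITS = {"A": "00", "C": "01", "G": "10", "T": "11"}  # assumed mapping
BITS_TO_BASE = {bits: base for base, bits in BASE_TO_BITS.items()}


def encode(sequence: str) -> str:
    """Return the sequence as a string of '0'/'1' characters, 2 bits per base."""
    return "".join(BASE_TO_BITS[base] for base in sequence.upper())


def decode(bitstring: str) -> str:
    """Invert encode() by reading the bit string two characters at a time."""
    return "".join(
        BITS_TO_BASE[bitstring[i:i + 2]] for i in range(0, len(bitstring), 2)
    )


def bits_per_base(encoded_bits: int, num_bases: int) -> float:
    """Compression ratio in the unit used by the abstract: encoded bits per base."""
    return encoded_bits / num_bases


if __name__ == "__main__":
    seq = "ACGTACGTTTGA"
    bits = encode(seq)
    assert decode(bits) == seq  # the encoding is lossless
    print(bits_per_base(len(bits), len(seq)))
```

A fixed 2-bit code always costs exactly 2 bits/base, so any scheme reporting fewer bits per base must exploit redundancy such as repeats. Measured against this baseline, 1.58 bits/base is about 21% fewer bits and 1.72 bits/base is about 14% fewer.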


Figure 4: Decompressed text in the output box.

