Events2Join

High efficiency referential genome compression algorithm


AskScience AMA Series: We're compression experts from Stanford ...

One approach is what's called reference-based compression, which starts with one human genome sequence and describes all other sequences in ...
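To make the idea in this snippet concrete, here is a minimal sketch of describing one sequence relative to a reference. It assumes the two sequences have equal length and differ only by substitutions; the function names `diff_against_reference` and `apply_diff` are hypothetical and do not come from any of the tools listed on this page. Real referential compressors also handle insertions and deletions.

```python
# Hypothetical illustration: store only the positions where the target
# differs from the reference (substitutions only, equal lengths assumed).

def diff_against_reference(reference: str, target: str) -> list[tuple[int, str]]:
    """Return (position, base) pairs where target differs from reference."""
    if len(reference) != len(target):
        raise ValueError("this sketch only handles equal-length sequences")
    return [(i, t) for i, (r, t) in enumerate(zip(reference, target)) if r != t]


def apply_diff(reference: str, edits: list[tuple[int, str]]) -> str:
    """Rebuild the target from the reference plus the stored substitutions."""
    seq = list(reference)
    for pos, base in edits:
        seq[pos] = base
    return "".join(seq)


reference = "ACGTACGTAC"
target = "ACGTTCGTAA"
edits = diff_against_reference(reference, target)
restored = apply_diff(reference, edits)
```

When two genomes are highly similar, the list of edits is much shorter than the target itself, which is where the savings come from.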

ERGC: an efficient referential genome compression algorithm ...

Data compression is crucial in effective handling of genomic data. Among several recently published algorithms, ERGC seems to be surprisingly good, easily ...

The Desperate Quest for Genomic Compression Algorithms

However, to distinguish real mutations from random read errors, and to make sure that mutations can be confidently called for regions that where ...

yuansliu/HiRGC: High-speed and high-ratio referential ... - GitHub

We present a high-performance referential genome compression algorithm named HiRGC. It is based on a 2-bit encoding scheme and an advanced greedy-matching ...
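As a rough illustration of what a 2-bit encoding scheme can look like, the sketch below packs the four bases into two bits each. The mapping A=0, C=1, G=2, T=3 and the names `pack` and `unpack` are assumptions made for this example, not HiRGC's actual code table or API, and the greedy-matching step mentioned in the snippet is not shown.

```python
# Assumed mapping for illustration only; HiRGC's real code table may differ.
CODE = {"A": 0, "C": 1, "G": 2, "T": 3}
BASES = "ACGT"


def pack(seq: str) -> bytes:
    """Pack a string over A/C/G/T into 2 bits per base (big-endian)."""
    value = 0
    for base in seq:
        value = (value << 2) | CODE[base]
    nbytes = (2 * len(seq) + 7) // 8  # round up to whole bytes
    return value.to_bytes(nbytes, "big")


def unpack(data: bytes, length: int) -> str:
    """Recover the first `length` bases from packed bytes."""
    value = int.from_bytes(data, "big")
    return "".join(
        BASES[(value >> (2 * (length - 1 - i))) & 3] for i in range(length)
    )


packed = pack("ACGTACGT")
roundtrip = unpack(packed, 8)
```

The sequence length has to be stored alongside the packed bytes, because leading zero bits are otherwise indistinguishable from leading `A` bases.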

DNA Sequence Compression using ERGC and NRGC for Better ...

In this article, we propose an algorithm that analyzes two compression algorithms, the Efficient Referential Genome Compression algorithm and the Novel ...

Reference-based data compression for genome in cloud

Compared with existing genome-specific compression methods, our proposed content-aware method focused on high compression ratio by taking advantage of ...

SparkGC: Spark based genome compression for large collections of ...

In the recent decade, the performance of referential genome compression method ... High efficiency referential genome compression algorithm. Bioinformatics ...

A Survey on Gene Sequence Compression Algorithms Based on ...

[Results] The analysis covers the technologies utilized by existing reference-based gene compression algorithms, including those based on single nucleotide ...

FastqZip: An Improved Reference-Based Genome Sequence Lossy ...

We reordered the reads to get a higher compression ratio. We evaluate our algorithms on five datasets and show that FastqZip can outperform the ...

Design considerations and methodology of .ORA format to achieve ...

Since we have a reference-based compressor, reads are first mapped to a reference genome, and then encoded as a position in the genome plus a ...

Reference-based genome compression using the longest matched ...

... reference genome sequence and then utilizes LMSs to compress the target genome sequence. ... High efficiency referential genome compression algorithm. Shi W., ...

Tackling the Challenges of FASTQ Referential Compression

The exponential growth of genomic data has recently motivated the development of compression algorithms to tackle the storage capacity ...

GReEn: a tool for efficient compression of genome resequencing data

... algorithm for DNA sequence compression ... (2011). Efficient storage of high throughput DNA sequencing data using reference-based compression.

An Algorithm to Build a Multi-genome Reference | bioRxiv

To evaluate the performance of our model, we have developed a genome compression tool, which can be used to estimate the amount of shared ...

SeqCompress: An algorithm for biological sequence compression

The algorithm is based on lossless data compression and uses a statistical model as well as arithmetic coding to compress DNA sequences. The proposed algorithm is ...

Efficient storage of high throughput DNA sequencing data ... - Ex Libris

In this paper we present a new reference-based compression method that efficiently compresses DNA sequences for storage. Our approach works for resequencing ...

Efficient storage of high throughput DNA sequencing data using ...

In this paper we present a new reference-based compression method that efficiently compresses DNA sequences for storage. Our approach works ...

Advances in high throughput DNA sequence data compression

In this article, we present a comprehensive review of compression methods for genome and reads compression. Algorithms are categorized as referential or ...

Optimal compressed representation of high throughput sequence ...

Here, we introduce a new reference-free compressed representation for genomic data based on light de novo assembly of reads, where each read is ...

DNA Lossless Compression Algorithms: Review

This algorithm is faster than any other algorithm and achieves a compression ratio very close to the best DNA compressors. The algorithm is memory efficient and ...