
The Future of Genomic Compression
Save Terabytes with Genozip

Join hundreds of institutions, research hospitals, and companies that rely on Genozip for the best production-grade genomic compression available. Genozip compresses all common genomic formats, including FASTQ, BAM, and VCF, while ensuring lossless compression and data integrity with verified MD5 checksums. Our affordable pricing makes Genozip accessible to all users, and we offer a free license for most academic applications.

Examples

Example of compressing paired-end FASTQ files. These are Illumina NovaSeq 30X WGS files:

[20:23:56]$ genozip sample_R1.fastq.gz sample_R2.fastq.gz --pair --reference GRCh38.fa
genozip sample_R1.fastq.gz : Done (2 minutes 50 seconds)
genozip sample_R2.fastq.gz : Done (6 minutes 30 seconds, FASTQ compression ratio: 21.8 - better than .fastq.gz by a factor of 4.5)
testing: genounzip sample_R1.fastq.gz : verified as identical to the original FASTQ
testing: genounzip sample_R2.fastq.gz : verified as identical to the original FASTQ

[20:35:06]$ ls -nh *fastq*
-rw-------+ 1 100 100 29G Oct 12  2020 sample_R1.fastq.gz
-rw-------+ 1 100 100 33G Oct 12  2020 sample_R2.fastq.gz
-rw-------+ 1 100 100 14G Jan 12 20:33 sample_R1+2.fastq.genozip
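
The size factor in the example above can be roughly checked from the listing. The snippet below is a minimal sketch, not part of Genozip: `size_factor` is a hypothetical helper, and the inputs are the sizes in GB as rounded by `ls -h`, so the result is only approximate and can differ slightly from the factor Genozip reports from exact byte counts.

```shell
# Hypothetical helper (not a Genozip command): print original/compressed
# size as a ratio with one decimal place.
size_factor() {
    awk -v orig="$1" -v comp="$2" 'BEGIN { printf "%.1f\n", orig / comp }'
}

# Sizes in GB, as rounded by `ls -h` in the listing above:
# the two .fastq.gz files (29G + 33G) versus the single paired .genozip file (14G).
size_factor $((29 + 33)) 14
```

The same helper can be applied to the other listings on this page by substituting their sizes, as long as both arguments use the same unit.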

Example of compressing a BAM file. This is the same NovaSeq data as above, aligned with bwa mem:

[20:56:02]$ genozip sample.bam --reference GRCh38.fa
genozip sample.bam : Done (11 minutes 26 seconds, BAM compression ratio: 3.8)
testing: genounzip sample.bam.genozip : verified as identical to the original BAM

[21:20:02]$ ls -nh sample.bam*
-rw-------+ 1 100 100 56G Apr 10  2022 sample.bam
-rw-------+ 1 100 100 15G Jan 12 21:07 sample.bam.genozip

Example of compressing a VCF file. This is a GVCF file generated with GATK HaplotypeCaller and GenotypeGVCFs:

[19:39:02]$ genozip sample.g.vcf.gz --reference GRCh38.fa
genozip sample.g.vcf.gz : Done (6 minutes 33 seconds, VCF compression ratio: 202.2 - better than .vcf.gz by a factor of 15.5)
testing: genounzip sample.g.vcf.genozip : verified as identical to the original VCF

[19:54:23]$ ls -nh sample.g.vcf.*
-rw-rw-r--+ 1 100 100 13G  May 13  2020 sample.g.vcf.gz
-rw-rw-r--+ 1 100 100 838M May 11 19:45 sample.g.vcf.genozip

Contact

Sales inquiries: sales@genozip.com

Technical questions: support@genozip.com

All other inquiries: info@genozip.com
