top of page
Uncompress files compressed with genozip.

 

Usage

 

genounzip [options]… [files]…

​

One or more file names must be given.

​

Examples

​

genounzip file1.vcf.genozip file2.sam.genozip

 

genounzip file.vcf.genozip --output file.vcf.gz

 

genounzip mysample.R1+R2.fastq.genozip --prefix=new_directory/

​

Options

 

-e, --reference filename.  

Load a reference file prior to decompressing. Used only for files compressed with --reference. If not provided, genounzip will use the same reference filename as used for genozip.

​

Note: this is equivalent of setting the environment variable $GENOZIP_REFERENCE with the reference filename.

​

Note: if $GENOZIP_REFERENCE is set to a directory name, then the reference file is sought in that directory, with the reference file name used during compression. If it is not found there, the reference file path used for compression is used. 

​

--no-cache

Don't store reference genome data in RAM. Can also be used to delete previously cached genomes. See reference genome caching

​

-f, --force  

Force overwrite of the output file.

​

-D, --subdirs  

If a file name on the command line is a directory include all files of that directory (recursively).

​

-o, --output output-filename

Output to this filename.

 

Note: output-filename can also be a directory name, in which case the output file is written to the specified directory. If the name has a ‘/’ suffix (e.g. “-o my-dir/”), then the directory is created if it doesn’t already exist.

​

-p, --password password.  

Provide password to access file(s) that were compressed with --password.

​

-x, --index  

Create an index file alongside the uncompressed file. This option is set by default for bam, sam.gz and vcf.gz files, and is optional for FASTQ and FASTA files. See indexing for more details.

​

--no-bai

--no-tbi  

Don't create a bai or tbi index file. This option does not result in faster uncompression. See indexing for more details.

​

-z, --bgzf [level|exact|exact-strict] 

Controls the details of the re-compression of the output to gzip format (.gz or .bam extension). level specifies the gz-recompression level from 0 (no compression) to 5 (best yet slowest compression). The default level is 2.

​​

--bgzf=exact to instruct genounzip to attempt to gz-recompress to the exact same gz-compression as the original file. Whether genounzip succeeds to do so depends on the gzip library used by the tool that created the original file. See Compressing already-compressed files.

​

--bgzf=exact-strict is the same as --bgzf=exact except that the process exits with an error if exact gz-recompression is not possible.

​

-u--prefix prefix

For FASTQ files compressed with --pair: Specify a prefix that is added to each file name. A prefix may include a directory.

 

-m, --md5  

Show the digest of the decompressed file - MD5 if the file was compressed with --md5 and Adler32 if not.

Note: for compressed files (e.g. myfile.vcf.gz) the digest calculated is that of the original uncompressed file, see:

Verifying file integrity.

 

-t, --test  

Decompress in memory (i.e. without writing the decompressed file to disk) and use the digest (MD5 or Adler32) to verify that the resulting decompressed file is identical to the original file.

​

-^, --replace  

Delete the genozip-compressed file after completing its decompression.

​

-q, --quiet  

Don't show the progress indicator or warnings.

​

-Q, --noisy

The --quiet option is turned on by default when outputting to the terminal. --noisy stops the suppression of warnings.

​

-@, --threads number

Specify the maximum number of threads. By default genozip allocates 1.1 threads per core in order to maximize usage of all available cores. An exception is on Mac and Windows (including WSL) where the default allocation is 0.75 threads per core to maintain the operating system's UI's feeling of interactivity.

​

Note: For genounzip and genocat this limit is only approximate. For genozip, it is strictly enforced.

​

--low-memory

Uses less memory than normal, but runs slower

​

-w, --stats

Show the internal structure of a genozip file and the associated compression statistics.

​

--print-filename

Show the file name for each file.

​

--validate[=valid]  

Validates that the file(s) are valid genozip files. By default reports files that are invalid. With --validate=valid reports files that are valid, and if run on a single file, exit code indicates validity.

​

-T, --files-from filename

An alternative to providing input file names on the command line. filename it a textual file containing a newline-separated list of files. If filename is - (a hyphen) data is taken from stdin rather than a file.

​

--log filename

Send non-file output to a log file instead of the terminal.

​

--echo [=arbitrary-text]

Output the full command line upon successful or failed completion of execution. Useful if logging output. 

​​

It is possible to add an optional argument that has no effect other than to be shown on the command line outputted – this can be useful for adding contextual information when logging the output.

​

--help

Show a link to this page.

​

-L, --license, --licence

Show the license terms and conditions for this product as accepted. Combine with --force to see the version of the license current to the version of Genozip used. If you wish to change your license to the most recent one - make sure your version of Genozip is the latest and re-register with genozip --register.

​

-V, --version

Display Genozip's version number

​

--print-reference

Show the name and MD5 of the reference file that needs to be provided to uncompress this file.

​

bgzf
index
no-bai

© 2024 Genozip Limited. All rights reserved. Genozip™ is a trademark. Our technology is patent-pending. Privacy Policy.

bottom of page