Uncompress files compressed with genozip.
Usage
genounzip [options]… [files]…
​
One or more file names must be given.
​
Examples
​
genounzip file1.vcf.genozip file2.sam.genozip
genounzip file.vcf.genozip --output file.vcf.gz
genounzip mysample.R1+R2.fastq.genozip --prefix=new_directory/
​
Options
-e, --reference filename
Load a reference file prior to decompressing. Used only for files compressed with --reference. If not provided, genounzip uses the same reference file path that was given to genozip at compression time.
​
Note: this is equivalent to setting the environment variable $GENOZIP_REFERENCE to the reference filename.
​
Note: if $GENOZIP_REFERENCE is set to a directory name, then the reference file is sought in that directory, with the reference file name used during compression. If it is not found there, the reference file path used for compression is used.
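
The lookup order described in the two notes above can be sketched as a small shell function. This is only an illustration of the documented behaviour, not genounzip's actual code: the helper name resolve_reference and the file names in the usage comment are hypothetical, and the sketch ignores the --reference option itself.

```shell
# Hypothetical helper (not part of Genozip): prints the reference path that
# would be tried, given the reference path recorded at compression time ($1).
resolve_reference() {
    recorded_path=$1
    if [ -n "${GENOZIP_REFERENCE:-}" ] && [ -d "$GENOZIP_REFERENCE" ]; then
        # Directory: look for the recorded file name inside that directory.
        candidate="${GENOZIP_REFERENCE%/}/$(basename "$recorded_path")"
        if [ -f "$candidate" ]; then
            printf '%s\n' "$candidate"
        else
            # Not found there: fall back to the path used for compression.
            printf '%s\n' "$recorded_path"
        fi
    elif [ -n "${GENOZIP_REFERENCE:-}" ]; then
        # Not a directory: treated as the reference filename itself.
        printf '%s\n' "$GENOZIP_REFERENCE"
    else
        # Variable not set: use the path used for compression.
        printf '%s\n' "$recorded_path"
    fi
}

# Example (hypothetical paths):
#   GENOZIP_REFERENCE=/shared/refs resolve_reference /home/me/genome.ref.genozip
```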
​
--no-cache
Don't store reference genome data in RAM. Can also be used to delete previously cached genomes. See reference genome caching.
​
-f, --force
Force overwrite of the output file.
​
-D, --subdirs
If a file name on the command line is a directory, include all files in that directory (recursively).
​
-o, --output output-filename
Output to this filename.
Note: output-filename can also be a directory name, in which case the output file is written to the specified directory. If the name has a ‘/’ suffix (e.g. “-o my-dir/”), then the directory is created if it doesn’t already exist.
​
-p, --password password
Provide password to access file(s) that were compressed with --password.
​
-x, --index
Create an index file alongside the uncompressed file. This option is set by default for bam, sam.gz and vcf.gz files, and is optional for FASTQ and FASTA files. See indexing for more details.
​
--no-bai
--no-tbi
Don't create a bai or tbi index file. This option does not result in faster uncompression. See indexing for more details.
​
-z, --bgzf [level|exact|exact-strict]
Controls the details of the re-compression of the output to gzip format (.gz or .bam extension). level specifies the gz-recompression level from 0 (no compression) to 5 (best yet slowest compression). The default level is 2.
​​
--bgzf=exact instructs genounzip to attempt to gz-recompress the output to exactly the same gz-compression as the original file. Whether genounzip succeeds depends on the gzip library used by the tool that created the original file. See Compressing already-compressed files.
​
--bgzf=exact-strict is the same as --bgzf=exact except that the process exits with an error if exact gz-recompression is not possible.
​
-u, --prefix prefix
For FASTQ files compressed with --pair: Specify a prefix that is added to each file name. A prefix may include a directory.

-m, --md5
Show the digest of the decompressed file: MD5 if the file was compressed with --md5, and Adler32 if not.
Note: for compressed files (e.g. myfile.vcf.gz) the digest calculated is that of the original uncompressed file.

-t, --test
Decompress in memory (i.e. without writing the decompressed file to disk) and use the digest (MD5 or Adler32) to verify that the resulting decompressed file is identical to the original file.
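
As an illustration, --test can be wrapped in a loop to check a batch of files before the originals are deleted. The sketch below assumes that genounzip exits with a non-zero status when verification fails, which is the usual convention but is not stated on this page; the helper name verify_all is hypothetical.

```shell
# Hypothetical helper: runs "genounzip --test" on each argument and
# returns 1 if any file fails verification, 0 otherwise.
# Assumption: genounzip exits non-zero when the digest check fails.
verify_all() {
    failed=0
    for f in "$@"; do
        if genounzip --test --quiet "$f"; then
            printf 'OK      %s\n' "$f"
        else
            printf 'FAILED  %s\n' "$f" >&2
            failed=1
        fi
    done
    return "$failed"
}

# Example: verify_all *.genozip
```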
​
-^, --replace
Delete the genozip-compressed file after completing its decompression.
​
-q, --quiet
Don't show the progress indicator or warnings.
​
-Q, --noisy
The --quiet option is turned on by default when outputting to the terminal. --noisy stops the suppression of warnings.
​
-@, --threads number
Specify the maximum number of threads. By default genozip allocates 1.1 threads per core in order to maximize usage of all available cores. The exception is Mac and Windows (including WSL), where the default allocation is 0.75 threads per core to keep the operating system's UI responsive.
​
Note: For genounzip and genocat this limit is only approximate. For genozip, it is strictly enforced.
​
--low-memory
Uses less memory than normal, but runs slower.
​
-w, --stats
Show the internal structure of a genozip file and the associated compression statistics.
​
--print-filename
Show the file name for each file.
​
--validate[=valid]
Validate that the file(s) are valid genozip files. By default, reports the files that are invalid. With --validate=valid, reports the files that are valid instead; if run on a single file, the exit code indicates validity.
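
For example, the single-file exit code can be used in a script. This is a minimal sketch that assumes an exit status of 0 means "valid" (the page says only that the exit code indicates validity); the helper name is_valid_genozip is hypothetical.

```shell
# Hypothetical helper: succeeds if $1 is a valid genozip file.
# Assumption: --validate=valid on a single file exits 0 when the file is valid.
is_valid_genozip() {
    genounzip --validate=valid "$1" >/dev/null 2>&1
}

# Example:
#   if is_valid_genozip sample.vcf.genozip; then echo "valid"; fi
```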
​
-T, --files-from filename
An alternative to providing input file names on the command line. filename is a text file containing a newline-separated list of files. If filename is - (a hyphen), the list is read from stdin rather than from a file.
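
Because the list is newline-separated, the output of find can be piped straight into --files-from -. The sketch below decompresses only the VCF files under a directory, a selection that --subdirs alone does not provide; the helper name decompress_vcfs is hypothetical, and file names containing newlines are not handled.

```shell
# Hypothetical helper: decompress every *.vcf.genozip file found under
# directory $1, passing the list to genounzip on stdin.
decompress_vcfs() {
    find "$1" -type f -name '*.vcf.genozip' | genounzip --files-from -
}

# Example: decompress_vcfs ./cohort
```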
​
--log filename
Send non-file output to a log file instead of the terminal.
​
--echo[=arbitrary-text]
Output the full command line upon successful or failed completion of execution. Useful if logging output.
​​
An optional argument may be added. It has no effect other than appearing in the echoed command line, which can be useful for adding contextual information when logging the output.
​
--help
Show a link to this page.
​
-L, --license, --licence
Show the license terms and conditions for this product as accepted. Combine with --force to see the license version that is current for the version of Genozip in use. If you wish to change your license to the most recent one, make sure your version of Genozip is the latest and re-register with genozip --register.
​
-V, --version
Display Genozip's version number.
​
--print-reference
Show the name and MD5 of the reference file that needs to be provided to uncompress this file.
​

