
Sex Classifier algorithm (FASTQ)
NOTE: This feature is no longer available as of Genozip version 15.0.42. To access this feature, use an earlier version.
1) Calculate the per-contig Depth as described in Coverage and Depth.
2) Calculate X_Depth / Y_Depth and Autosome_Depth / X_Depth.
3) Multiply Autosome_Depth / X_Depth by a correction factor of 1.333. This corrects for an observed bias of the Genozip Aligner in favor of X.
4) Decision matrix (for FASTQ):
Definitions:
• Depth is defined in Coverage and Depth.
• Autosome_Depth means combined coverage of all autosome contigs divided by combined length of all autosome contigs.
• Autosome contigs are all contigs excluding X, Y, MT and excluding non-primary contigs like chr22_KI270731v1_random.
• Chromosome X is the contig named “X”, “chrX” or “ChrX”, and similarly for Y. For MT, contig names based on “M” and “MT” are accepted.
• The correction factor of 1.333 is set based on empirical tests on human data with the GRCh38 reference genome. It is not yet known if this factor will remain the same for other species or other reference genomes.
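Steps 1 to 3 and the definitions above can be sketched in Python as follows. This is only an illustrative sketch, not Genozip's implementation: the function names, the input format (a mapping from contig name to a coverage/length pair), the exact set of accepted MT names, and the rule that an autosome is a bare number with an optional "chr"/"Chr" prefix are all assumptions made for the example. It also assumes that Depth is coverage divided by contig length, in line with the Autosome_Depth definition. The decision matrix of step 4 is not reproduced here, so the sketch stops at the two ratios.

```python
import re

CORRECTION_FACTOR = 1.333  # value from the text (empirical, human data, GRCh38)

X_NAMES = {"X", "chrX", "ChrX"}
Y_NAMES = {"Y", "chrY", "ChrY"}
# Assumption: the text only says that names "based on" M and MT are accepted.
MT_NAMES = {"M", "MT", "chrM", "chrMT", "ChrM", "ChrMT"}
# Assumption: a primary autosome is a bare number with an optional chr/Chr prefix.
AUTOSOME_RE = re.compile(r"(?:[Cc]hr)?\d+")


def contig_class(name):
    """Classify a contig name as 'X', 'Y', 'MT', 'autosome' or 'other'."""
    if name in X_NAMES:
        return "X"
    if name in Y_NAMES:
        return "Y"
    if name in MT_NAMES:
        return "MT"
    if AUTOSOME_RE.fullmatch(name):
        return "autosome"
    return "other"  # e.g. non-primary contigs such as chr22_KI270731v1_random


def _ratio(numerator, denominator):
    """Return numerator / denominator, or None when the denominator is zero."""
    return numerator / denominator if denominator else None


def depth_ratios(contigs):
    """contigs maps contig name -> (coverage_in_bases, contig_length)."""
    coverage = {"autosome": 0, "X": 0, "Y": 0}
    length = {"autosome": 0, "X": 0, "Y": 0}
    for name, (cov, ln) in contigs.items():
        cls = contig_class(name)
        if cls in coverage:  # MT and non-primary contigs are ignored
            coverage[cls] += cov
            length[cls] += ln

    # Combined coverage divided by combined length, per class.
    depth = {c: (coverage[c] / length[c] if length[c] else 0.0) for c in coverage}

    a_over_x = _ratio(depth["autosome"], depth["X"])
    return {
        "x_over_y": _ratio(depth["X"], depth["Y"]),
        "autosome_over_x_corrected": (
            None if a_over_x is None else a_over_x * CORRECTION_FACTOR
        ),
    }


if __name__ == "__main__":
    # Made-up numbers, for illustration only.
    example = {
        "chr1": (4000, 100),
        "chr2": (2000, 100),
        "chrX": (1500, 100),
        "chrY": (100, 50),
        "chrM": (500, 10),
        "chr22_KI270731v1_random": (999, 10),
    }
    print(depth_ratios(example))
```

A ratio is returned as None when its denominator is zero (for example, no reads on Y). How such cases feed into the decision matrix is not covered by this sketch.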