Map / Align Metrics

Mapping and aligning metrics, unless explicitly stated, are in reads (not in terms of pairs or alignments).

The following metrics are reported:

Statistic	Description
Total input reads	Total number of reads in the input FASTQs.
Number of duplicate marked reads	Number of reads marked as duplicate.
Number of duplicate marked and mate reads removed	Number of duplicate reads discarded.
Number of unique reads (excl. duplicate marked reads)	Total number of reads minus the duplicate marked reads.
Reads with mate sequenced	The number of reads with a mate.
Reads without mate sequenced	The total number of reads minus the reads with mate sequenced.
QC-failed reads	Reads not passing platform/ vendor quality checks (SAM flag 0x200).
Mapped reads	The total number of mapped reads minus the number of unmapped reads.
Unmapped reads	The total number of reads that could not be mapped.
Singleton reads (itself mapped; mate unmapped)	The number of reads, where the read could be mapped, but the paired mate could not.
Paired reads (itself & mate mapped)	Both reads in the pair are mapped.
Properly paired reads	Both reads in the pair are mapped and fall within an acceptable range from each other based on the estimated insert length distribution.
Not properly paired reads (discordant)	The number of paired reads minus the number of properly paired reads.
Reads with MAPQ [40:inf)	The number of reads with a MAPQ value greater than or equal to 40.
Reads with MAPQ [30:40)	The number of reads with a MAPQ value greater than or equal to 30 and less than 40.
Reads with MAPQ [20:30)	The number of reads with a MAPQ value greater than or equal to 20 and less than 30.
Reads with MAPQ [10:20)	The number of reads with a MAPQ value greater than or equal to 10 and less than 20.
Reads with MAPQ [0:10)	The number of reads with a MAPQ value greater than or equal to 0 and less than 10.
Reads with MAPQ NA (Unmapped reads)	Total number of reads without a MAPQ value.
Reads with indel R1	The percentage of R1 reads containing at least 1 indel.
Reads with indel R2	The percentage of R2 reads containing at least 1 indel.
Soft-clipped bases R1	The percentage of bases in R1 reads that are soft clipped.
Soft-clipped bases R2	The percentage of bases in R2 reads that are soft clipped.
Total alignments	Total number of loci reads aligned to with > 0 quality.
Secondary alignments	Number of secondary alignment loci.
Supplementary (chimeric) alignments	A chimeric read is split over multiple loci (possibly due to structural variants). One alignment is referred to as the representative alignment, the other are supplementary.
Estimated read length	Total number of input bases divided by the number of reads.
Bases in reference genome	Total number of bases in the reference genome.
Bases in target bed [% of genome]	Total number of bases in the target region
Average sequenced coverage over genome	Total number of reads times the average read length divided by the total number of loci in the genome.
Average alignment coverage over genome	Total number of aligned bases divided by the total number of loci in the genome.
PCT of genome with coverage [100x:inf)	Percentage of bases in the genome with a range of coverage greater than or equal to 100x.
PCT of genome with coverage [50x:100x)	Percentage of bases in the genome with a range of coverage greater than or equal to 50x and less than 100x.
PCT of genome with coverage [20x:50x)	Percentage of bases in the genome with a range of coverage greater than or equal to 20x and less than 50x.
PCT of genome with coverage [10x:20x)	Percentage of bases in the genome with a range of coverage greater than or equal to 10x and less than 20x.
PCT of genome with coverage [3x:10x)	Percentage of bases in the genome with a range of coverage greater than or equal to 3x and less than 10x.
PCT of genome with coverage [0x:3x)	Percentage of bases in the genome with a range of coverage greater than or equal to 0x and less than 3x.
Average chr X coverage	Total number of bases that aligned to chromosome X (or to the intersection of chromosome X with the target region) divided by the total number of loci in chromosome X (or to the intersection of chromosome X with the target region). If there is no chromosome X in the reference genome, this metric shows as NA.
Average chr Y coverage	Total number of bases that aligned to chromosome Y (or to the intersection of chromosome Y with the target region) divided by the total number of loci in chromosome Y (or in the intersection of chromosome Y with the target region). If there is no chromosome Y in the reference genome, this metric shows as NA.
Average mitochondrial coverage	Total number of bases that aligned to the mitochondrial chromosome (or to the intersection of the mitochondrial chromosome with the target region) divided by the total number of loci in the mitochondrial chromosome (or in the intersection of the mitochondrial chromosome with the target region). If there was no mitochondrial chromosome in the reference genome, this metric shows as NA.
Average autosomal coverage	Total number of bases that aligned to autosomes (or to the autosomal loci in the target region) divided by the total number of loci in the autosomes (or to the autosomal loci in the target region). If there was no autosome in the reference genome, this metric shows as NA.
Median autosomal coverage	Median alignment coverage over the autosomes (or over the autosomal loci in the target region). If there was no autosome in the reference genome, this metric shows as NA.
Mean/Median autosomal coverage ratio	Mean autosomal coverage in the genome (or in the target region) divided by the median autosomal coverage in the genome (or in the target region). If there was no autosome in the reference genome, this metric shows as NA.
XAvgCov/YAvgCov ratio	Average chromosome X alignment coverage in the genome (or in the target region) divided by the average chromosome Y alignment coverage in the genome (or in the target region). If there was chromosome X or chromosome Y in the reference genome, this metric shows as NA.
XAvgCov/AutosomalAvgCov ratio	Average chromosome X alignment coverage in the genome (or in the target region) divided by the average autosomal coverage in the genome (or in the target region).
Provided sex chromosome ploidy	The specified sex chromosome of the sample (if entered).
Predicted sex chromosome ploidy	The sex chromosome of the sample as determined through the analysis.
Capture Specificity based on target region	Number of unique (nonduplicate) reads that mapped inside the target region with a MAP quality greater than 0, divided by the number of unique (nonduplicate) reads that mapped anywhere in the reference with a MAP quality greater than 0.
DRAGEN mapping rate [mil. reads/second]	The rate at which the pipeline generates seeds from the read and looks for exact matches in the reference genome.
Total reads in RG	Total number of reads in the read group.
Insert length: mean	The average length of the sequence in between a pair of reads.
Insert length: standard deviation	The standard deviation of the length of the sequence in between a pair of reads.

Related articles

Map / Align Metrics