Map / Align Metrics
Mapping and aligning metrics, unless explicitly stated, are in reads (not in terms of pairs or alignments).
The following metrics are reported:
Statistic |
Description |
---|---|
Total input reads |
Total number of reads in the input FASTQs. |
Number of duplicate marked reads |
Number of reads marked as duplicate. |
Number of duplicate marked and mate reads removed |
Number of duplicate reads discarded. |
Number of unique reads (excl. duplicate marked reads) |
Total number of reads minus the duplicate marked reads. |
Reads with mate sequenced |
The number of reads with a mate. |
Reads without mate sequenced |
The total number of reads minus the reads with mate sequenced. |
QC-failed reads |
Reads not passing platform/ vendor quality checks (SAM flag 0x200). |
Mapped reads |
The total number of mapped reads minus the number of unmapped reads. |
Unmapped reads |
The total number of reads that could not be mapped. |
Singleton reads (itself mapped; mate unmapped) |
The number of reads, where the read could be mapped, but the paired mate could not. |
Paired reads (itself & mate mapped) |
Both reads in the pair are mapped. |
Properly paired reads |
Both reads in the pair are mapped and fall within an acceptable range from each other based on the estimated insert length distribution. |
Not properly paired reads (discordant) |
The number of paired reads minus the number of properly paired reads. |
Reads with MAPQ [40:inf) |
The number of reads with a MAPQ value greater than or equal to 40. |
Reads with MAPQ [30:40) |
The number of reads with a MAPQ value greater than or equal to 30 and less than 40. |
Reads with MAPQ [20:30) |
The number of reads with a MAPQ value greater than or equal to 20 and less than 30. |
Reads with MAPQ [10:20) |
The number of reads with a MAPQ value greater than or equal to 10 and less than 20. |
Reads with MAPQ [0:10) |
The number of reads with a MAPQ value greater than or equal to 0 and less than 10. |
Reads with MAPQ NA (Unmapped reads) |
Total number of reads without a MAPQ value. |
Reads with indel R1 |
The percentage of R1 reads containing at least 1 indel. |
Reads with indel R2 |
The percentage of R2 reads containing at least 1 indel. |
Soft-clipped bases R1 |
The percentage of bases in R1 reads that are soft clipped. |
Soft-clipped bases R2 |
The percentage of bases in R2 reads that are soft clipped. |
Total alignments |
Total number of loci reads aligned to with > 0 quality. |
Secondary alignments |
Number of secondary alignment loci. |
Supplementary (chimeric) alignments |
A chimeric read is split over multiple loci (possibly due to structural variants). One alignment is referred to as the representative alignment, the other are supplementary. |
Estimated read length |
Total number of input bases divided by the number of reads. |
Bases in reference genome |
Total number of bases in the reference genome. |
Bases in target bed [% of genome] |
Total number of bases in the target region |
Average sequenced coverage over genome |
Total number of reads times the average read length divided by the total number of loci in the genome. |
Average alignment coverage over genome |
Total number of aligned bases divided by the total number of loci in the genome. |
PCT of genome with coverage [100x:inf) |
Percentage of bases in the genome with a range of coverage greater than or equal to 100x. |
PCT of genome with coverage [50x:100x) |
Percentage of bases in the genome with a range of coverage greater than or equal to 50x and less than 100x. |
PCT of genome with coverage [20x:50x) |
Percentage of bases in the genome with a range of coverage greater than or equal to 20x and less than 50x. |
PCT of genome with coverage [10x:20x) |
Percentage of bases in the genome with a range of coverage greater than or equal to 10x and less than 20x. |
PCT of genome with coverage [3x:10x) |
Percentage of bases in the genome with a range of coverage greater than or equal to 3x and less than 10x. |
PCT of genome with coverage [0x:3x) |
Percentage of bases in the genome with a range of coverage greater than or equal to 0x and less than 3x. |
Average chr X coverage |
Total number of bases that aligned to chromosome X (or to the intersection of chromosome X with the target region) divided by the total number of loci in chromosome X (or to the intersection of chromosome X with the target region). If there is no chromosome X in the reference genome, this metric shows as NA. |
Average chr Y coverage |
Total number of bases that aligned to chromosome Y (or to the intersection of chromosome Y with the target region) divided by the total number of loci in chromosome Y (or in the intersection of chromosome Y with the target region). If there is no chromosome Y in the reference genome, this metric shows as NA. |
Average mitochondrial coverage |
Total number of bases that aligned to the mitochondrial chromosome (or to the intersection of the mitochondrial chromosome with the target region) divided by the total number of loci in the mitochondrial chromosome (or in the intersection of the mitochondrial chromosome with the target region). If there was no mitochondrial chromosome in the reference genome, this metric shows as NA. |
Average autosomal coverage |
Total number of bases that aligned to autosomes (or to the autosomal loci in the target region) divided by the total number of loci in the autosomes (or to the autosomal loci in the target region). If there was no autosome in the reference genome, this metric shows as NA. |
Median autosomal coverage |
Median alignment coverage over the autosomes (or over the autosomal loci in the target region). If there was no autosome in the reference genome, this metric shows as NA. |
Mean/Median autosomal coverage ratio |
Mean autosomal coverage in the genome (or in the target region) divided by the median autosomal coverage in the genome (or in the target region). If there was no autosome in the reference genome, this metric shows as NA. |
XAvgCov/YAvgCov ratio |
Average chromosome X alignment coverage in the genome (or in the target region) divided by the average chromosome Y alignment coverage in the genome (or in the target region). If there was chromosome X or chromosome Y in the reference genome, this metric shows as NA. |
XAvgCov/AutosomalAvgCov ratio |
Average chromosome X alignment coverage in the genome (or in the target region) divided by the average autosomal coverage in the genome (or in the target region). |
Provided sex chromosome ploidy |
The specified sex chromosome of the sample (if entered). |
Predicted sex chromosome ploidy |
The sex chromosome of the sample as determined through the analysis. |
Capture Specificity based on target region |
Number of unique (nonduplicate) reads that mapped inside the target region with a MAP quality greater than 0, divided by the number of unique (nonduplicate) reads that mapped anywhere in the reference with a MAP quality greater than 0. |
DRAGEN mapping rate [mil. reads/second] |
The rate at which the pipeline generates seeds from the read and looks for exact matches in the reference genome. |
Total reads in RG |
Total number of reads in the read group. |
Insert length: mean |
The average length of the sequence in between a pair of reads. |
Insert length: standard deviation |
The standard deviation of the length of the sequence in between a pair of reads. |