STR Annotation Threshold Files
The input for custom STR annotation thresholds is a *.tsv file that specifies the number of units required for annotation of repeat expansions in the specified genome build.
STR annotation threshold files contain a header section and a data section:
• | Header—The header row specifies the genome assembly, GRCh37 or GRCh38, in the format #assembly=<assembly>. |
• | Data—The data section includes columns described in the following table. Each row corresponds to a loci. |
Field |
Description |
---|---|
Chrom |
The name of the chromosome. |
Start |
The genomic coordinate of the start position. |
End |
The genomic coordinate of the end position. |
Phenotype |
The phenotype associated with the STR. |
OMIM ID |
The OMIM ID for the phenotype. |
Repeat numbers |
The observed repeat lengths. |
Allele counts |
The number of times a repeat length has been observed, listed in order by repeat number. |
Classifications |
The STR classification. Possible entries are Normal and Expanded. |
Classification ranges |
The threshiolds for normal and expanded classifications. Modify the ranges to change the genetic finding classifications. |