STR Annotation Threshold Files
The input for custom STR annotation thresholds is a *.tsv file that specifies the number of units required for annotation of repeat expansions in the specified genome build.
STR annotation threshold files contain a header section and a data section:
| • | Header—The header row specifies the genome assembly, GRCh37 or GRCh38, in the format #assembly=<assembly>. |
| • | Data—The data section includes columns described in the following table. Each row corresponds to a loci. |
|
Field |
Description |
|---|---|
|
Chrom |
The name of the chromosome. |
|
Start |
The genomic coordinate of the start position. |
|
End |
The genomic coordinate of the end position. |
|
Phenotype |
The phenotype associated with the STR. |
|
OMIM ID |
The OMIM ID for the phenotype. |
|
Repeat numbers |
The observed repeat lengths. |
|
Allele counts |
The number of times a repeat length has been observed, listed in order by repeat number. |
|
Classifications |
The STR classification. Possible entries are Normal and Expanded. |
|
Classification ranges |
The threshiolds for normal and expanded classifications. Modify the ranges to change the genetic finding classifications. |