VCF Input Requirements
BaseSpace Variant Interpreter imports single nucleotide variants (SNVs), multiple nucleotide variants (MNVs), structural variants (SVs), copy number variants (CNVs), and indels reported in VCF and genome VCF (*.vcf or *.vcf.gz) file formats, v4.1.
Annotations are applied only to human genomes aligned to
CAUTION
If the genome assembly is not specified in the reference header, ingestion can fail.
The following table provides an example VCF file format. Columns and values vary by variant caller, and only variations produced by compatible variant callers are supported.
CAUTION
Imported VCF files must be sorted and indexed correctly to ensure valid calculation results in BaseSpace Variant Interpreter.
VCF Column |
Required Value |
|||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CHROM |
The reference sequence name. Values are #, or chr#, where # is 1 of the following:
|
|||||||||||||||||||||||||||||||||||||||
POS |
The position of the variant. Values are numeric with the first base taking position 1 (1-based). |
|||||||||||||||||||||||||||||||||||||||
ID |
The ID is the rs number for the SNP as recorded in the software. A value must be present. If a dbSNP entry does not exist, a missing value marker '.' is an acceptable value. Although the ID column and valid values are required, the values are not imported. The software applies dbSNP annotations when available. |
|||||||||||||||||||||||||||||||||||||||
REF |
The reference allele. |
|||||||||||||||||||||||||||||||||||||||
ALT |
The alternate allele. |
|||||||||||||||||||||||||||||||||||||||
QUAL |
The quality score assigned by the variant caller. A value of '.' is acceptable, and is reported as 0. |
|||||||||||||||||||||||||||||||||||||||
FILTER |
The status of the variant call quality as annotated in the VCF file. PASS indicates that all filters passed. Otherwise, the variant call filter is listed. |
|||||||||||||||||||||||||||||||||||||||
INFO |
The recognized field is DP (read depth), which is represented in the Read Depth column in the Variants table. A value of '.' (none) is acceptable. |
|||||||||||||||||||||||||||||||||||||||
FORMAT |
A list of fields that define values in the Sample column. Possible values are '.' (none) and the following:
Genotype Values: (Not present in somatic VCF files.)
|