Set Analysis Parameters

1. Open DRAGEN Germline from BaseSpace™ Sequence Hub as follows.
  1. Select the Apps tab, and then select DRAGEN Germline.
  2. From the Version drop-down list, select 3.4.5.
  3. Select Launch Application.
2. To override the default analysis name, enter a preferred analysis name in the Analysis Name field.

The default is the app name with the date and time the session started.

3. From the Save Results To field, select Select Project, and then select a project in which to store the app results.
4. Specify a sample by selecting the option that matches the input file type. Multiple samples of the same type can be selected in a single row.
5. [Optional] From the Sample Sex drop-down list, select the sex of the sample.

If you are using Expansion Hunter for the analysis, you must specify the sex.

[Optional] Select Add a New Row to add more samples to the run.

6. Set the analysis pipeline configuration.
Map/Align–Samples are mapped and aligned to the reference genome and position-sorted.
Map/Align + Small Variant Caller–In addition to the Map/Align processes, variant calling is performed.
Small Variant Caller–Only variant calling is performed. This configuration accepts BAM or CRAM input files.
7. From the Reference drop-down list, select a reference genome.
8. If you selected Custom from the Reference drop-down list, select the custom DRAGEN and/or FASTA reference files.
Custom DRAGEN Reference File–The custom reference file must be generated by the DRAGEN Reference Builder app.
Custom FASTA Reference File for SV or Expansion Hunter–Used for Expansion Hunter or SV.
9. From the Map/Align Output drop-down list, specify whether to output a BAM, CRAM, or no alignment file at all.
10. From the Small Variant Caller Output drop-down list, specify the VCF output type:
VCF
GVCF–Variants are recorded individually and nonvariants are grouped into blocks.
GVCF with BP_RESOLUTION–Variants and nonvariants are recorded individually. This option increases run time and produces large gVCF files (for example, 2 hours and a 20 GB gVCF for a 30x sample).
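
The difference between the two gVCF output types can be illustrated with a minimal Python sketch. It is not DRAGEN code: the function name `to_gvcf_records` and the `(position, is_variant)` input format are hypothetical, and the sketch ignores details such as quality-based block boundaries that real gVCF writers typically apply.

```python
def to_gvcf_records(calls, bp_resolution=False):
    """Turn per-position calls on one contig into (start, end, kind) records.

    calls is a list of (position, is_variant) tuples sorted by position.
    This input format is an assumption made for illustration only.
    """
    records = []
    for pos, is_variant in calls:
        if is_variant:
            # Variants are always recorded individually.
            records.append((pos, pos, "variant"))
        elif bp_resolution:
            # BP_RESOLUTION: every nonvariant position gets its own record.
            records.append((pos, pos, "ref"))
        elif records and records[-1][2] == "ref_block" and records[-1][1] == pos - 1:
            # Extend the open nonvariant block by one position.
            start, _, kind = records[-1]
            records[-1] = (start, pos, kind)
        else:
            # Start a new nonvariant block.
            records.append((pos, pos, "ref_block"))
    return records


calls = [(1, False), (2, False), (3, True), (4, False), (5, False), (6, False)]
blocked = to_gvcf_records(calls)
per_base = to_gvcf_records(calls, bp_resolution=True)
print(blocked)        # three records: two blocks and one variant
print(len(per_base))  # one record per position
```

Because every position produces its own record with BP_RESOLUTION, the record count grows with the size of the covered region rather than with the number of variants, which is why the output files are much larger.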
11. [Optional] Select a target BED file to restrict processing of the small variant caller and target BED-related coverage and callability metrics to regions specified in this file.

The contig names must match those of the chosen reference. If a mismatch is detected, analysis will abort.
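
Before launching, it can help to check the BED file against the reference locally. The following Python sketch assumes you have a FASTA index (`.fai`) for the chosen reference, whose first column holds the contig names. The helper names are hypothetical, and the check only approximates whatever validation the app performs.

```python
def bed_contigs(bed_lines):
    """Collect contig names from the first column of BED lines."""
    names = set()
    for line in bed_lines:
        line = line.strip()
        # Skip blank lines and common BED header lines.
        if not line or line.startswith(("#", "track", "browser")):
            continue
        names.add(line.split()[0])
    return names


def fai_contigs(fai_lines):
    """Collect contig names from the first column of a FASTA index."""
    return {line.split()[0] for line in fai_lines if line.strip()}


def missing_contigs(bed_lines, fai_lines):
    """Return BED contig names that are absent from the reference index."""
    return sorted(bed_contigs(bed_lines) - fai_contigs(fai_lines))


# Illustrative data: the BED file uses "2" where the reference uses "chr2".
fai = ["chr1\t1000\t6\t60\t61", "chr2\t2000\t1030\t60\t61"]
bed = ["chr1\t100\t200", "2\t300\t400"]
print(missing_contigs(bed, fai))
```

An empty result means every contig named in the BED file exists in the reference index. A common cause of mismatches is mixing `chr`-prefixed and unprefixed contig names.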

12. Define regions over which to produce coverage metrics as follows.
  1. Select the BED file(s) that contain the regions for which you want to produce coverage metrics.
  2. Enter a MAPQ filter value. Any read with a MAPQ value less than this threshold will be filtered out. Default value is 1.
  3. Enter a BQ filter value. Any base call with a quality score less than this threshold will be filtered out. Default value is 0.
  4. Enter a filename tag. The tag must contain only letters, numbers, and underscores. The tags "wgs" and "target_bed" are reserved and cannot be used.
  5. Set the Full-Res option. This option is disabled by default. Enabling it generates large files.
  6. [Optional] Select Add a New Row to add up to 5 BED files to produce coverage metrics.
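
The threshold and filename-tag rules in the substeps above can be expressed as a short Python sketch. The function names are hypothetical, and treating the reserved tags as case-sensitive is an assumption; the app may apply its own checks.

```python
import re

RESERVED_TAGS = {"wgs", "target_bed"}      # reserved tags named in this procedure
TAG_PATTERN = re.compile(r"[A-Za-z0-9_]+")  # letters, numbers, and underscores


def tag_problem(tag):
    """Return a description of what is wrong with a tag, or None if it is usable."""
    if not TAG_PATTERN.fullmatch(tag):
        return "tag must contain only letters, numbers, and underscores"
    if tag in RESERVED_TAGS:  # assumption: comparison is case-sensitive
        return "tag is reserved"
    return None


def keep_read(mapq, threshold=1):
    """A read is filtered out when its MAPQ is less than the threshold."""
    return mapq >= threshold


def keep_base(base_quality, threshold=0):
    """A base call is filtered out when its quality is less than the threshold."""
    return base_quality >= threshold


for candidate in ["exome_v2", "wgs", "my-tag"]:
    print(candidate, "->", tag_problem(candidate) or "ok")
```

With the default thresholds, only reads with a MAPQ of 0 are excluded, and no base calls are excluded on quality.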
13. Specify additional run settings by expanding the function headings and selecting the appropriate checkboxes.

Heading

Description

CNV

Enables germline CNV calling. If enabled, select the Segmentation Algorithm.

Circular Binary Segmentation–Iteratively identifies change points in a genomic sequence using a nonparametric hypothesis testing approach. Recommended for whole exome processing.
Shifting Level Model–Models genomic data as the sum of two independent stochastic processes and segments using a subclass of Hidden Markov Model. Recommended for whole genome processing.

SV (Manta)

Enables SV (Manta) analysis. If enabled, set the following options:

Depth Filters–For high-coverage input, turn off depth filters.
SV BED File–[Optional] Select the BED file that will be passed into the SV analysis.

Expansion Hunter

Enables calling of repeat-expansion variants.

UMI Settings

Enables UMI-based read processing when the run is configured for the Map/Align pipeline configuration. This setting is disabled by default and is provided for experimental purposes only.

UMI Min Supporting Reads–Sets the minimum number of supporting reads required for a family. Default value is 2.

Advanced Settings

Duplicate Marking–Enabled by default.
BQD–Enables base quality drop-off detection. Enabled by default.
dbSNP VCF–Select the variant annotation database *.vcf or *.vcf.gz file.
ForceGT VCF–Select the *.vcf or *.vcf.gz file containing the list of variants to force genotype.

Automation Settings

Enables automation settings. Specify a sample by selecting the option that matches the input file type and the sex.

14. Select Launch Application to start the analysis.

When the analysis is complete, the status of the app session is automatically updated and you receive a confirmation email.