Set Analysis Parameters

Open DRAGEN Germline Pipeline from BaseSpace™ Sequence Hub as follows.

Select the Apps tab, and then select DRAGEN Germline Pipeline.
From the Version drop-down list, select 3.3.7.
Select Launch Application.

To override the default analysis name, enter a preferred analysis name in the Analysis Name field.

The default is the app name with the date and time the session started.

From the Save Results To field, select Select Project, and then select a project to store app results to.

Specify a sample by selecting the option that matches the input file type. Multiple samples can be selected in a single row.

[Optional] From the Sample Sex drop-down list, select the sex of the sample.

If you plan on using Expansion Hunter for the analysis, you must specify the sex.

Set the analysis pipeline configuration.

•

Map/Align Only–Samples are mapped and aligned to the reference genome and position-sorted.

•

Map/Align + Small Variant Caller–In addition to the Map/Align Only processes, variant calling is performed.

•

Small Variant Caller–Only variant calling is performed. This configuration only accepts BAM inputs files.

From the Reference drop-down list, select a reference genome.

If you selected Custom from the Reference drop-down list, select the custom DRAGEN and/or FASTA reference files.

•

DRAGEN–The custom reference file must have been generated by the DRAGEN Reference Builder v3.x apps.

•

FASTA–Used for Expansion Hunter or SV.

From the Map/Align Output drop-down list, specify whether to output a BAM, CRAM, or no alignment file at all.

10.

From the Variant Caller Output drop-down list, specify the VCF output type:

•

VCF

•

GVCF–Variants are recorded individually and nonvariants are grouped into blocks.

•

GVCF with BP_RESOLUTION–Variants and nonvariants are recorded individually. This option will increase run time and create large gVCF files, eg, 2 hours for a 30x sample with a 20 GB gVCF.

11.

[Optional] If you enabled a variant caller for the analysis, select a target BED file.

The contig names must match those of the chosen reference. If a mismatch is detected, variant calling will be abandoned.

12.

Specify regions over which to produce coverage metrics as follows:

Select Select Dataset File(s).
Select the BED file(s) that contain the regions for which you want to produce coverage metrics.
Enter a MAPQ filter value. Any read with a MAPQ value less than this threshold will be filtered out.
Enter a BQ filter value. Any base call with a quality score less than this threshold will be filtered out.
Set the Full-Res option. Disabled by default, enabling this option will generate large files.

13.

Specify additional run settings by expanding the function headings and selecting the appropriate checkboxes.

Heading

Description

CNV

Enables germline CNV calling. If enabled, select the Segmentation Algorithm.

•

Circular Binary Segmentation–Iteratively identifies change points in a genomic sequence using a nonparametric hypothesis testing approach.

•

Shifting Level Model–Models genomic data as the sum of two independent stochastic processes and segments using a subclass of Hidden Markov Model.

SV (Manta)

Enables SV (Manta) analysis.

Expansion Hunter

Enables calling of repeat-expansion variants.

Advanced Settings

•

Duplicate Marking–Enabled by default.

•

BQD–Enable base quality drop off detection. Enabled by default.

•

Depth of Coverage–Enables depth-of-coverage calculations. A target BED file is required, and coverage calculations are performed over the regions definited in the BED file. *genomecov.bed file is generated.

•

dbSNP VCF–Select the variant annotation database .vcf or .vcf.gz file.

Automation Settings

Enables automation settings. Specify a sample by selecting the option that matches the input file type and the sex.

14.

Select Launch Application to start the analysis.

When the analysis is complete, the status of the app session is automatically updated and you receive a confirmation email.

Related articles

Set Analysis Parameters