Command-line Options

For a complete list of command line options, see Command Line Options.

DRAGEN Command-Line Options

Perform the following command-line options using the dragen:

•

Build Reference/Hash Table

dragen --build-hash-table true --ht-reference <REF_FASTA> \
--output-directory <REF_DIRECTORY> [options]

•

Run Map/Align and Variant Caller (*.fastq to *.vcf)

dragen -r <REF_DIRECTORY> --output-directory <OUT_DIRECTORY> \
--output-file-prefix <FILE_PREFIX> [options] -1 <FASTQ1> \
[-2 <FASTQ2>] –-RGID <RG0> --RGSM <SM0> --enable-variant-caller true

•

Run Map/Align (*.fastq to *.bam)

dragen -r <REF_DIRECTORY> --output-directory <OUT_DIRECTORY> \
--output-file-prefix <FILE_PREFIX> [options] \
-1 <FASTQ1> [-2 <FASTQ2>] \
--RGID <RG0> --RGSM

•

Run Variant Caller (*.bam to *.vcf)

dragen -r <REF_DIRECTORY> --output-directory <OUT_DIRECTORY> \
--output-file-prefix <FILE_PREFIX> [options] -b <BAM> \
--enable-variant-caller true

•

Run BCL Converter (BCL to *.fastq)

dragen --bcl-conversion-only true --bcl-input-directory <BCL_DIRECTORY> \
--output-directory <OUT_DIRECTORY>

•

Run RNA Map/Align (*.fastq to *.bam)

dragen -r <REF_DIRECTORY> --output-directory <OUT_DIRECTORY> \
--output-file-prefix <FILE_PREFIX> [options] -1 <FASTQ1> \
[-2 <FASTQ2>] --enable-rna true

Operating Modes

DRAGEN has two primary modes of operation, as follows:

•

Mapper/aligner

•

Variant caller

DRAGEN is capable of performing each mode independently or as an end-to-end solution. DRAGEN also allows you to enable and disable decompression, sorting, duplicate marking, and compression along the DRAGENpipeline.

Output Options

The following commandline options for output are mandatory:

•

--output-directory <out_dir>—Specifies the output directory for generated files.

•

--output-file-prefix <out_prefix>—Specifies the output file prefix. DRAGENappends the appropriate file extension onto this prefix for each generated file.

•

-r [ --ref-dir ] specifies the reference hash table.

For brevity, the following examples do not include these mandatory options.

For mapping and aligning, the output is sorted and compressed into BAM format by default before saving to disk. You can control the output format from the map/align stage with the --output-format <SAM|BAM|CRAM> option. If the output file exists, DRAGEN issues a warning and exits. To force overwrite if the output file already exists, use the -f [ --force ] option.

For example, the following commands output to a compressed BAM file and forces overwrite:

dragen ... -f

dragen ... -f --output-format bam

To generate a BAI-format BAM index file (.bai file extension), set --enable-bam-indexing to true.

The following example outputs to a SAM file and forces overwrite:

dragen ... -f --output-format sam

The following example outputs to a CRAM file and forces overwrite:

dragen ... -f --output-format cram

DRAGEN can generate mismatch difference (MD) tags, as described in the BAM standard. The feature is turned off by default because there is a small performance cost to generate these strings. To generate MD tags, set --generate-md-tags to true.

To generate ZS:Z alignment status tags, set --generate-zs-tags to true. These tags are only generated in the primary alignment and when a read has suboptimal alignments qualifying for secondary output (even if none were output because --Aligner.sec-aligns was set to 0).

The following are valid tag values:

•

ZS:Z:R—Multiple alignments with similar score were found.

•

ZS:Z:NM—No alignment was found.

•

ZS:Z:QL—An alignment was found, but it was below the quality threshold.

To generate SA:Z tags, set --generate-sa-tags to true (the default). These tags provide alignment information (position, cigar, orientation) of groups of supplementary alignments, which are useful in structural variant calling.

Read Group Options

DRAGEN assumes that all the reads in a given FASTQ belong to the same read group. The DRAGEN system creates a single @RG read group descriptor in the header of the output BAM file, with the ability to specify the following standard BAM attributes:

Attribute	Argument	Description
ID	--RGID	Read group identifier. If you include any of the read group parameters, RGID is required. It is the value written into each output BAM record.
LB	--RGLB	Library.
PL	--RGPL	Platform/technology used to produce the reads. The BAM standard allows for values CAPILLARY, LS454, ILLUMINA, SOLID, HELICOS, IONTORRENT and PACBIO.
PU	--RGPU	Platform unit, eg, flowcell-barcode.lane.
SM	--RGSM	Sample.
CN	--RGCN	Name of the sequencing center that produced the read.
DS	--RGDS	Description.
DT	--RGDT	Date the run was produced.
PI	--RGPI	Predicted mean insert size.

If any of these arguments are present, the DRAGEN software adds an RG tag to all the output records to indicate that they are members of a read group. The following example shows a command line that includes read group parameters:

dragen --RGID 1 --RGCN Broad --RGLB Solexa-135852 \

--RGPL Illumina --RGPU 1 --RGSM NA12878 \

-r /staging/human/reference/hg19/hg19.fa.k_21.f_16.m_149 \

-1 SRA056922.fastq --output-directory /staging/tmp/ \

--output-file-prefix rg_example

When using the --fastq-list option to input multiple read groups, BAM tags (and others) are specified for each read group by adding columns to the fastq_list.csv file. Each column heading consists of four capital letters and each begins with 'RG'. For each column, each read group’s values for that column are propagated to the output BAM file in an identically named tag.