Summary Report (*.summary.csv)

The MethylSeq v1.0 app provides a MethylSeq Summary Report containing sample results in a comma-separated values (CSV) format (*.summary.csv). This report is an overview of statistics for the sample.

Statistic

Definition

Sample ID

IDs of samples reported on in the file.

Sample Name

Names of samples reported on in the file.

Run Folder

Run folders for samples reported on in the file.

Reference Genome

Reference genome selected.

Paired End

When true, the input sample is paired-end.

When false, the input sample is single-end.

Number of 5' bases to crop during trimming

The number of 5' bases cropped from each read during the FASTQ trimming stage.

Minimum 5' base quality during trimming

All 5' bases with a quality score less than this value are removed during the FASTQ trimming stage.

Minimum 3' base quality during trimming

All 3' bases with a quality score less than this value are removed during the FASTQ trimming stage.

Input methylation library directional or not

When true, the methylation preparation library only contains reads from bisulfite-treated top and bottom strands.

When false, the library contains additional reads complementary to bisulfite-treated top and bottom strand.

PCR duplicates flagged or not

When true, PCR duplicates are flagged in the final .bam files. If the input data are single-end, this setting is ignored and no duplicates are marked.

Remove duplicate methylation calls in overlapping regions of two ends

Applies to paired-end reads with an overlap for Read 1 and Read 2. When true, this option tells the app to score methylation calls in the overlap area only one time, which removes a bias towards more methylation calls towards the center of sequenced fragments. This option does not apply to single-end data.

Number of 5' bases to ignore in methylation-calling

Number of 5' bases to ignore during the methylation calling stage.

Enable cytosine report output

When true, a Cytosine Report containing methylation status for every cytosine in the genome (including both strands) is generated, with the file name SampleName_SampleNumber.CX_report.txt.gz. See Cytosine Report (*.CX_report.txt.gz).

Diversity

Measuring diversity of input library. It is the number of unique input DNA fragments. Because of the low complexity associated with MethylSeq data, this value might not apply.

Total PF reads

The number of reads (2x the number of pairs for paired-end data) in the trimmed FASTQ files.

Total aligned Read 1

The total number in Read 1 that are aligned to the reference genome.

Total aligned Read 2

The total number in Read 2 that are aligned to the reference genome.

Percent aligned Read 1

The percentage of Read 1 that are aligned to the reference genome.

Percent aligned Read 2

The percentage of Read 2 that are aligned to the reference genome.

Percent duplicate read pairs

Percentage of paired reads that have duplicates.

Fragment length median

Median length of the sequenced fragment. The fragment length is calculated based on the locations at which a read pair aligns to the reference. The read mapping information is parsed from the BAM files.

Fragment length min

Minimum length of the sequenced fragment.

Fragment length max

Maximum length of the sequenced fragment.

Fragment length SD

Standard deviation of the sequenced fragment length.

Total PF bases

The number of bases passing filter for the sample.

Total PF bases Read 1

The number of bases passing filter for Read 1.

Total PF bases Read 2

The number of bases passing filter for Read 2.

Percent Q30 bases Read 1

Percentage of bases from Read 1 with a quality score of 30 or higher.

Percent Q30 bases Read 2

Percentage of bases from Read 2 with a quality score of 30 or higher.

Total aligned bases

The total number of bases present in the data set that aligned to the reference genome.

Total aligned bases Read 1

The total number of bases that are aligned to the reference genome for Read 1.

Total aligned bases Read 2

The total number of bases that are aligned to the reference genome for Read 2.

Percent aligned bases

The percentage of aligned bases for the reads.

Percent aligned bases Read 1

The percentage of aligned bases for Read 1.

Percent aligned bases Read 2

The percentage of aligned bases for Read 2.

Mean coverage

The total number of aligned bases divided by the genome size.

Top Strand

The number of read pairs (or reads when single-end) aligned to the bisulfite-converted top strand of reference genome.

Complementary Top Strand

The number of aligned reads that are complementary to the original bisulfite-treated top strand. When the library prep kit is directional, the value is 0.

Bottom Strand

The number of aligned reads that are the original bisulfite-treated bottom strand.

Complementary Bottom Strand

The number of aligned reads that are complementary to the original bisulfite-treated bottom strand. When the library prep kit is directional, the value is 0.

Number of Cs analyzed

The number of C bases in aligned reads.

Number of Methylated Cs in CpG

The number of methylated C bases in aligned reads that are in the context of CpG.

Number of Methylated Cs in CHG

The number of methylated C bases in aligned reads that are in the context of CHG.

Number of Methylated Cs in CHH

The number of methylated C bases in aligned reads that are in the context of CHH.

Number of Methylated Cs in others

The number of methylated C bases in aligned reads that are in other contexts.

Number of Unmethylated Cs in CpG

The number of unmethylated C bases in aligned reads that are in the context of CpG.

Number of Unmethylated Cs in CHG

The number of unmethylated C bases in aligned reads that are in the context of CHG.

Number of Unmethylated Cs in CHH

The number of unmethylated C bases in aligned reads that are in the context of CHH.

Number of Unmethylated Cs in others

The number of unmethylated C bases in aligned reads that are in other contexts.

Percent Methylated Cs in CpG

The percent of methylated C bases in aligned reads that are in the context of CpG.

Percent Methylated Cs in CHG

The percent of methylated C bases in aligned reads that are in the context of CHG.

Percent Methylated Cs in CHH

The percent of methylated C bases in aligned reads that are in the context of CHH.

Percent Methylated Cs in others

The percent of methylated C bases in aligned reads that are in other contexts (besides contexts listed).

 

© 2015 Illumina, Inc. All rights reserved.

15071187 Rev. A