nf-core/sarek

Analysis pipeline to detect germline or somatic variants (pre-processing, variant calling and annotation) from WGS / targeted sequencing

annotationcancergatk4genomicsgermlinepre-processingsomatictarget-panelsvariant-callingwhole-exome-sequencingwhole-genome-sequencing

These pages are for an old version of the pipeline (2.7.1). The latest stable release is 3.6.0 .

Launch version 2.7.1 https://github.com/nf-core/sarek

Define where the pipeline should find input data and save output data.

Path to input file(s).

required

type: string

Starting step.

required

type: string

The output directory where the results will be saved.

type: string

default: ./results

Option used for most of the pipeline

Tools to use for variant calling and/or for annotation.

type: string

default: null

Disable usage of intervals.

type: boolean

Estimate interval size.

type: number

default: 1000

Enable Sentieon if available.

type: boolean

Disable specified QC and Reporting tools.

type: string

default: null

Target BED file for whole exome or targeted sequencing.

type: string

Run Trim Galore.

hidden

type: boolean

Remove bp from the 5’ end of read 1.

hidden

type: integer

Remove bp from the 5’ end of read 2.

hidden

type: integer

Remove bp from the 3’ end of read 1 AFTER adapter/quality trimming has been performed.

hidden

type: integer

Remove bp from the 3’ end of read 2 AFTER adapter/quality trimming has been performed.

hidden

type: integer

Apply the —nextseq=X option, to trim based on quality after removing poly-G tails.

hidden

type: integer

Save trimmed FastQ file intermediates

hidden

type: boolean

Specify how many reads should be contained in the split FastQ file

hidden

type: number

Specify aligner to be used to map reads to reference genome.

hidden

type: string

Establish values for GATK MarkDuplicates memory consumption

hidden

type: string

default: -Xms4000m -Xmx7g

Enable usage of GATK Spark implementation

type: boolean

Save Mapped BAMs

type: boolean

Skip GATK MarkDuplicates

type: boolean

Overwrite ASCAT ploidy

type: string

default: null

Overwrite ASCAT purity

type: string

default: null

Overwrite Control-FREEC coefficientOfVariation

type: number

default: 0.05

Overwrite Control-FREEC contaminationAdjustement

type: boolean

Design known contamination value for Control-FREEC

type: string

default: null

Overwrite Control-FREEC ploidy

type: number

default: 2

Overwrite Control-FREEC window size

type: number

Generate g.vcf output from GATK HaplotypeCaller

type: boolean

Will not use Manta candidateSmallIndels for Strelka

type: boolean

Panel-of-normals VCF (bgzipped) for GATK Mutect2 / Sentieon TNscope

type: string

Index of PON panel-of-normals VCF

type: string

Do not analyze soft clipped bases in the reads for GATK Mutect2

type: boolean

If provided, UMIs steps will be run to extract and annotate the reads with UMI and create consensus reads

type: boolean

When processing UMIs, a read structure should always be provided for each of the fastq files.

type: string

default: null

When processing UMIs, a read structure should always be provided for each of the fastq files.

type: string

default: null

Specify from which tools Sarek should look for VCF files to annotate

hidden

type: string

default: null

Enable the use of cache for annotation

hidden

type: boolean

Enable CADD cache.

hidden

type: boolean

Path to CADD InDels file.

hidden

type: string

default: null

Path to CADD InDels index.

hidden

type: string

default: null

Path to CADD SNVs file.

hidden

type: string

default: null

Path to CADD SNVs index.

hidden

type: string

default: null

Enable the use of the VEP GeneSplicer plugin.

hidden

type: boolean

Path to snpEff cache

hidden

type: string

default: null

Path to VEP cache

hidden

type: string

default: null

Options for the reference genome files

Name of iGenomes reference.

type: string

Path to ASCAT loci file.

type: string

Path to ASCAT GC correction file.

type: string

Path to BWA mem indices.

type: string

Path to chromosomes folder.

type: string

Path to chromosomes length file.

type: string

Path to dbsnp file.

type: string

Path to dbsnp index.

type: string

Path to FASTA dictionary file.

type: string

Path to FASTA genome file.

type: string

Path to FASTA reference index.

type: string

Path to GATK Mutect2 Germline Resource File

type: string

Path to GATK Mutect2 Germline Resource Index

type: string

Path to intervals file

type: string

Path to known indels file

type: string

Path to known indels file index

type: string

Path to Control-FREEC mappability file

type: string

snpEff DB version

type: string

snpEff species

type: string

VEP cache version

type: string

Save built references

type: boolean

Directory / URL base for iGenomes references.

type: string

default: s3://ngi-igenomes/igenomes

Directory / URL base for genomes references.

type: string

default: null

Do not load the iGenomes reference config.

type: boolean

Less common options for the pipeline, typically set in a config file.

Display help text.

type: boolean

Method used to save pipeline results to output directory.

hidden

type: string

Email address for completion summary.

type: string

pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

Boolean whether to validate parameters against the schema at runtime

hidden

type: boolean

default: true

Email address for completion summary, only when pipeline fails.

hidden

type: string

pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

Send plain-text email instead of HTML.

hidden

type: boolean

File size limit when attaching MultiQC reports to summary emails.

hidden

type: string

default: 25.MB

Do not use coloured log outputs.

hidden

type: boolean

Path to MultiQC custom config file.

hidden

type: string

Directory to keep pipeline Nextflow logs and reports.

hidden

type: string

default: ${params.outdir}/pipeline_info

Name of sequencing center to be displayed in BAM file

hidden

type: string

default: null

Show all params when using --help

hidden

type: boolean

Set the top limit for requested resources for any single job.

type: integer

default: 8

Use to set memory for a single CPU.

type: string

default: 7 GB

Maximum number of CPUs that can be requested for any single job.

hidden

type: integer

default: 16

Maximum amount of memory that can be requested for any single job.

hidden

type: string

default: 128.GB

pattern: ^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$

Maximum amount of time that can be requested for any single job.

hidden

type: string

default: 240.h

pattern: ^(\d+\.?\s*(s|m|h|day)\s*)+$

Parameters used to describe centralised config profiles. These should not be edited.

Git commit id for Institutional configs.

hidden

type: string

default: master

Base directory for Institutional configs.

hidden

type: string

default: https://raw.githubusercontent.com/nf-core/configs/master

Institutional configs hostname.

hidden

type: string

Institutional config name.

hidden

type: string

Institutional config description.

hidden

type: string

Institutional config contact information.

hidden

type: string

Institutional config URL link.

hidden

type: string

On this page