@@ -11,7 +11,7 @@ mv ./nextflow ~/bin
1111export NXF_WORK=$SNIC_NOBACKUP /work
1212
1313# Pull worfklow from this repo, run manta, normalize, and variant effect predictor:
14- nextflow run NBISweden/wgs-structvar --project < uppmax_project_id> --bam < bamfile.bam> --steps manta,normalize,vep
14+ nextflow run -profile milou NBISweden/wgs-structvar --project < uppmax_project_id> --bam < bamfile.bam> --steps manta,normalize,vep
1515
1616# Monitor log file
1717tail -f .nextflow.log
@@ -32,6 +32,29 @@ used for that dataset. If you have access to the structural variants in the
3232swegen dataset you can add that file to the pipeline and thereby have the
3333ability to filter population specific variants.
3434
35+ ### Profiles for running on Uppmax HPC clusters
36+
37+ It is possible to run the pipeline in a few different ways. Either as a
38+ single-node job or letting nextflow distribute the tasks using the SLURM
39+ queing engine. There is also some slight differences in module usage depending
40+ on which HPC system is used.
41+
42+ specify the profile to use with the ` -profile ` option to NextFlow:
43+
44+ <dl >
45+ <dt >-profile milou</dt >
46+ <dd >Run on the milou cluster using the queueing system (for example,
47+ directly from the login node).
48+ <dt >-profile miloulocal</dt >
49+ <dd >Run on milou but only on the local node. Use this in a batch job on one
50+ node, reserve it for 48 hours and everything should be ok.
51+ <dt >-profile bianca</dt >
52+ <dd >The same as `milou` but on the Bianca system</dd >
53+ <dt >-profile biancalocal</dt >
54+ <dd >The same as `miloulocal` but on the Bianca system</dd >
55+ </dl >
56+
57+
3558### Masking
3659
3760#### Artifact masking
@@ -54,7 +77,7 @@ filters put the `bed` files in the `mask_cohort/` subdirectory and add the
5477``` bash
5578cp some_bed_file.bed < path-to-wgs-structvar> /mask_cohort/
5679
57- nextflow run < path-to-wgs-structvar> /main.nf --project < uppmax_project_id> --bam < bamfile.bam> --steps manta,normalize,vep,mask_cohort
80+ nextflow run -profile biancalocal < path-to-wgs-structvar> /main.nf --project < uppmax_project_id> --bam < bamfile.bam> --steps manta,normalize,vep,mask_cohort
5881```
5982
6083You can configure the location of the cohort mask files with the
@@ -80,8 +103,11 @@ Options:
80103 with a run id to more easily keep track of the run (otherwise
81104 it\'s autogenerated).
82105 --project Uppmax project to log cluster time to
83- OR
84- -profile local Run locally, only for very small datasets
106+ -profile <profile>
107+ Where profile can be any of milou, localmilou, bianca,
108+ localbianca and devel. The local<x> are for running the
109+ entire workflow on a single node on the cluster, without
110+ the local prefix the slurm queueing system is used.
85111 Optional
86112 --help Show this message and exit
87113 --fastq Input fastqfile (default is bam but with fq as fileending)
0 commit comments