Using Nextflow on the RCS compute service

Nextflow is a popular tool for running multi-stage computational pipelines. It’s a scalable alternative to writing shell scripts for co-ordinating complex workflows, particularly those involving data processing. It supports pipeline resumption and job queuing so is well-suited for use on the RCS compute service.

Existing workflows can be run mostly unmodified, if the following advice is followed:

For a full example, including a file highlighting the changes required to the official tutorial, please see this Gist. To run this example on the compute service simply clone and run it:

git clone https://gist.github.com/322369519b5dfd0195e3645d82bfe909.git nextflow-tutorial
cd nextflow-tutorial
qsub tutorial.pbs.sh

Further resources