Open
Description
I am trying to run a configuration where the input is a tuple of paths, some of which are optional. The pattern in this repository works for separate path inputs (or, so says the author), but extending it to my use case results in the error Not a valid path value: NO_FILE
.
In this simple example demonstrating the issue, the input is a CSV file defining RNA-seq sample names, forward and reverse read fastqs, and a STAR genome index to align them to. The reverse read is optional.
My question to the community is how I can work around this issue.
Repository setup
nextflow.config
params {
manifest = null // csv file name,R1,R2?,index
outdir = "outs" // save output bams
}
profiles {
conda {
conda.enabled = true
process.conda = "star samtools"
}
}
main.nf
process MyProcess {
publishDir outdir, mode: "copy"
input:
tuple val(name), path(R1), path(R2), path(index)
path outdir
output:
path "${name}_Aligned.out.sortedByCoord.bam"
path "${name}_Aligned.out.sortedByCoord.bam.bai"
script:
R2_arg = R2.name == "NO_FILE" ? "" : R2
"""
STAR --readFilesIn $R1 $R2_arg --readFilesCommand gunzip -c \
--genomeDir $index --outSAMtype BAM SortedByCoordinate \
--outFileNamePrefix ${name}_
samtools index ${name}_Aligned.out.sortedByCoord.bam
"""
}
workflow {
MyProcess(
file(manifest).read().splitCsv(header: ["name", "R1", "R2", "index"]).map{it.R2 = it.R2 ?: "NO_FILE"},
params.outdir
)
}
Run
nextflow run main.nf [-profile conda] --manifest path/to/manifest.csv
Metadata
Metadata
Assignees
Labels
No labels