This repository contains analysis scripts and resources for processing Illumina sequencing data of Hepatitis E virus (HEV) patient samples.
The analysis is conducted using two different reference sequences: one from Sudan and another from Uganda. The respective workflows are implemented as independent pipelines in workflow/Snakefile_sudan
and workflow/Snakemake_uganda
. Results from each workflow are organized separately in the directories results_sudan
and results_uganda
.
The configuration file config/config.yaml
specifies the reference sequences used and the directories where the input data are located.
Use the files run_pipeline_sudan.sh
and run_pipeline_uganda.sh
for the execution of the pipelines on Slurm clusters.