Readme

Static Badge Static Badge Static Badge

Table of Contents

Overview

The biomodal duet multiomics solution bioinformatics pipeline is intended for processing FASTQ files that have been generated using the duet multiomics solution library preparation kits. The biomodal command line interface (CLI) is the recommended command line tool to install, test, and run the biomodal duet pipeline to analyse all your data.

The duet multiomics solution includes:

  • Standard and bespoke trimming of duet +modC and duet evoC FASTQ files
  • FASTQ resolution to convert raw paired-end duet +modC and duet evoC reads into resolved, error-suppressed 4-base genomic reads with accompanying epigenomic information
  • Reference genome and control alignment using BWA_MEM
  • Lane-merging of aligned BAM files
  • Forking of aligned genome reads and control reads into separate BAM files and analysis pathways
  • Deduplication of the genome and long control BAM files
  • Modified cytosine quantification in CpG context
  • Optional modified cytosine quantification in CHG and CHH contexts
  • Germline variant calling with optional joint variant calling and allele-specific methylation calling
  • Genetic accuracy analysis using controls
  • Analysis and generation of quality metrics associated with the genome-aligned reads and the control-aligned reads
  • Generation of summary reports in Excel and HTML formats

The biomodal pipeline and CLI utilises Nextflow as the orchestration tool that will leverage the capabilities of your compute platform to process your data through the duet pipeline. Nextflow will act as an orchestrator performing the following tasks:

  • Launching Virtual Machines or HPC nodes
  • Copying input files to the virtual machines or HPC nodes
  • Downloading and running Docker containers with appropriate software dependencies
  • Executing analyses on the virtual machines or HPC nodes
  • Transferring outputs from analyses to local or cloud storage
  • Organise output files into a convenient directory structure
  • Coordinating the channelling of outputs from one stage of analysis to the next

The following instructions are intended to be executed at the command line with Bash on a Linux platform, or via a Cloud Shell in a browser. If you prefer to use the GUI (cloud console) through the browser, searching the command line instructions will lead you to the platform specific documentation with the steps that need to be taken in the relevant cloud console.

The biomodal CLI and duet pipeline is not supported on Windows platforms. We recommend you use Ubuntu 22.04 or CentOS/RHEL 7. Other Linux distributions may work fine, but we have not specifically tested them. We are looking to test with other Linux distributions in the near future.

(back to top)

Support

Feel free to contact us at support@biomodal.com.
If your inquiry is related to the CLI or duet pipeline software, please include the output of the biomodal info command in your inquiry.

(back to top) | (Next)

Cambridge Epigenetix is now biomodal