Hybrid Metagenomic Assembly, Binning, and GEMs Analysis Workflow

public public 1yr ago Version: Version 1 0 bookmarks

Workflow (hybrid) metagenomic assembly and binning + GEMs

Accepts both Illumina and Long reads (ONT/PacBio)

Workflow binnning https://workflowhub.eu/workflows/64?version=11 (optional)

  • Metabat2/MaxBin2/SemiBin
  • DAS Tool
  • CheckM
  • BUSCO
  • GTDB-Tk

Workflow Genome-scale metabolic models https://workflowhub.eu/workflows/372 (optional)

  • CarveMe (GEM generation)
  • MEMOTE (GEM test suite)
  • SMETANA (Species METabolic interaction ANAlysis)

Code Snippets

16
17
18
19
baseCommand: [pigz, -c]

arguments:
  - valueFrom: $(inputs.inputfile)
CWL From line 16 of bash/pigz.cwl
25
baseCommand: [ busco ]
26
baseCommand: [ carve ]
20
21
22
23
24
25
26
27
28
29
- entryname: script.sh
  entry: |-
    #!/bin/bash
    identifier=$1
    shift;
    echo "Model Mets Reactions Genes" > $identifier\_CarveMe_GEMstats.tsv
    for file in "$@"
    do
      bash /unlock/infrastructure/scripts/GEMstats.sh $file
    done >> $identifier\_CarveMe_GEMstats.tsv
CWL From line 20 of carveme/GEMstats.cwl
34
35
36
37
38
- entryname: script.sh
  entry: |-
    # !/bin/bash
    export CHECKM_DATA_PATH=/venv/checkm_data
    checkm lineage_wf $@
22
baseCommand: [ DAS_Tool ]
22
baseCommand: [ "Fasta_to_Contig2Bin.sh" ]
22
baseCommand: [ EukRep ]
18
19
20
21
22
23
- entryname: script.sh
  entry: |-
    #!/bin/bash
    export GTDBTK_DATA_PATH=$1
    shift;
    gtdbtk classify_wf $@
22
baseCommand: [ "run_MaxBin.pl" ]
22
baseCommand: [ memote ]
22
baseCommand: [aggregateBinDepths.pl]
23
24
25
26
27
baseCommand: [ metabat2 ]

arguments:
  - prefix: "--outFile"
    valueFrom: MetaBAT2_bins/$(inputs.identifier)_MetaBAT2_bin
23
24
25
26
27
28
baseCommand: [jgi_summarize_bam_contig_depths]

arguments:
  - position: 1
    prefix: '--outputDepth'
    valueFrom:  $(inputs.identifier)_contigDepths.tsv
38
baseCommand: ["python3", "/scripts/metagenomics/assembly_bins_readstats.py"]
41
baseCommand: ["python3", "/scripts/metagenomics/bins_summary.py"]
18
19
20
21
22
23
24
25
26
27
28
baseCommand: [ prodigal ]

arguments:
  # - valueFrom: "sco" # What is the sco format?
  #   prefix: "-f"
  - valueFrom: $(inputs.input_fasta.nameroot).prodigal
    prefix: "-o"
  - valueFrom: $(inputs.input_fasta.nameroot).prodigal.ffn
    prefix: "-d"
  - valueFrom: $(inputs.input_fasta.nameroot).prodigal.faa
    prefix: "-a"
24
baseCommand: [ SemiBin, single_easy_bin ]
24
baseCommand: [ smetana ]
12
baseCommand: [ raw_n50 ]
ShowHide 8 more snippets with no or duplicated tags.

Login to post a comment if you would like to share your experience with this workflow.

Do you know this workflow well? If so, you can request seller status , and start supporting this workflow.

Free

Created: 1yr ago
Updated: 1yr ago
Maitainers: public
URL: https://gitlab.com/m-unlock/cwl/-/blob/master/cwl/workflows/workflow_metagenomics_assembly.cwl
Name: hybrid-metagenomics-workflow
Version: Version 1
Badge:
workflow icon

Insert copied code into your website to add a link to this workflow.

Downloaded: 0
Copyright: Public Domain
License: None
  • Future updates

Related Workflows

cellranger-snakemake-gke
snakemake workflow to run cellranger on a given bucket using gke.
A Snakemake workflow for running cellranger on a given bucket using Google Kubernetes Engine. The usage of this workflow ...