SnakeMake workflow for the processing of EMseq data

public public 1yr ago Version: v0.1 0 bookmarks

Code Snippets

20
21
22
23
shell:
    """
    bismark_genome_preparation --genomic_composition {input.genome_dir}
    """
45
46
47
48
49
50
shell:
    """
    bismark -p {threads} --nucleotide_coverage {REF_GENOME_DIR} -1 {input.read1} -2 {input.read2} --basename {wildcards.sample}_mapped_{REF_SHORT}_trim_bismark --output_dir results/02_MAPPED 2> {log.log1}
    mv results/02_MAPPED/{wildcards.sample}_mapped_{REF_SHORT}_trim_bismark_PE_report.txt {log.log2}
    mv results/02_MAPPED/{wildcards.sample}_mapped_{REF_SHORT}_trim_bismark_pe.nucleotide_stats.txt {log.log3}
    """
67
68
69
70
shell:
    """
    deduplicate_bismark --paired --bam {input.bam} --output_dir results/02_MAPPED 2> {log}
    """
93
94
95
96
shell:
    """
    bismark_methylation_extractor {params.extra} --gzip --paired-end --multicore {threads} --genome_folder {params.ref} -s {input.bam} --output results/03_METHYL_EX 2> {log}
    """
116
117
118
119
shell:
    """
    bismark2bedGraph --CX --dir results/04_COVERAGE -o {params.bedGraph} {input.methylex} 2> {log}
    """
24
25
26
27
shell:
    """
    fastqc {input.read} -t {threads} -f fastq --outdir logs/fastqc/raw
    """
46
47
48
49
shell:
    """
    bbduk.sh -Xmx8g in1={input.read1} in2={input.read2} out1={output.read1} out2={output.read2} stats={output.trim_stats} minlen=75 qtrim=rl trimq=20 ktrim=r k=25 mink=11 ref={params.adapter} hdist=1
    """
64
65
66
67
shell:
    """
    fastqc {input.read} -t {threads} -f fastq --outdir logs/fastqc/trimmed
    """
18
19
20
21
22
shell:
    """
    samtools sort {input.bam} > {output.sort}
    samtools index {output.sort}
    """
ShowHide 5 more snippets with no or duplicated tags.

Login to post a comment if you would like to share your experience with this workflow.

Do you know this workflow well? If so, you can request seller status , and start supporting this workflow.

Free

Created: 1yr ago
Updated: 1yr ago
Maitainers: public
URL: https://github.com/wegnerce/smk_emseq
Name: smk_emseq
Version: v0.1
Badge:
workflow icon

Insert copied code into your website to add a link to this workflow.

Downloaded: 0
Copyright: Public Domain
License: GNU General Public License v3.0
  • Future updates

Related Workflows

cellranger-snakemake-gke
snakemake workflow to run cellranger on a given bucket using gke.
A Snakemake workflow for running cellranger on a given bucket using Google Kubernetes Engine. The usage of this workflow ...