pyIPSA: Integrative Splicing Analysis Pipeline


Integrative Pipeline for Splicing Analysis

Installation & Run

Step 1: Obtain a copy of this workflow

Clone this repository to your local system, into the place where you want to perform the data analysis.
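The clone-and-run steps can be sketched as follows (a minimal sketch: the `snakemake` flags shown are generic Snakemake CLI usage, assumed rather than taken from the pyIPSA documentation):

```shell
# Clone the workflow into your analysis directory
git clone https://github.com/pervouchinelab/pyIPSA.git
cd pyIPSA

# Dry-run to preview the planned jobs, then execute with 8 cores
# (--use-conda assumes the rules ship conda environments; check the README)
snakemake -n
snakemake --cores 8 --use-conda
```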

Code Snippets

Download and decompress the reference genome:

```python
shell:
    """
    wget -O {output.genome}.gz {params.url}
    gunzip {output.genome}.gz
    """
```
Index the BAM file with pysam:

```python
shell:
    """python3 -c 'import pysam; pysam.index("{input.bam}")'"""
```
Count splice-junction reads in each BAM file:

```python
shell:
    "python3 -m workflow.scripts.count_junctions "
    "-i {input.bam} "
    "-k {input.known} "
    "-o {output.junctions} "
    "-l {output.library_stats} "
    "{params.primary} {params.unique} "
    "-t {threads}"
```
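`count_junctions` detects spliced reads via the N (skipped-region) operations in their CIGAR strings. A minimal standalone sketch of that extraction (illustrative only; the real logic lives in `workflow/scripts/count_junctions.py` and works through pysam):

```python
import re

def junctions_from_cigar(pos, cigar):
    """Yield (intron_start, intron_end) for each N (skipped region)
    in a CIGAR string; pos is the 1-based leftmost mapping position."""
    ref = pos
    for length, op in re.findall(r"(\d+)([MIDNSHP=X])", cigar):
        length = int(length)
        if op == "N":
            # the intron spans [ref, ref + length - 1] on the reference
            yield ref, ref + length - 1
            ref += length
        elif op in "MD=X":  # operations that consume the reference
            ref += length

# Example: 50M, a 100-nt intron, then 50M
print(list(junctions_from_cigar(1000, "50M100N50M")))  # -> [(1050, 1149)]
```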
Gather per-library statistics into one table:

```python
shell:
    "python3 -m workflow.scripts.gather_library_stats "
    "{OUTPUT_DIR}/J1 "
    "-o {output.tsv}"
```
Aggregate junction counts, applying offset and intron-length bounds:

```python
shell:
    "python3 -m workflow.scripts.aggregate_junctions "
    "-i {input.junctions} "
    "-s {input.library_stats} "
    "-o {output.aggregated_junctions} "
    "--min_offset {params.min_offset} "
    "--min_intron_length {params.min_intron_length} "
    "--max_intron_length {params.max_intron_length}"
```
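The offset and intron-length thresholds passed above can be sketched as a predicate (the function name and default values here are illustrative, not pyIPSA's actual schema):

```python
def keep_junction(start, end, offset, min_offset=4,
                  min_intron_length=50, max_intron_length=500_000):
    """Apply thresholds like those passed to aggregate_junctions.
    offset = distance of the splice site from the read end."""
    intron_length = end - start + 1
    return (offset >= min_offset
            and min_intron_length <= intron_length <= max_intron_length)

print(keep_junction(1000, 1099, offset=10))  # True: 100-nt intron, good offset
print(keep_junction(1000, 1019, offset=10))  # False: 20-nt intron is too short
```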
Annotate junctions against known splice junctions and the genome sequence:

```python
shell:
    "python3 -m workflow.scripts.annotate_junctions "
    "-i {input.aggregated_junctions} "
    "-k {input.known_sj} "
    "-f {input.genome} "
    "-o {output.annotated_junctions}"
```
Assign a strand to each junction:

```python
shell:
    "python3 -m workflow.scripts.choose_strand "
    "-i {input.annotated_junctions} "
    "-r {input.ranked_list} "
    "-o {output.stranded_junctions} "
    "-s {output.junction_stats}"
```
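`choose_strand` assigns strands using a ranked list of evidence sources. One classic signal is the intron-boundary dinucleotide pair: the canonical GT..AG donor/acceptor, read on the opposite strand, appears as CT..AC. A hypothetical sketch of that single rule (not the script's actual ranking logic):

```python
def strand_from_dinucleotides(donor, acceptor):
    """Infer strand from intron-boundary dinucleotides (illustrative):
    GT..AG -> '+', its reverse complement CT..AC -> '-'."""
    if (donor, acceptor) == ("GT", "AG"):
        return "+"
    if (donor, acceptor) == ("CT", "AC"):
        return "-"
    return "."  # undetermined by this signal alone

print(strand_from_dinucleotides("GT", "AG"))  # -> +
```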
Collect the per-replicate junction statistics into a single TSV (a `run:` block; `defaultdict`, `Path`, and `pandas` as `pd` are imported at the top of the Snakefile):

```python
run:
    d = defaultdict(list)
    for replicate in input.junction_stats:
        p = Path(replicate)
        name = Path(p.stem).stem  # strip two suffixes to get the replicate name
        with p.open("r") as f:
            d["replicate"].append(name)
            for line in f:
                if line.startswith("-"):
                    break
                left, right = line.strip().split(": ")
                d[left].append(right)
    df = pd.DataFrame(d)
    if not df.empty:
        df = df.sort_values(by="replicate")  # sort_values returns a new frame
    df.to_csv(output.tsv, index=False, sep="\t")
```
Filter junctions by entropy, total count, and optionally GT/AG dinucleotides:

```python
shell:
    "python3 -m workflow.scripts.filter "
    "-i {input.stranded_junctions} "
    "-e {params.entropy} "
    "-c {params.total_count} "
    "{params.gtag} "
    "-o {output.filtered_junctions}"
```
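The entropy filter typically scores how evenly a junction's supporting reads are spread over alignment offsets: reads stacked at a single offset (a PCR-duplication signature) give an entropy near zero. A sketch of that score (an assumed definition; the exact one is in `workflow/scripts/filter.py`):

```python
from math import log2

def offset_entropy(counts):
    """Shannon entropy (bits) of the offset distribution of reads
    supporting a junction; higher = support spread over many offsets."""
    total = sum(counts)
    return sum(-(c / total) * log2(c / total) for c in counts if c > 0)

print(round(offset_entropy([5, 5, 5, 5]), 2))  # -> 2.0 (uniform support)
print(offset_entropy([20]))                    # -> 0.0 (single offset)
```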
Merge stranded junctions across samples:

```python
shell:
    "python3 -m workflow.scripts.merge_junctions "
    "{input.stranded_junctions} "
    "-o {output.merged_junctions}"
```
Count poly(A) reads:

```python
shell:
    "python3 -m workflow.scripts.count_polyA "
    "-i {input.bam} "
    "-o {output.polyA} "
    "{params.primary} {params.unique} "
    "-t {threads}"
```
Aggregate poly(A) counts with a minimum-overhang threshold:

```python
shell:
    "python3 -m workflow.scripts.aggregate_polyA "
    "-i {input.polyA} "
    "-s {input.library_stats} "
    "-o {output.aggregated_polyA} "
    "--min_overhang {params.min_overhang}"
```
Count reads at splice sites (pooled):

```python
shell:
    "python3 -m workflow.scripts.count_sites "
    "-i {input.bam} "
    "-j {input.junctions} "
    "-s {input.stats} "
    "-o {output.pooled_sites} "
    "{params.primary} {params.unique} "
    "-t {threads}"
```
Aggregate pooled site counts:

```python
shell:
    "python3 -m workflow.scripts.aggregate_sites "
    "-i {input.sites} "
    "-s {input.stats} "
    "-o {output.aggregated_pooled_sites} "
    "-m {params.min_offset}"
```
Filter pooled sites by entropy and total count:

```python
shell:
    "python3 -m workflow.scripts.filter "
    "-i {input.aggregated_pooled_sites} "
    "--sites "
    "-e {params.entropy} "
    "-c {params.total_count} "
    "-o {output.filtered_pooled_sites}"
```
Count reads at splice sites (per sample):

```python
shell:
    "python3 -m workflow.scripts.count_sites "
    "-i {input.bam} "
    "-j {input.junctions} "
    "-s {input.stats} "
    "-o {output.sites} "
    "{params.primary} {params.unique} "
    "-t {threads}"
```
Aggregate per-sample site counts:

```python
shell:
    "python3 -m workflow.scripts.aggregate_sites "
    "-i {input.sites} "
    "-s {input.stats} "
    "-o {output.aggregated_sites} "
    "-m {params.min_offset}"
```
Filter per-sample sites by entropy and total count:

```python
shell:
    "python3 -m workflow.scripts.filter "
    "-i {input.aggregated_sites} "
    "--sites "
    "-e {params.entropy} "
    "-c {params.total_count} "
    "-o {output.filtered_sites}"
```
Compute splicing rates from filtered junctions and per-sample sites:

```python
shell:
    "python3 -m workflow.scripts.compute_rates "
    "-j {input.filtered_junctions} "
    "-s {input.filtered_sites} "
    "-o {output.rates}"
```
Compute splicing rates from filtered junctions and pooled sites:

```python
shell:
    "python3 -m workflow.scripts.compute_rates "
    "-j {input.filtered_junctions} "
    "-s {input.filtered_pooled_sites} "
    "-o {output.rates}"
```
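`compute_rates` combines junction (spliced) and site (unspliced) read counts into splicing rates. As an illustration of the kind of quantity involved (the exact definitions live in `workflow/scripts/compute_rates.py`), a simple spliced-fraction looks like:

```python
def splicing_rate(junction_count, site_count):
    """Fraction of reads supporting the spliced form at a site.
    Illustrative only; see workflow/scripts/compute_rates.py for
    the definitions pyIPSA actually uses."""
    total = junction_count + site_count
    return junction_count / total if total else float("nan")

print(splicing_rate(90, 10))  # -> 0.9 (mostly spliced)
print(splicing_rate(0, 40))   # -> 0.0 (unspliced/retained)
```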


URL: https://github.com/pervouchinelab/pyIPSA
Copyright: Public Domain
License: None