Stacks RAD-seq de novo workflow

public public 1yr ago Version: Version 1 0 bookmarks

workflow-denovo-stacks

These workflows are part of a set designed to work for RAD-seq data on the Galaxy platform, using the tools from the Stacks program.Stacks: http://catchenlab.life.illinois.edu/stacks/

  • demultiplexed reads in fastq format, may be output from the QC workflow. Files are in a collection.
  • population map in text format

Steps and outputs

ustacks:

  • input reads go to ustacks.
  • ustacks assembles the reads into matching stacks (hypothetical alleles).
  • The outputs are in a collection called something like: Stacks2: ustacks on data 21, data 20, and others Loci and polymorphism. Click on this to see the files:
  • for each sample, assembled loci (tsv format), named e.g. sample_CAAC.tags
  • for each sample, model calls from each locus (tsv format), named e.g. sample_CAAC.snps
  • for each sample, haplotypes/alleles recorded from each locus (tsv format), named e.g. sample_CAAC.alleles
  • Please see sections 6.1 to 6.4 in https://catchenlab.life.illinois.edu/stacks/manual/#ufiles for a full description.

cstacks:

  • cstacks will merge stacks into a catalog of consensus loci.
  • The outputs are in a collection called something like Stacks2: cstacks on data 3, data 71, and others Catalog of loci. Click on this to see the three files, each in tsv format: catalog.tags catalog.snps catalog.alleles

sstacks:

  • sstacks will compare each sample to the loci in the catalog.
  • The outputs are in a collection called something like Stacks2: sstacks on data 3, data 76, and others Matches to the catalog.Click on this to see the files: There is one file for each sample, named e.g. sample_CAAC.matches, in tsv format.

tsv2bam:

  • Conversion to BAM format
  • Reads from each sample are now aligned to each locus, and the tsv2bam tool will convert this into a bam file for each sample.
  • The outputs are in a collection called something like Stacks2: tsv2bam on data 3, data 94, and others Matches to the catalog.Click on this to see the files: There is one file for each sample, named e.g sample_CAAC.matches, in BAM format.

gstacks:

  • Catalog of loci in fasta format
  • Variant calls in VCF format

populations:

  • Locus consensus sequences in fasta format
  • Snp calls, in VCF format
  • Haplotypes, in VCF format
  • Summary statistics

Login to post a comment if you would like to share your experience with this workflow.

Do you know this workflow well? If so, you can request seller status , and start supporting this workflow.

Free

Created: 1yr ago
Updated: 1yr ago
Maitainers: public
URL: https://github.com/AnnaSyme/workflow-denovo-stacks.git
Name: stacks-rad-seq-de-novo-workflow
Version: Version 1
Badge:
workflow icon

Insert copied code into your website to add a link to this workflow.

Downloaded: 0
Copyright: Public Domain
License: None
  • Future updates

Related Workflows

cellranger-snakemake-gke
snakemake workflow to run cellranger on a given bucket using gke.
A Snakemake workflow for running cellranger on a given bucket using Google Kubernetes Engine. The usage of this workflow ...