VGP-meryldb workflow for finding the genomic complexity

public public 1yr ago Version: Version 1 0 bookmarks

This workflow collects the metrics on the properties of the genome under consideration by analyzing the k-mer frequencies. It provides information about the genomic complexity, such as the genome size and levels of heterozygosity and repeat content, as well about the data quality. It uses reads from two parental genomes to partition long reads from the offspring into haplotype-specific k-mer databases.

Inputs

  • Collection of Hifi long reads in FASTQ format
  • Paternal short-read Illumina sequencing reads in FASTQ format
  • Maternal short-read Illumina sequencing reads in FASTQ format

Outputs

  • Meryl databases of k-mer counts
    • Child
    • Paternal haplotype
    • Maternal haplotype
  • GenomeScope metrics of child and parental genomes
    • Linear plot
    • Log plot
    • Transformed linear plot
    • Transformed log plot
    • Summary
    • Model
    • Model parameteres

Login to post a comment if you would like to share your experience with this workflow.

Do you know this workflow well? If so, you can request seller status , and start supporting this workflow.

Free

Created: 1yr ago
Updated: 1yr ago
Maitainers: public
URL: https://github.com/iwc-workflows/VGP-meryldb-creation-trio
Name: vgp-meryldb-creation-trio-main
Version: Version 1
Badge:
workflow icon

Insert copied code into your website to add a link to this workflow.

Downloaded: 0
Copyright: Public Domain
License: None
  • Future updates

Related Workflows

cellranger-snakemake-gke
snakemake workflow to run cellranger on a given bucket using gke.
A Snakemake workflow for running cellranger on a given bucket using Google Kubernetes Engine. The usage of this workflow ...