Keywords and Expertise

Use keywords to characterize workflows and forum posts, and reach out to sellers with relevant expertise

Categories

All 43970
Topic 271
Operation 1063
Data 1872
Tool 39364
Format 605
Input 458
Output 337

operation / edam

Alignment

Compare two or more entities, typically the sequence or structure (or derivatives) of macromolecules, to identify equivalent subunits.

Synonyms: Alignment construction, Alignment generation

operation / edam

Mapping

Map properties to positions on an biological entity (typically a molecular sequence or structure), or assemble such an entity from constituent parts.

Synonyms: Cartography

operation / edam

Clustering

Group together some data entities on the basis of similarities such that entities in the same group (cluster) are more similar to each other than to those in other groups (clusters).

operation / edam

Analysis

Apply analytical methods to existing data of a specific type.|This excludes non-analytical methods that read and write the same basic type of data (for that, see 'Data handling').

operation / edam

Base-calling

Identify base (nucleobase) sequence from a fluorescence 'trace' data generated by an automated DNA sequencer.

Synonyms: Base calling, Phred base calling, Phred base-calling

operation / edam

Quantification

Counting and measuring experimentally determined observations into quantities.

Synonyms: Quantitation

operation / edam

Annotation

Annotate an entity (typically a biological or biomedical database entity) with terms from a controlled vocabulary.|This is a broad concept and is used a placeholder for other, more specific concepts.

operation / edam

Calculation

Mathematical determination of the value of something, typically a properly of a molecule.

operation / edam

Scaffolding

Link together a non-contiguous series of genomic sequences into a scaffold, consisting of sequences separated by gaps of known length. The sequences that are linked are typically typically contigs; contiguous sequences corresponding to read overlaps.|Scaffold may be positioned along a chromosome physical map to create a "golden path".

Synonyms: Scaffold construction, Scaffold generation

operation / edam

Filtering

Filter a set of files or data items according to some property.

library: python:filtering

Synonyms: rRNA filtering, Sequence filtering

11 3

operation / edam

Cross-assembly

Construction of a single sequence assembly of all reads from different samples, typically as part of a comparative metagenomic analysis.

Synonyms: Sequence assembly (cross-assembly)

operation / edam

Variant calling

Detect, identify and map mutations, such as single nucleotide polymorphisms, short indels and structural variants, in multiple DNA sequences. Typically the alignment and comparison of the fluorescent traces produced by DNA sequencing hardware, to study genomic alterations.|Somatic variant calling is the detection of variations established in somatic cells and hence not inherited as a germ line variant.|Variant detection|Methods often utilise a database of aligned reads.

Synonyms: Variant mapping, Mutation detection, Genome variant detection, Somatic variant calling, de novo mutation detection, Germ line variant calling, Allele calling, Exome variant detection

operation / edam

Sequencing quality control

Raw sequence data quality control.|Analyse raw sequence data from a sequencing pipeline and identify (and possiby fix) problems.

Synonyms: Sequencing QC, Sequencing quality assessment

operation / edam

Genome assembly

The process of assembling many short DNA sequences together such thay they represent the original chromosomes from which the DNA originated.

Synonyms: Sequence assembly (genome assembly), Genomic assembly, Breakend assembly

operation / edam

Peak calling

Identify putative protein-binding regions in a genome sequence from analysis of Chip-sequencing data or ChIP-on-chip data.|Chip-sequencing combines chromatin immunoprecipitation (ChIP) with massively parallel DNA sequencing to generate a set of reads, which are aligned to a genome sequence. The enriched areas contain the binding sites of DNA-associated proteins. For example, a transcription factor binding site. ChIP-on-chip in contrast combines chromatin immunoprecipitation ('ChIP') with microarray ('chip'). "Peak-pair calling" is similar to "Peak calling" in the context of ChIP-exo.

Synonyms: Protein binding peak detection, Peak-pair calling

operation / edam

Sequence alignment

Align (identify equivalent sites within) molecular sequences.|See also "Read mapping"

Synonyms: Sequence alignment construction, Sequence alignment generation, Constrained sequence alignment, Multiple sequence alignment (constrained), Sequence alignment (constrained), Consensus-based sequence alignment

operation / edam

Sequence read processing

The processing of reads from high-throughput sequencing machines.

operation / edam

Expression analysis

Process (read and/or write) expression data from experiments measuring molecules (e.g. omics data), including analysis of one or more expression profiles, typically to interpret them in functional terms.

Synonyms: Expression data analysis, Metagenomic inference, Protein expression analysis, Gene expression regulation analysis, Microarray data analysis, Gene expression analysis, Gene expression data analysis

operation / edam

Sequence clustering

Build clusters of similar sequences, typically using scores from pair-wise alignment or other comparison of the sequences.|The clusters may be output or used internally for some other purpose.

Synonyms: Sequence cluster construction, Sequence cluster generation

operation / edam

Sequence analysis

Analyse one or more known molecular sequences.

Synonyms: Sequence analysis (general)

operation / edam

Pathway analysis

Generate, process or analyse a biological pathway.

Synonyms: Biological pathway analysis, Pathway simulation, Pathway modelling, Pathway comparison, Functional pathway analysis, Pathway prediction, Biological pathway modelling, Biological pathway prediction

operation / edam

Sequence annotation

Annotate a molecular sequence record with terms from a controlled vocabulary.

operation / edam

RNA-Seq analysis

Analyze data from RNA-seq experiments.

operation / edam

Protein-ligand docking

Model protein-ligand (for example protein-peptide) binding using comparative modelling or other techniques.|Virtual screening is used in drug discovery to search libraries of small molecules in order to identify those molecules which are most likely to bind to a drug target (typically a protein receptor or enzyme).|Methods aim to predict the position and orientation of a ligand bound to a protein receptor or enzyme.

Synonyms: Ligand-binding simulation, Protein-peptide docking

operation / edam

Genome analysis

Detects chimeric sequences (chimeras) from a sequence alignment.

operation / edam

Genome annotation

Annotate a genome sequence with terms from a controlled vocabulary.

Synonyms: Functional genome annotation, Structural genome annotation, Metagenome annotation

operation / edam

Variant filtering

Variant filtering is used to eliminate false positive variants based for example on base calling quality, strand and position information, and mapping info.

operation / edam

Transcriptome assembly

Infer a transcriptome sequence by analysis of short sequence reads.

operation / edam

Sequence assembly

Combine (align and merge) overlapping fragments of a DNA sequence to reconstruct the original sequence.|For example, assemble overlapping reads from paired-end sequencers into contigs (a contiguous sequence corresponding to read overlaps). Or assemble contigs, for example ESTs and genomic DNA fragments, depending on the detected fragment overlaps.

Synonyms: Metagenomic assembly, Sequence assembly editing

operation / edam

Methylation analysis

Analyse cytosine methylation states in nucleic acid sequences.

Synonyms: Methylation profile analysis

operation / edam

Modelling and simulation

Model or simulate some biological entity or system, typically using mathematical techniques including dynamical systems, statistical models, differential equations, and game theoretic models.

Synonyms: Mathematical modelling

operation / edam

Haplotype mapping

Infer haplotypes, either alleles at multiple loci that are transmitted together on the same chromosome, or a set of single nucleotide polymorphisms (SNPs) on a single chromatid that are statistically associated.|Haplotype inference can help in population genetic studies and the identification of complex disease genes, , and is typically based on aligned single nucleotide polymorphism (SNP) fragments. Haplotype comparison is a useful way to characterize the genetic variation between individuals. An individual's haplotype describes which nucleotide base occurs at each position for a set of common SNPs. Tools might use combinatorial functions (for example parsimony) or a likelihood function or model with optimisation such as minimum error correction (MEC) model, expectation-maximisation algorithm (EM), genetic algorithm or Markov chain Monte Carlo (MCMC).

Synonyms: Haplotype inference, Haplotype map generation, Haplotype reconstruction

operation / edam

Gene expression profiling

The measurement of the activity (expression) of multiple genes in a cell, tissue, sample etc., in order to get an impression of biological function.|Gene expression profiling generates some sort of gene expression profile, for example from microarray data.

Synonyms: Gene expression profile construction, Gene expression profile generation, Gene expression quantification, Functional profiling, Gene transcription profiling, Feature expression analysis, RNA profiling, mRNA profiling, Protein profiling, Non-coding RNA profiling

operation / edam

Simulation analysis

Analyse flexibility and motion in protein structure.|Use this concept for analysis of flexible and rigid residues, local chain deformability, regions undergoing conformational change, molecular vibrations or fluctuational dynamics, domain motions or other large-scale structural transitions in a protein structure.

Synonyms: Trajectory analysis, CG analysis, MD analysis, Protein Dynamics Analysis, Protein motion prediction, Nucleic Acid Dynamics Analysis, Protein flexibility prediction, Protein flexibility and motion analysis

operation / edam

Ligand-binding site prediction

Predict or detect ligand-binding sites in proteins; a region of a protein which reversibly binds a ligand for some biochemical purpose, such as transport or regulation of protein function.

Synonyms: Ligand-binding site detection, Peptide-protein binding prediction

operation / edam

Protein-protein interaction analysis

Analyse the interactions of proteins with other proteins.

Synonyms: Protein interaction analysis, Protein interaction raw data analysis, Protein interaction simulation

operation / edam

Taxonomic classification

Classifiication (typically of molecular sequences) by assignment to some taxonomic hierarchy.

Synonyms: Taxonomy assignment, Taxonomic profiling

operation / edam

Multiple sequence alignment

Align more than two molecular sequences.|This includes methods that use an existing alignment, for example to incorporate sequences into an alignment, or combine several multiple alignments into a single, improved alignment.

Synonyms: Multiple alignment

operation / edam

Gene prediction

Detect, predict and identify genes or components of genes in DNA sequences, including promoters, coding regions, splice sites, etc.|Methods for gene prediction might be ab initio, based on phylogenetic comparisons, use motifs, sequence features, support vector machine, alignment etc.

Synonyms: Gene finding, Gene calling, Whole gene prediction

operation / edam

Genetic variation analysis

Analyse a genetic variation, for example to annotate its location, alleles, classification, and effects on individual transcripts predicted for a gene model.|Genetic variation annotation provides contextual interpretation of coding SNP consequences in transcripts. It allows comparisons to be made between variation data in different populations or strains for the same transcript.

Synonyms: Genetic variation annotation, Variant analysis, Sequence variation analysis, Transcript variant analysis

operation / edam

Phylogenetic analysis

Analyse an existing phylogenetic tree or trees, typically to detect features or make predictions.|Phylgenetic modelling is the modelling of trait evolution and prediction of trait values using phylogeny as a basis.

Synonyms: Phylogenetic tree analysis, Phylogenetic modelling

operation / edam

Indel detection

Identify insertion, deletion and duplication events from a sequence alignment.|Tools might use a genetic algorithm, quartet-mapping, bootscanning, graphical methods, random forest model and so on.

Synonyms: Sequence alignment analysis (indel detection), Indel discovery

operation / edam

Read pre-processing

Pre-process sequence reads to ensure (or improve) quality and reliability.|For example process paired end reads to trim low quality ends remove short sequences, identify sequence inserts, detect chimeric reads, or remove low quality sequnces including vector, adaptor, low complexity and contaminant sequences. Sequences might come from genomic DNA library, EST libraries, SSH library and so on.

Synonyms: Sequence read pre-processing

operation / edam

Gene functional annotation

Annotate one or more sequences with functional information, such as cellular processes or metaobolic pathways, by reference to a controlled vocabulary - invariably the Gene Ontology (GO).

Synonyms: Sequence functional annotation

operation / edam

Statistical calculation

Perform a statistical data operation of some type, e.g. calibration or validation.

Synonyms: Significance testing, Statistical test, Statistical testing, Statistical analysis, Hypothesis testing, Gibbs sampling, Expectation maximisation, Omnibus test

operation / edam

Genetic mapping

Generate a genetic (linkage) map of a DNA sequence (typically a chromosome) showing the relative positions of genetic markers based on estimation of non-physical distances.|Mapping involves ordering genetic loci along a chromosome and estimating the physical distance between loci. A genetic map shows the relative (not physical) position of known genes and genetic markers.|This includes mapping of the genetic architecture of dynamic complex traits (functional mapping), e.g. by characterisation of the underlying quantitative trait loci (QTLs) or nucleotides (QTNs).

Synonyms: Genetic map construction, Genetic cartography, Functional mapping, Linkage mapping, Genetic map generation, QTL mapping

operation / edam

Regression analysis

A statistical calculation to estimate the relationships among variables.

Synonyms: Regression

operation / edam

Active site prediction

Predict or detect active sites in proteins; the region of an enzyme which binds a substrate bind and catalyses a reaction.

Synonyms: Active site detection