Commit 07e28536 authored by Jorge Navarro Muñoz

Expanded Readme

parent 766696bb
BiG-SCAPE (Biosynthetic Gene Similarity Clustering and Prospecting Engine) is a Python script that calculates a distance matrix between groups of genes. In particular, BiG-SCAPE has been built with the aim of analyzing **Biosynthetic Gene Clusters**, which encode the pathways for Secondary Metabolites.
**BiG-SCAPE** (Biosynthetic Gene Similarity Clustering and Prospecting Engine) is a software package, written in Python, that constructs sequence similarity networks of Biosynthetic Gene Clusters (BGCs) and groups them into Gene Cluster Families (GCFs). BiG-SCAPE does this by rapidly calculating a distance matrix between gene clusters based on a comparison of their protein domain content, order, copy number and sequence identity.
BiG-SCAPE works by predicting and comparing conserved domains found in the proteins encoded in the gene clusters.
As input, BiG-SCAPE takes GenBank files from the output of antiSMASH with BGC predictions, as well as reference BGCs from the MIBiG repository. As output, BiG-SCAPE generates tab-delimited output files, as well as a rich HTML visualization that includes multi-locus phylogenies of each Gene Cluster Family made using CORASON.
This comparison yields a distance matrix that can be used to group similar BGCs automatically.
In principle, BiG-SCAPE can also be used on any other gene clusters, such as pathogenicity islands, secretion system-encoding gene clusters, or even whole viral genomes.
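To make the idea of a domain-based distance matrix concrete, here is a minimal sketch in Python. It is not BiG-SCAPE's actual metric: it assumes each gene cluster is reduced to a set of domain labels and uses plain Jaccard distance, ignoring domain order, copy number and sequence identity. The function names, the domain labels and the `cutoff` value are all hypothetical and chosen only for illustration.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """Return 1 - |A ∩ B| / |A ∪ B| for two collections of domain labels."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0  # two empty clusters are treated as identical
    return 1.0 - len(a & b) / len(a | b)


def distance_matrix(clusters):
    """Build a symmetric pairwise distance lookup keyed by (name, name)."""
    names = sorted(clusters)
    dist = {(n, n): 0.0 for n in names}
    for x, y in combinations(names, 2):
        d = jaccard_distance(clusters[x], clusters[y])
        dist[(x, y)] = dist[(y, x)] = d
    return names, dist


def group_by_cutoff(names, dist, cutoff):
    """Group clusters into connected components of pairs with distance <= cutoff.

    This is simple single-linkage grouping via union-find, an assumption
    made for this sketch rather than the clustering BiG-SCAPE performs.
    """
    parent = {n: n for n in names}

    def find(n):
        while parent[n] != n:
            parent[n] = parent[parent[n]]  # path compression
            n = parent[n]
        return n

    for x, y in combinations(names, 2):
        if dist[(x, y)] <= cutoff:
            parent[find(x)] = find(y)

    groups = {}
    for n in names:
        groups.setdefault(find(n), []).append(n)
    return sorted(groups.values())


# Hypothetical clusters described only by placeholder domain labels.
example = {
    "bgc_a": ["domA", "domB", "domC"],
    "bgc_b": ["domA", "domB", "domD"],
    "bgc_c": ["domE", "domF"],
}

names, dist = distance_matrix(example)
families = group_by_cutoff(names, dist, cutoff=0.5)
print(families)
```

With these toy inputs, `bgc_a` and `bgc_b` share two of four distinct labels, so they fall within the cutoff and are grouped together, while `bgc_c` shares nothing with either and stays on its own.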
Learn more about BiG-SCAPE in the wiki.
![](BiG-SCAPE CORASON Workflow.png)
![BiG-SCAPE/CORASON workflow](BiG-SCAPE CORASON Workflow.png)