Xenbase Simple Synteny Search Information

These synteny images are based on information from several sources.

GFFs provided by model organism databases or other resources:

Xenopus tropicalis from Xenbase.org: XENTR_10.0_Xenbase.gff3.gz

Xenopus laevis from Xenbase.org: XENLA_10.1_Xenbase.gff3.gz

Human (Homo sapiens) from NCBI: GCF_000001405.40_GRCh38.p14_genomic.gff.gz

Mouse (Mus musculus) from NCBI: GCF_000001635.27_GRCm39_genomic.gff.gz

Chick (Gallus gallus) from NCBI: GCF_016699485.2_bGalGal1.mat.broiler.GRCg7b_genomic.gff.gz

Methodology:

The synteny connections are based on a combination of explicit orthology assertions from Xenbase.org, drawn from expert curation and NCBI's Ortholog and Homologene data sets, and symbol-based matching using the gene symbols from the GFF files. A window of genes 5 upstream and 5 downstream from the gene of interest are extracted from the source GFFs for each species and stored in the CMAP format. An optimization step is then used to maximize the longest common subsequence (LCS) between all of the genes in the window for each species.

Sequence 1: A B C D E F G Sequence 2: A X B Y C Z F LCS: A B C F (4)

Optimization is performed by iteratively inverting the gene orders for each species and recalculating the overall LCS value across species. The optimized set is the orientation of all genomes giving the highest overall LCS. These optimized orientations are then used for rendering the images.

Caveats:

Not every gene in the Xenopus genomes is represented here, only protein coding genes with meaningful gene symbols, i.e. not LOC######, and identified orthologs. LOC genes will show up in simple synteny figures but cannot themselves be searched for. If you cannot find a specific gene of interest we suggest using other genes from the equivalent locus in another species to explore the region, if possible.

Script Availability:

The scripts for running the Xenbase Simple Synteny image generation pipeline are available on Gitlab (https://gitlab.com/Xenbase/bioinformatics/xenbasesimplesyntenyscripts). The images were produced using a modified version of the SimpleSynteny V1.4.0 'SyntenyDrawer.rb' command line tool.

The original SimpleSynteny tool is described in Veltri, D., Malapi-Wight, M. and Crouch J.A. SimpleSynteny: a web-based tool for visualization of microsynteny across multiple species. Nucleic Acids Res. 44(W1):W41-W45, 2016. doi:10.1093/nar/gkw330