Aquatic Symbiosis Genomics (ASG) Project

The Aquatic Symbiosis Genomics (ASG) Project is a global collaboration to generate high-quality genome sequences for a wide range of eukaryotes and their microbial symbionts. Launched under the Symbiosis in Aquatic Systems Initiative of the Gordon and Betty Moore Foundation, the ASG Project brings together researchers from across the globe who hope to use these reference genomes to augment and extend their analyses of the dynamics, mechanisms and environmental importance of symbiosis. Applying large-scale, high-throughput sequencing and assembly technologies, the ASG collaboration aims to assemble and annotate the genomes of 500 symbiotic organisms, both the “hosts” and the microbial symbionts with which they associate. These data will be released openly to benefit all who work on symbiosis, from conservation geneticists to those interested in the origin of the eukaryotic cell.

At Ensembl, we annotate protein-coding and non-coding RNA gene structures using re-engineered versions of our Gene Annotation System (Aken et al., 2017), optimised for vertebrates and for non-vertebrates. When a species lacks transcriptomic data, we run BRAKER2 in its default protein mode to generate hint-guided ab initio predictions of protein-coding genes (see the blog post for more information). After QC, genomes and annotations are made available via our FTP site (see table below) before being made available in the Ensembl Genome Browser.

| Species | Accession | Annotation method | Annotation | Proteins | Transcripts | Softmasked genome | Repeat library | Other data | View in browser | BUSCO completeness | Alternate haplotype |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Agelas oroides | GCA_949130485.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | Repeatmodeler | FTP dumps | rapid.ensembl.org | | |
| Amphiduros pacificus | GCA_949316495.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | Repeatmodeler | FTP dumps | rapid.ensembl.org | | |
| Aplysina aerophoba | GCA_949841015.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Bathymodiolus brooksi | GCA_963680875.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Bathymodiolus septemdierum | GCA_963383655.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Blastomussa wellsi | GCA_947652115.1 | BRAKER2 | GTF, GFF3 | FASTA | FASTA | FASTA | Repeatmodeler | FTP dumps | rapid.ensembl.org | | |
| Chondrosia reniformis | GCA_947172415.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | Repeatmodeler | FTP dumps | rapid.ensembl.org | | |
| Corticium candelabrum | GCA_963422355.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Eunapius fragilis | GCA_963681505.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Fragum fragum | GCA_946902895.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | Repeatmodeler | FTP dumps | rapid.ensembl.org | BUSCO | |
| Fragum sueziense | GCA_963680895.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Galaxea fascicularis | GCA_948470475.1 | BRAKER2 | GTF, GFF3 | FASTA | FASTA | FASTA | Repeatmodeler | FTP dumps | rapid.ensembl.org | | |
| Halichondria panicea | GCA_963675165.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Halisarca caerulea | GCA_963170055.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Lamellibrachia columna | GCA_963662155.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Magelona johnstoni | GCA_963942565.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Montipora capitata | GCA_949126865.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | Repeatmodeler | FTP dumps | rapid.ensembl.org | | rapid.ensembl.org |
| Montipora capitata (alternate haplotype) | GCA_949126885.1 | BRAKER2 | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Oscarella lobularis | GCA_947507565.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | Repeatmodeler | FTP dumps | rapid.ensembl.org | | |
| Porites lutea | GCA_958299795.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | rapid.ensembl.org |
| Porites lutea (alternate haplotype) | GCA_958299805.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Protula sp. h YS-2021 | GCA_949752745.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Ricordea florida (alternate haplotype) | GCA_949710055.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Sepiola atlantica | GCA_963556195.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Sepioteuthis lessoniana | GCA_963585895.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Spongilla lacustris | GCA_949361645.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | Repeatmodeler | FTP dumps | rapid.ensembl.org | | |
| Tridacna crocea | GCA_943736015.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | Repeatmodeler | FTP dumps | rapid.ensembl.org | BUSCO | |
| Tridacna derasa | GCA_963210305.1 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
| Tridacna gigas | GCA_945859785.2 | Ensembl Genebuild | GTF, GFF3 | FASTA | FASTA | FASTA | Repeatmodeler | FTP dumps | rapid.ensembl.org | BUSCO | |
| Xestospongia muta | GCA_963693285.1 | BRAKER2 | GTF, GFF3 | FASTA | FASTA | FASTA | | FTP dumps | rapid.ensembl.org | | |
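
For readers who download one of the GTF annotation files listed above, the following is a minimal sketch of how the number of genes, transcripts and other features might be tallied. It relies only on the standard GTF layout (nine tab-separated columns, with the feature type in the third column). The function names and the example records are illustrative assumptions, not part of any Ensembl tool, and the records are made up rather than taken from an ASG annotation.

```python
import gzip
from collections import Counter
from typing import Iterable


def count_gtf_features(lines: Iterable[str]) -> Counter:
    """Tally the feature type (column 3) of every record in a GTF stream."""
    counts = Counter()
    for line in lines:
        # Skip blank lines and header/comment lines, which start with '#'.
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        # A well-formed GTF record has exactly nine tab-separated columns.
        if len(fields) != 9:
            continue
        counts[fields[2]] += 1
    return counts


def count_gtf_file(path: str) -> Counter:
    """Open a plain or gzip-compressed GTF file and tally its feature types."""
    opener = gzip.open if path.endswith(".gz") else open
    with opener(path, "rt") as handle:
        return count_gtf_features(handle)


# Made-up records for illustration only (not real ASG annotation).
example = [
    "#!genome-build example\n",
    'chr1\tensembl\tgene\t100\t900\t.\t+\t.\tgene_id "G1";\n',
    'chr1\tensembl\ttranscript\t100\t900\t.\t+\t.\tgene_id "G1"; transcript_id "T1";\n',
]
print(count_gtf_features(example))
```

To use it on a downloaded file, pass the local path to `count_gtf_file`; the path is whatever name the file was saved under, and a `.gz` suffix is handled transparently.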