Data files

VectorBase is committed to a new release every two months. A list of changes and the latest data (e.g., current gene sets) relative to a specific release can be found in our release notes. Current and archived data are available for download below (an FAQ provides a summary of the file types).

Displaying 1 - 22 of 22
Double-click species to show assemblies
Operations
File Organism Size Type Description
Anopheles-melas-CM1001059_A_CONTIGS_AmelC2.fa.gz Anopheles melas 61.63 MB Contigs CM1001059_A strain genomic contigs sequences, AmelC2 assembly.
Anopheles-melas-CM1001059_A_SCAFFOLDS_AmelC2.fa.gz Anopheles melas 62.28 MB Scaffolds CM1001059_A strain genomic scaffolds sequences, AmelC2 assembly, softmasked using RepeatMasker, Dust, and TRF.
Anopheles-melas-CM1001059_A_PEPTIDES_AmelC2.6.fa.gz Anopheles melas 4.55 MB Peptides CM1001059_A strain peptide sequences, AmelC2.6 geneset.
Anopheles-melas-CM1001059_A_TRANSCRIPTS_AmelC2.6.fa.gz Anopheles melas 6.82 MB Transcripts CM1001059_A strain transcript sequences, AmelC2.6 geneset.
Anopheles-melas-CM1001059_A_BASEFEATURES_AmelC2.6.gff3.gz Anopheles melas 2.44 MB Basefeatures CM1001059_A strain AmelC2.6 geneset in GFF3 format.
Anopheles-melas-CM1001059_A_BASEFEATURES_AmelC2.6.gtf.gz Anopheles melas 1.67 MB Basefeatures CM1001059_A strain AmelC2.6 geneset in GTF (v2.2) format.
Anopheles-melas-CM1001059_A_CONTIG2SCAFFOLD_AmelC2.agp.gz Anopheles melas 439.33 KB Contig to Scaffold mapping AGP (v2.0) file relating contigs to scaffolds for the Anopheles melas CM1001059_A strain, AmelC2.1 assembly.
Anopheles-melas-CM1001059_A_MAPPINGS_AmelC2.2-AmelC2.3.txt Anopheles melas 8.6 KB ID Mapping Stable ID mapping between genesets AmelC2.2 and AmelC2.3 (RNA gene update)
Anopheles-melas-CM1001059_A_REPEATS.lib Anopheles melas 10.22 KB RepeatMasker library RepeatMasker library file of repeats for Anopheles melas, CM1001059_A strain.
Anopheles-melas-CM1001059_A_REPEATFEATURES_AmelC2.gff3.gz Anopheles melas 5.02 MB Repeat features CM1001059_A strain AmelC2 repeat features (RepeatMasker, Dust, TRF) in GFF3 format.
anopheles_melas_unresolved_transcripts_2015_06.txt Anopheles melas 238 bytes Transcript projection The stable IDs of unprojected transcripts which fulfill the following criteria, and thus represented credible genes on the old assembly: * protein_features > 0 * orthologs...
anopheles_melas_projected_cdna_2015_06.fa Anopheles melas 20.99 MB Transcript projection cDNA sequence (from the old assembly) of projected transcripts.
anopheles_melas_projected_cds_2015_06.fa Anopheles melas 20.9 MB Transcript projection Coding sequence (from the old assembly) of projected transcripts.
anopheles_melas_projected_2015_06.gff3 Anopheles melas 9.95 MB Transcript projection Projected transcripts (coordinates on the new assembly) in GFF3 format.
anopheles_melas_projected_pep_2015_06.fa Anopheles melas 7.21 MB Transcript projection Peptide sequence (from the old assembly) of projected transcripts.
anopheles_melas_README_2015_06.txt Anopheles melas 7.38 KB Transcript projection In VectorBase release 1506, Anopheles melas genes were projected from assembly version 1 (GCA_000473525.1) to assembly version 2 (GCA_000473525.2). This README file describes the...
anopheles_melas_report_2015_06.txt Anopheles melas 2.87 MB Transcript projection Description of the fate of every transcript in the old assembly, along with statistics to judge the quality and quantity of evidence. Columns: * transcript: stable ID * status:...
anopheles_melas_summary_2015_06.txt Anopheles melas 472 bytes Transcript projection A summary of the number of transcripts that could and could not be projected.
anopheles_melas_unprojected_cdna_2015_06.fa Anopheles melas 1.75 MB Transcript projection cDNA sequence (from the old assembly) of unprojected transcripts.
anopheles_melas_unprojected_cds_2015_06.fa Anopheles melas 1.75 MB Transcript projection Coding sequence (from the old assembly) of unprojected transcripts.
anopheles_melas_unprojected_2015_06.gff3 Anopheles melas 144.51 KB Transcript projection Unrojected (but partially mapped) transcripts (coordinates on the new assembly) in GFF3 format.
anopheles_melas_unprojected_pep_2015_06.fa Anopheles melas 613.12 KB Transcript projection Peptide sequence (from the old assembly) of unprojected transcripts.