Data files

VectorBase is committed to a new release every two months. A list of changes and the latest data (e.g., current gene sets) relative to a specific release can be found in our release notes. Current and archived data are available for download below (an FAQ provides a summary of the file types).

Displaying 1 - 33 of 33
Double-click species to show assemblies
Operations
File Organism Size Type Description
Anopheles-sinensis-China_CONTIGS_AsinC2.fa.gz Anopheles sinensis 64.74 MB Contigs China strain genomic contig sequences, AsinC2 assembly.
Anopheles-sinensis-SINENSIS_CONTIGS_AsinS2.fa.gz Anopheles sinensis 57.48 MB Contigs SINENSIS strain genomic contig sequences, AsinS2 assembly.
Anopheles-sinensis-China_SCAFFOLDS_AsinC2.fa.gz Anopheles sinensis 64.22 MB Scaffolds China strain genomic scaffolds sequences, AsinC2 assembly, softmasked using RepeatMasker, Dust, and TRF.
Anopheles-sinensis-SINENSIS_SCAFFOLDS_AsinS2.fa.gz Anopheles sinensis 60.07 MB Scaffolds SINENSIS strain genomic scaffold sequences, AsinS2 assembly, softmasked using RepeatMasker, Dust, and TRF.
Anopheles-sinensis-SINENSIS_PEPTIDES_AsinS2.5.fa.gz Anopheles sinensis 3.85 MB Peptides SINENSIS strain peptide sequences, AsinS2.5 geneset.
Anopheles-sinensis-China_PEPTIDES_AsinC2.2.fa.gz Anopheles sinensis 4.67 MB Peptides China strain peptide sequences, AsinC2.2 geneset.
Anopheles-sinensis-SINENSIS_TRANSCRIPTS_AsinS2.5.fa.gz Anopheles sinensis 6.96 MB Transcripts SINENSIS strain transcript sequences, AsinS2.5 geneset.
Anopheles-sinensis-China_TRANSCRIPTS_AsinC2.2.fa.gz Anopheles sinensis 7.02 MB Transcripts China strain transcript sequences, AsinC2.2 geneset.
Anopheles-sinensis-SINENSIS_BASEFEATURES_AsinS2.5.gff3.gz Anopheles sinensis 2.03 MB Basefeatures SINENSIS strain AsinS2.5 geneset in GFF3 format.
Anopheles-sinensis-SINENSIS_BASEFEATURES_AsinS2.5.gtf.gz Anopheles sinensis 1.53 MB Basefeatures SINENSIS strain AsinS2.5 geneset in GTF (v2.2) format.
Anopheles-sinensis-China_BASEFEATURES_AsinC2.2.gff3.gz Anopheles sinensis 2.19 MB Basefeatures China strain AsinC2.2 geneset in GFF3 format.
Anopheles-sinensis-China_BASEFEATURES_AsinC2.2.gtf.gz Anopheles sinensis 1.79 MB Basefeatures China strain AsinC2.2 geneset in GTF (v2.2) format.
Anopheles-sinensis-SINENSIS_CONTIG2SCAFFOLD_AsinS2.agp.gz Anopheles sinensis 615.48 KB Contig to Scaffold mapping AGP (v2.0) file relating contigs to scaffolds for the Anopheles sinensis SINENSIS strain, AsinS2.1 assembly.
Anopheles-sinensis-China_CONTIG2SCAFFOLD_AsinC2.agp.gz Anopheles sinensis 548.56 KB Contig to Scaffold mapping AGP (v2.0) file relating contigs to scaffolds for the Anopheles sinensis China strain, AsinC2.1 assembly.
Anopheles-sinensis-China_MAPPINGS_AsinC2.1-AsinC2.2.txt Anopheles sinensis 15.77 KB ID Mapping Stable ID mapping between genesets AsinC2.1 and AsinC2.2 (RNA gene update)
Anopheles-sinensis-SINENSIS_MAPPINGS_AsinS2.1-AsinS2.2.txt Anopheles sinensis 11.8 KB ID Mapping Stable ID mapping between genesets AsinS2.1 and AsinS2.2 (RNA gene update)
Anopheles-sinensis-SINENSIS_REPEATS.lib Anopheles sinensis 20.64 KB RepeatMasker library RepeatMasker library file of repeats for Anopheles sinensis, SINENSIS strain.
Anopheles-sinensis-China_REPEATS.lib Anopheles sinensis 424.99 KB RepeatMasker library RepeatMasker library file of repeats for Anopheles sinensis, China strain.
Anopheles-sinensis-China_REPEATFEATURES_AsinC2.gff3.gz Anopheles sinensis 3.16 MB Repeat features China strain AsinC2 repeat features (RepeatMasker, Dust, TRF) in GFF3 format.
Anopheles-sinensis-SINENSIS_REPEATFEATURES_AsinS2.gff3.gz Anopheles sinensis 3.01 MB Repeat features SINENSIS strain AsinS2 repeat features (RepeatMasker, Dust, TRF) in GFF3 format.
anopheles_sinensis_unresolved_transcripts_2015_06.txt Anopheles sinensis 5.5 KB Transcript projection The stable IDs of unprojected transcripts which fulfill the following criteria, and thus represented credible genes on the old assembly: * protein_features > 0 * orthologs...
anopheles_sinensis_projected_cdna_2015_06.fa Anopheles sinensis 21.49 MB Transcript projection cDNA sequence (from the old assembly) of projected transcripts.
anopheles_sinensis_projected_cds_2015_06.fa Anopheles sinensis 17.53 MB Transcript projection Coding sequence (from the old assembly) of projected transcripts.
anopheles_sinensis_projected_2015_06.gff3 Anopheles sinensis 8.61 MB Transcript projection Projected transcripts (coordinates on the new assembly) in GFF3 format.
anopheles_sinensis_projected_pep_2015_06.fa Anopheles sinensis 6.05 MB Transcript projection Peptide sequence (from the old assembly) of projected transcripts.
anopheles_sinensis_README_2015_06.txt Anopheles sinensis 7.38 KB Transcript projection In VectorBase release 1506, Anopheles sinensis genes were projected from assembly version 1 (GCA_000472065.1) to assembly version 2 (GCA_000472065.2). This README file describes...
anopheles_sinensis_report_2015_06.txt Anopheles sinensis 2.75 MB Transcript projection Description of the fate of every transcript in the old assembly, along with statistics to judge the quality and quantity of evidence. Columns: * transcript: stable ID * status:...
anopheles_sinensis_summary_2015_06.txt Anopheles sinensis 478 bytes Transcript projection A summary of the number of transcripts that could and could not be projected.
anopheles_sinensis_unprojected_cdna_2015_06.fa Anopheles sinensis 3.59 MB Transcript projection cDNA sequence (from the old assembly) of unprojected transcripts.
anopheles_sinensis_unprojected_cds_2015_06.fa Anopheles sinensis 3.19 MB Transcript projection Coding sequence (from the old assembly) of unprojected transcripts.
anopheles_sinensis_unprojected_2015_06.gff3 Anopheles sinensis 1.69 MB Transcript projection Unrojected (but partially mapped) transcripts (coordinates on the new assembly) in GFF3 format.
anopheles_sinensis_unprojected_pep_2015_06.fa Anopheles sinensis 1.12 MB Transcript projection Peptide sequence (from the old assembly) of unprojected transcripts.
Anopheles-sinensis_china_EXPR-STATS_VB-2018-12.txt.gz Anopheles sinensis 496.74 KB Expression statistics Tab-delimited, log2 transformed expression values, gene x condition: mean, variance and number of replicates. The means are no longer median-subtracted.