RproC1.0

An enhanced gene build for Rhodnius prolixus has been included as part of the VB-2012-04 release. The high number of transcripts has been reduced.

At the present time, the following data are available:

  • repeats (Dust, TRF, known repeated elements publicly available)
  • alignments to 9,665 Rhodnius prolixus Expressed Sequence Tags (ESTs) from dbEST mapped to the genome
  • alignments to 792,023 sequences from 2 Rhodnius prolixus RNA-seq datasets mapped to the genome
  • alignments to 143,631 proteins from UniProt, grouped into 7 taxa, mapped to the genome
  • 549 ncRNAs and 1,148 pseudogenes have been predicted
  • 18,426 Genscan and 65,946 SNAP ab intio gene models
  • 15,878 gene models from the Lagerblad lab
  • 2,589 full length cDNAs from the Lagerblad lab
  • consensus gene set with 16,122 genes and 16,134 transcripts built with the Ensembl gene build pipeline

Be aware that the gene, transcript and peptide identifiers are not stable and there is no plan to maintain them. They have been indexed as part of the site-wide search.

Note that VectorBase provides a BLAST service to compare your sequence against the Rhodnius prolixus supercontigs, transcripts and peptides.

Gene annotation is available in GFF3 and GTF for download. The transcripts, peptides, supercontigs and ESTs are provided in FASTA format.

An enhanced genebuild will follow in a future VectorBase release. We aim to integrate annotation from the community and produce a release quality gene set.

Genes

Genes Protein-coding Other
16,122

Transcripts

Transcripts Protein-coding Other
Release date: 
16 Apr 2012
Assembly: 

RproC1

Rhodnius prolixus was sequenced by the Washington University School of Medicine, Genome Sequencing Center (WUGSC) in September 2010. The whole-genome shotgun plasmid, bac end reads and 454 reads were assembled using the CABOG assembler. DNA was isolated from the colony at the Centers for Disease Control in Atlanta, USA. It is composed of 58,560 contigs assembled into 27,872 scaffolds with a total length of 702 Mb. This includes 141 Mb (20%) of padding characters.