What is this -RA suffix after gene names?

Answer: 

This notation represents transcripts. All transcripts are named after the gene, with addition of the suffix -Rx, 'x' being a letter. Translation will have a "-Px" suffix. This is based on the FlyBase notation.




E.g., given the AGP000123 gene in Anopheles,

  • the 1st transcript is "AGAP000123-RA" and the cognate translation is "AGAP000123-PA",
  • the 2nd transcript is "AGAP000123-RB" and the cognate translation is "AGAP000123-PB",
  • the 3rd transcript is "AGAP000123-RC" and the cognate translation is "AGAP000123-PC",
  • etc.





It makes is easy to identify the gene given its transcript identifier as you only need to remove the suffix to get the gene stable identifier.


If a transcript is removed (because it was proved it doesn't exist), its ID will be remove but will NOT be reassigned. Thus the succession of letters might be interrupted.