Hello, I am using a GTF file from the iGenomes website for bulk RNA-seq of whole Drosophila brains (Drosophila_melanogaster/UCSC/dm6/Annotation/Genes/genes.gtf).
I did the annotation with Rsubread and got a file with gene symbols. However, some symbols have the same spelling and differ only in whether the first letter is uppercase or lowercase, and they have different gene IDs (e.g. Crc and crc).
When I search the NCBI website for "Crc", I cannot tell whether it means "cryptocephal" or "Calreticulin". I found a description saying that a Drosophila gene name starts with a lowercase letter if the gene was named for a recessive mutant and with an uppercase letter if it was named for a dominant mutant, but that was hard to confirm from the search results.
Is there a good way to find the correct official full name? Or is there a way to get an annotation file with both the gene symbol and the gene ID? I would appreciate it if someone could let me know.
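To make the second question concrete, here is a minimal Python sketch of the kind of table I am after. It is not part of Rsubread, and it assumes that the ninth GTF column carries `gene_id` and `gene_name` attributes in the usual `key "value";` form. The function names `gene_table` and `case_collisions` are hypothetical, and the IDs in the usage note are placeholders rather than real identifiers.

```python
import re
from collections import defaultdict

# Matches key "value" pairs in the GTF attributes column, e.g. gene_id "X";
ATTR_RE = re.compile(r'(\S+) "([^"]*)"')


def gene_table(gtf_lines):
    """Return a sorted list of unique (gene_id, gene_name) pairs from GTF lines."""
    pairs = set()
    for line in gtf_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comments
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9:
            continue  # not a well-formed GTF record
        attrs = dict(ATTR_RE.findall(fields[8]))
        gene_id = attrs.get("gene_id")
        if gene_id is None:
            continue
        # Assumption: fall back to the ID when no gene_name attribute is present.
        pairs.add((gene_id, attrs.get("gene_name", gene_id)))
    return sorted(pairs)


def case_collisions(pairs):
    """Group (symbol, gene_id) entries whose symbols differ only by letter case."""
    by_lower = defaultdict(set)
    for gene_id, symbol in pairs:
        by_lower[symbol.lower()].add((symbol, gene_id))
    return {
        key: sorted(group)
        for key, group in by_lower.items()
        if len({sym for sym, _ in group}) > 1
    }
```

With an open file, `pairs = gene_table(open("genes.gtf"))` would give the ID/symbol table, and `case_collisions(pairs)` would list the ambiguous symbols, for example a `"crc"` entry holding both `("Crc", "ID_A")` and `("crc", "ID_B")` if the file contained those placeholder records.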
Dear Dr. Gordon Smyth,
Thank you so much for the detailed answer. The problem is completely solved.
The alias2SymbolUsingNCBI function was amazing indeed. Also, with edgeR, limma, and Rsubread, I was able to complete my PhD work. I deeply appreciate your creating such great packages.
Sincerely,
Chise