Annotated file with gene ID (instead of gene symbol)
1
0
Entering edit mode
Chise ▴ 10
@9cb59de3
Last seen 2.4 years ago
United States

Hello, I am using "featureCounts" in Rsubread package for analyzing bulk RNA-seq of drosophila. Since there is no inbuilt annotations of drosophila, I am using a gtf file in the homepage of iGenomes. (Drosophila_melanogaster/UCSC/dm6/Annotation/Genes/genes.gtf.)

I made bam files using Rsubread referring to dm6, then I tried to use the following codes.

library(Rsubread)
library(limma)
library(edgeR)
targets <- readTargets("wholeflyseq.txt")
fc <- featureCounts(files=targets$OutputFile,annot.ext="genes.gtf",isGTFAnnotationFile=TRUE, isPairedEnd=TRUE)

I was able to obtain a file with gene symbol but is there any way to get a file with gene ID of NCBI? When I tried to convert the information of gene symbol to gene ID after I get the file, around 20% of the information was lost.

I would appreciate it if someone could inform me of how to get a file with gene ID instead of gene symbol.

Rsubread featureCounts • 1.9k views
ADD COMMENT
0
Entering edit mode

Have you looked at the NCBI RefSeq database?

ADD REPLY
0
Entering edit mode

Do you mean the following homepage? https://www.ncbi.nlm.nih.gov/nuccore/?term=dm6+AND+srcdb_refseq%5BPROP%5D

I am not an expert of computational analysis, so I do not think I was able to make good use of it...

ADD REPLY
1
Entering edit mode
@gordon-smyth
Last seen 38 minutes ago
WEHI, Melbourne, Australia

See Aliases of drosophila gene names.

ADD COMMENT
0
Entering edit mode

Thank you so much for the detailed answer, the problem was completely solved! The "alias2SymbolUsingNCBI" was amazing indeed.

ADD REPLY

Login before adding your answer.

Traffic: 435 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6