Entering edit mode
Wendy Qiao
▴
360
@wendy-qiao-4501
Last seen 10.2 years ago
Hi all,
I am converting the HGNC symbols from an Illumina human array to
Entrez ID
using biomaRt. I found that there are some gene symbols are matched to
many
Entrez IDs, and vice versa. I am wondering if how to solve the
problem, so
one gene symbol is only matched to one Entrez ID. Or is there any
other
package that I can use for matching gene symbols to Entrez IDs. Thank
you in
advance.
Wendy
=====
In the following example, BAGE2, 3, 4 and 5 are matched to 85316 and
85317
which are the Entrez IDs of BAGE5 and BAGE4, respectively.
library('biomaRt')
ensembl=useMart("ensembl_mart_51",dataset="hsapiens_gene_ensembl",arch
ive=TRUE)
Entrez<-getBM(attributes=c("hgnc_symbol","entrezgene"),filters="hgnc_s
ymbol",values=GeneList,mart=ensembl)
# class(GeneList) = factor
Entrez[1:20,]
hgnc_symbol entrezgene
1 ZFP62 92379
2 C9orf169 375791
3 FAM72D 653573
4 HMX1 NA
5 HMX1 3166
6 ZFP62 NA
7 RSPO4 343637
8 DOC2B 8447
9 C8orf42 157695
10 TTTY8 NA
11 A26C3 NA
12 BAGE5 85316
13 BAGE4 85316
14 BAGE3 85316
15 BAGE2 85316
16 BAGE5 85317
17 BAGE4 85317
18 BAGE3 85317
19 BAGE2 85317
20 NBR1 4077
[[alternative HTML version deleted]]