I thought that all of the information about each chipset (i.e. platform) was in the corresponding R package.
For example, hgu219.db is the annotation package for the hgu219 platform.
However, my key lookup results differ depending on the package version of AnnotationDbi, even when the hgu219.db package versions are the same.
So for example,
keys(hgu219.db, keytype = 'UNIPROT')
gives a different list of UNIPROTs depending on the AnnotationDbi version.
I thought all of the info was in the hgu219.db package. My thinking must be incorrect?
Can someone explain why this is happening? I'm not sure if I should be filing a bug report.
I think this is related to
org.Hs.eg.db
.There are missing UNIPROT identifiers that used to exists. Example with org.Hs.eg.db:
v3.11.4:
v3.8.2:
There is an NA when it should link to P84550 (which it did in previous versions). Which doesn't seem to have any clear reason for removal: https://www.uniprot.org/uniprot/P84550
To expand on this comment: IMO this has to do with the fact that somehow the current annotation info provided by the NCBI does not include anymore the link to the UniProt ID (but previously it did).
Please realize that NCBI and UniProt are 2 independent groups/consortia that provide annotation info, and that the
org.Hs.eg.db
(or any otherOrgDb
) is simply a repackaged, R-compatible 1 to 1 copy of the info provided by the NCBI.James MacDonald expanded on this multiple times, for example in this post (although it is dealing with GO annotations I assume you will get the point).