Hi,
Your sessionInfo() looks good. I see you're using the current release (Bioconductor 3.4) and have the current version of AnnotationHub loaded. Bioconductor 3.4 was released in October 2016 that is why when you load AnnotationHub the default snapshot date is 2016-10-11:
> hub = AnnotationHub()
snapshotDate(): 2016-10-11
All snapshot dates for a particular release can be seen by calling possibleDates() on the AnnotationHub object. Note that the 2016-10-11 date is the max:
> possibleDates(hub)
[1] "2013-03-19" "2013-03-21" "2013-03-26" "2013-04-04" "2013-04-29"
[6] "2013-06-24" "2013-06-25" "2013-06-26" "2013-06-27" "2013-10-29"
...
[46] "2015-08-17" "2015-08-26" "2015-12-28" "2015-12-29" "2016-01-25"
[51] "2016-03-07" "2016-05-03" "2016-05-25" "2016-06-06" "2016-07-20"
[56] "2016-08-15" "2016-10-11"
I can also reproduce your results:
> length(query(hub,"Pseudomonas"))
[1] 1
So all of that looks good. As for your coworker on windows, I still can't see the output of sessionInfo(). The commands you show above are using a snapshot date of 2017-04-10 which would indicate they are using R/Bioconductor devel. If I load AnnotationHub in devel I see
> hub = AnnotationHub()
snapshotDate(): 2017-04-10
As a side note, you'll see the number of possibleDates() for devel are different:
> possibleDates(hub)
[1] "2013-03-19" "2013-03-21" "2013-03-26" "2013-04-04" "2013-04-29"
[6] "2013-06-24" "2013-06-25" "2013-06-26" "2013-06-27" "2013-10-29"
...
[56] "2016-08-15" "2016-10-11" "2016-11-03" "2016-11-08" "2016-11-09"
[61] "2016-11-13" "2016-11-14" "2016-12-22" "2016-12-28" "2017-01-05"
[66] "2017-02-07" "2017-04-03" "2017-04-04" "2017-04-05" "2017-04-10"
[71] "2017-04-10"
When I query the devel hub for 'Pseudomonas' I still get the one record:
> query(hub,"Pseudomonas")
AnnotationHub with 1 record
# snapshotDate(): 2017-04-10
# names(): AH10565
# $dataprovider: Inparanoid8
# $species: Pseudomonas aeruginosa
# $rdataclass: Inparanoid8Db
# $title: hom.Pseudomonas_aeruginosa.inp8.sqlite
# $description: Inparanoid 8 annotations about Pseudomonas aeruginosa
# $taxonomyid: 208964
# $genome: inparanoid8 genomes
# $sourcetype: Inparanoid
# $sourceurl: http://inparanoid.sbc.su.se/download/current/Orthologs/P.aerug...
# $sourcelastmodifieddate: NA
# $sourcesize: NA
# $tags: c("Inparanoid", "Gene", "Homology", "Annotation")
# retrieve record with 'object[["AH10565"]]'
Unless we see the sessionInfo() from your coworker and get a little more information we can't help. We need to be able to reproduce the error, or unexpected results, to get to the bottom of this.
Valerie
Hi,
Sorry for the delay. There is a limit on how frequently I can post here. Here is her sessionInfo()
R version 3.3.1 (2016-06-21)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows 7 x64 (build 7601) Service Pack 1
locale:
[1] LC_COLLATE=French_France.1252 LC_CTYPE=French_France.1252 LC_MONETARY=French_France.1252
[4] LC_NUMERIC=C LC_TIME=French_France.1252
attached base packages:
[1] parallel stats graphics grDevices utils datasets methods base
other attached packages:
[1] AnnotationHub_2.4.2 BiocGenerics_0.18.0
loaded via a namespace (and not attached):
[1] Rcpp_0.12.10 IRanges_2.6.1 digest_0.6.12
[4] mime_0.5 R6_2.2.0 xtable_1.8-2
[7] DBI_0.6-1 stats4_3.3.1 RSQLite_1.1-2
[10] BiocInstaller_1.22.3 httr_1.2.1 curl_2.4
[13] S4Vectors_0.10.3 tools_3.3.1 Biobase_2.32.0
[16] shiny_1.0.1 httpuv_1.3.3 AnnotationDbi_1.34.4
[19] memoise_1.0.0 htmltools_0.3.5 interactiveDisplayBase_1.10.3
The problem is with the version of AnnotationHub - it's 2 (soon to be 3 ) releases old. The current release is version 2.6.5 seen here:
http://www.bioconductor.org/checkResults/release/bioc-LATEST/
The old 2.4.2 version does not have the correct logic implemented for the OrgDb objects. The data in an OrgDb is a snapshot of the most current information known at the time (vs tied to a genome build such as the TxDb packages). It quickly becomes out of date as updates are submitted to the source - be it UCSC, NCBI etc. Because of this we want all OrgDbs to be refreshed every 6 months before a release. The OrgDbs your coworker has listed are from 2014 and 2015:
> library(DBI)
> con = dbconn(hub)
> table(dbGetQuery(con, "select rdatadateadded from resources where title like '%Pseudomonas%'"))
2014-03-31 2014-07-02 2014-07-09 2015-07-27
1 1 25 9
It would make sense to use these annotations if you were trying to reproduce an analysis from 2014 but not if you are doing exploratory work / analysis today. I would reccomend the coworker update R/Bioconductor and use the most current packages.
As for the Pseudomonas record in the current AnnotationHub, we can see that's not an OrgDb but the Inparanoid8 database package:
> mcols(query(hub,"Pseudomonas"))[c("title", "rdatadateadded", "rdataclass")]
DataFrame with 1 row and 3 columns
title rdatadateadded rdataclass
<character> <character> <character>
AH10565 hom.Pseudomonas_aeruginosa.inp8.sqlite 2014-03-31 Inparanoid8Db
The Inparanoid db has been static for some time now with the last update in 2013 (http://inparanoid.sbc.su.se/cgi-bin/index.cgi). We propagate but do not rebuild the package with each release.
Valerie
PS: FYI there are a few more Inparanoid resources in the main Bioconductor repository if you're interested but these are also from 2014:
http://www.bioconductor.org/packages/release/BiocViews.html#___InparanoidDb