Hello,
I have been trying to use makeOrgPackageFromNCBI for what I believe is a well-annotated organism (Acropora millepora, taxon ID 45264). After being unsuccessful, I tried to run the example from Mark Carlson from this page :
makeOrgPackageFromNCBI(version = "0.1",
+ author = "Some One so@someplace.org",
+ maintainer = "Some One so@someplace.org",
+ outputDir = ".",
+ tax_id = "59729",
+ genus = "Taeniopygia",
+ species = "guttata")
But I get this return with an error at the bottom:
If files are not cached locally this may take awhile to assemble a 12 GB cache databse in the NCBIFilesDir directory. Subsequent calls to this function should be faster (seconds). The cache will try to rebuild once per day.
preparing data from NCBI ...
starting download for
[1] gene2pubmed.gz
[2] gene2accession.gz
[3] gene2refseq.gz
[4] geneinfo.gz
[5] gene2go.gz
getting data for gene2pubmed.gz
Error: no such table: gene2pubmeddate
I tried this same approach using inputs that I found from a published article that successfully did it with another organism:
makeOrgPackageFromNCBI(version = "0.1",
+ author = "Some One so@someplace.org",
+ maintainer = "Some One so@someplace.org",
+ outputDir = ".",
+ tax_id = "12993",
+ genus = "Cassiopea",
+ species = "xamachana")
But again, same error.
Does anyone know why this happening and a potential solution?
Thanks in advance!
Hi, thanks so much for your quick response. I was able to get that to work. Now I am stuck making this new orgdb work with clusterProfiler with my DESeq2 data, to do a GO Enrichment Analysis.
try to do enrichGO with this gene.df and this new orgdb
ego2 <- enrichGO(gene = gene.df$ENTREZID,
OrgDb = orgdb,
keyType = 'REFSEQ',
ont = "CC",
pAdjustMethod = "BH",
pvalueCutoff = 0.01,
qvalueCutoff = 0.05)
but, the error that comes out is
Error in testForValidKeytype(x, keytype) :
Invalid keytype: GOALL. Please use the keytypes method to see a listing of valid arguments.
so I tried playing around with different keytype inputs , but still get the same:
ego2 <- enrichGO(gene = gene.df$ENTREZID,
+ OrgDb = orgdb,
+ keyType = 'ENTREZID',
+ ont = "CC",
+ pAdjustMethod = "BH",
+ pvalueCutoff = 0.01,
+ qvalueCutoff = 0.05)
Error in testForValidKeytype(x, keytype) :
Invalid keytype: GOALL. Please use the keytypes method to see a listing of valid arguments.
Not sure where this GOALL is coming from. Any ideas how to pass both this new orgdb and my DESeq2 results through goenrich?
Hi, thanks so much for your quick response. I was able to get that to work. Now I am stuck making this new orgdb work with clusterProfiler with my DESeq2 data, to do a GO Enrichment Analysis.
Here is what I have done:
created and sorted a geneList:
head(geneList) NP612819.1 XP029213287.1 XP029179546.1 XP015773080.1 XP029193592.1 XP015748232.1 15.90933 12.44382 12.35324 12.29142 12.21528 12.07018
since these are in REFSEQ format, add ENTREZID with bitr
gene.df <- bitr(gene, fromType = "REFSEQ", + toType = c("ENTREZID"), + OrgDb = orgdb)
head(gene.df) REFSEQ ENTREZID 2 XP029213287.1 114976909 3 XP029179546.1 114947034 5 XP029193592.1 114959662 10 XP029187547.1 114954991 11 XP029198384.1 114963355 12 XP029179359.1 114946884
which seems to look good
try to do enrichGO with this gene.df and this new orgdb ego2 <- enrichGO(gene = gene.df$ENTREZID, OrgDb = orgdb, keyType = 'REFSEQ', ont = "CC", pAdjustMethod = "BH", pvalueCutoff = 0.01, qvalueCutoff = 0.05)
but, the error that comes out is
Error in testForValidKeytype(x, keytype) : Invalid keytype: GOALL. Please use the keytypes method to see a listing of valid arguments.
so I tried playing around with different keytype inputs , but still get the same:
Not sure where this GOALL is coming from. Any ideas how to pass both this new orgdb and my DESeq2 results through goenrich?
Many thanks again.