missing AnnBuilder parser
2
0
Entering edit mode
Francois Pepin ★ 1.3k
@francois-pepin-1012
Last seen 10.3 years ago
Hi everyone, I'm trying to rebuild my annotation package for an Agilent chip using Resourcerer and it's not finder one of the parsers. The basic error is: >resourcerer2BioC("Agilent_HumanGenome.zip", organism="human") Warning: cannot open file `NA' Warning in unifyMappings(base, eg, ug, otherSrc) : Failed to get or parse Entrez Gene data because of: Error in file(con, "r") : unable to open connection The missing file in question is the parser from Genbank to Entrez Gene (The Agilent annotations are based on GenBank). > getBaseParsers("gb") LL "/usr/local/lib/R/library/AnnBuilder/scripts/egAccParser" UG "/usr/local/lib/R/library/AnnBuilder/scripts/gbUGParser" I've convinced it to use the LocusLink parser instead as the IDs should be the same, and the code runs for about an hour and at the very end: ... [21] "/usr/local/lib/R/library/Resourcerer/temp/AgilentHsWg/data/AgilentHsW gREFSEQ.rda" [22] "/usr/local/lib/R/library/Resourcerer/temp/AgilentHsWg/data/AgilentHsW gSYMBOL.rda" [23] "/usr/local/lib/R/library/Resourcerer/temp/AgilentHsWg/data/AgilentHsW gUNIGENE.rda" exiting from: ABPkgBuilder(baseFile, srcUrls, baseMapType, otherSrc, pkgName, pkgPath, organism, version, makeXML, author, fromWeb) Warning: cannot open compressed file `/usr/local/lib/R/library/Resourcerer/temp/AgilentHsWg/data/AgilentHsW gLOCUSID.rda' Error in open.connection(con, "rb") : unable to open connection At which point I assume I'll need a proper parser for it. Any suggestions on what to do next? Having since learned my lesson: > sessionInfo() R version 2.0.0, 2004-10-04, x86_64-unknown-linux-gnu attached base packages: [1] "tools" "methods" "stats" "graphics" "grDevices" "utils" [7] "datasets" "base" other attached packages: Resourcerer AnnBuilder annotate XML Biobase "1.1.0" "1.5.12" "1.5.1" "0.95-6" "1.5.0" Francois
Annotation Organism annotate AnnBuilder Annotation Organism annotate AnnBuilder • 1.2k views
ADD COMMENT
0
Entering edit mode
John Zhang ★ 2.9k
@john-zhang-6
Last seen 10.3 years ago
I will have a look to see what might be wrong. Due to a recent switch from LocusLink to Entrez Gene, AnnBuilder and other packages that depend on AnnBuilder may not work as expected. We are in the process of fixing the problems. Sorry! >X-Original-To: jzhang@jimmy.harvard.edu >Delivered-To: jzhang@jimmy.harvard.edu >From: Francois Pepin <fpepin@cs.mcgill.ca> >To: bioconductor@stat.math.ethz.ch >Mime-Version: 1.0 >Date: Fri, 08 Apr 2005 14:37:36 -0400 >Content-Transfer-Encoding: 7bit >X-Virus-Scanned: by amavisd-new at stat.math.ethz.ch >Subject: [BioC] missing AnnBuilder parser >X-BeenThere: bioconductor@stat.math.ethz.ch >X-Mailman-Version: 2.1.5 >List-Id: The Bioconductor Project Mailing List <bioconductor.stat.math.ethz.ch> >List-Unsubscribe: <https: stat.ethz.ch="" mailman="" listinfo="" bioconductor="">, <mailto:bioconductor-request@stat.math.ethz.ch?subject=unsubscribe> >List-Archive: <https: stat.ethz.ch="" pipermail="" bioconductor=""> >List-Post: <mailto:bioconductor@stat.math.ethz.ch> >List-Help: <mailto:bioconductor- request@stat.math.ethz.ch?subject="help"> >List-Subscribe: <https: stat.ethz.ch="" mailman="" listinfo="" bioconductor="">, <mailto:bioconductor-request@stat.math.ethz.ch?subject=subscribe> >X-Spam-Checker-Version: SpamAssassin 3.0.1 (2004-10-22) on pascal.dfci.harvard.edu >X-Spam-Level: >X-Spam-Status: No, score=-2.6 required=3.0 tests=AWL,BAYES_00 autolearn=ham version=3.0.1 > >Hi everyone, > >I'm trying to rebuild my annotation package for an Agilent chip using >Resourcerer and it's not finder one of the parsers. > >The basic error is: >>resourcerer2BioC("Agilent_HumanGenome.zip", organism="human") > >Warning: cannot open file `NA' >Warning in unifyMappings(base, eg, ug, otherSrc) : > Failed to get or parse Entrez Gene data because of: > > Error in file(con, "r") : unable to open connection > > >The missing file in question is the parser from Genbank to Entrez Gene >(The Agilent annotations are based on GenBank). > >> getBaseParsers("gb") > LL >"/usr/local/lib/R/library/AnnBuilder/scripts/egAccParser" > UG > "/usr/local/lib/R/library/AnnBuilder/scripts/gbUGParser" > > >I've convinced it to use the LocusLink parser instead as the IDs should >be the same, and the code runs for about an hour and at the very end: > >... >[21] >"/usr/local/lib/R/library/Resourcerer/temp/AgilentHsWg/data/AgilentHs WgREFSEQ.r da" >[22] >"/usr/local/lib/R/library/Resourcerer/temp/AgilentHsWg/data/AgilentHs WgSYMBOL.r da" >[23] >"/usr/local/lib/R/library/Resourcerer/temp/AgilentHsWg/data/AgilentHs WgUNIGENE. rda" >exiting from: ABPkgBuilder(baseFile, srcUrls, baseMapType, otherSrc, >pkgName, > pkgPath, organism, version, makeXML, author, fromWeb) >Warning: cannot open compressed file >`/usr/local/lib/R/library/Resourcerer/temp/AgilentHsWg/data/AgilentHs WgLOCUSID. rda' >Error in open.connection(con, "rb") : unable to open connection > >At which point I assume I'll need a proper parser for it. Any >suggestions on what to do next? > >Having since learned my lesson: >> sessionInfo() >R version 2.0.0, 2004-10-04, x86_64-unknown-linux-gnu > >attached base packages: >[1] "tools" "methods" "stats" "graphics" "grDevices" "utils" >[7] "datasets" "base" > >other attached packages: >Resourcerer AnnBuilder annotate XML Biobase > "1.1.0" "1.5.12" "1.5.1" "0.95-6" "1.5.0" > >Francois > >_______________________________________________ >Bioconductor mailing list >Bioconductor@stat.math.ethz.ch >https://stat.ethz.ch/mailman/listinfo/bioconductor Jianhua Zhang Department of Medical Oncology Dana-Farber Cancer Institute 44 Binney Street Boston, MA 02115-6084
ADD COMMENT
0
Entering edit mode
Dear Bioc, I am curious about how BioC will handle recent switch EnterZ Gene from LocusLink? Could anyone please, point me out how can I access : From LocusLink ID --> to get GI number(s) or From GI numbers --> to get LocusLink Ids or From GI number(s) --> UniGene or GenBank Ids in BioC? Any help will be great help, Thank you in advance, Saurin
ADD REPLY
0
Entering edit mode
John Zhang ★ 2.9k
@john-zhang-6
Last seen 10.3 years ago
>I'm trying to rebuild my annotation package for an Agilent chip using >Resourcerer and it's not finder one of the parsers. > >The basic error is: >>resourcerer2BioC("Agilent_HumanGenome.zip", organism="human") I have checked in the bug fixes. Please look for a new build (1.1.1) of Resourcerer in the next few days. Meanwhile, you may try to fix the bug yourself by changing the argument baseMapType = c("gb", "ug", "ll") to baseMapType = c("gbNRef", "gb", "ug") of resourcerer2BioC and call the function by setting baseMapType = "gbNRef" (resourcerer2BioC("Agilent_HumanGenome.zip", organism = "human", baseMapType = "gbNRef")) Let me know if you run into any problem. Thanks. > >Warning: cannot open file `NA' >Warning in unifyMappings(base, eg, ug, otherSrc) : > Failed to get or parse Entrez Gene data because of: > > Error in file(con, "r") : unable to open connection > > >The missing file in question is the parser from Genbank to Entrez Gene >(The Agilent annotations are based on GenBank). > >> getBaseParsers("gb") > LL >"/usr/local/lib/R/library/AnnBuilder/scripts/egAccParser" > UG > "/usr/local/lib/R/library/AnnBuilder/scripts/gbUGParser" > > >I've convinced it to use the LocusLink parser instead as the IDs should >be the same, and the code runs for about an hour and at the very end: > >... >[21] >"/usr/local/lib/R/library/Resourcerer/temp/AgilentHsWg/data/AgilentHs WgREFSEQ.r da" >[22] >"/usr/local/lib/R/library/Resourcerer/temp/AgilentHsWg/data/AgilentHs WgSYMBOL.r da" >[23] >"/usr/local/lib/R/library/Resourcerer/temp/AgilentHsWg/data/AgilentHs WgUNIGENE. rda" >exiting from: ABPkgBuilder(baseFile, srcUrls, baseMapType, otherSrc, >pkgName, > pkgPath, organism, version, makeXML, author, fromWeb) >Warning: cannot open compressed file >`/usr/local/lib/R/library/Resourcerer/temp/AgilentHsWg/data/AgilentHs WgLOCUSID. rda' >Error in open.connection(con, "rb") : unable to open connection > >At which point I assume I'll need a proper parser for it. Any >suggestions on what to do next? > >Having since learned my lesson: >> sessionInfo() >R version 2.0.0, 2004-10-04, x86_64-unknown-linux-gnu > >attached base packages: >[1] "tools" "methods" "stats" "graphics" "grDevices" "utils" >[7] "datasets" "base" > >other attached packages: >Resourcerer AnnBuilder annotate XML Biobase > "1.1.0" "1.5.12" "1.5.1" "0.95-6" "1.5.0" > >Francois > >_______________________________________________ >Bioconductor mailing list >Bioconductor@stat.math.ethz.ch >https://stat.ethz.ch/mailman/listinfo/bioconductor Jianhua Zhang Department of Medical Oncology Dana-Farber Cancer Institute 44 Binney Street Boston, MA 02115-6084
ADD COMMENT
0
Entering edit mode
On Mon, 2005-04-11 at 14:49, John Zhang wrote: > I have checked in the bug fixes. Please look for a new build (1.1.1) of > Resourcerer in the next few days. Meanwhile, you may try to fix the bug yourself > by changing the argument baseMapType = c("gb", "ug", "ll") to baseMapType = > c("gbNRef", "gb", "ug") of resourcerer2BioC and call the function by setting > baseMapType = "gbNRef" (resourcerer2BioC("Agilent_HumanGenome.zip", organism = > "human", baseMapType = "gbNRef")) > > Let me know if you run into any problem. With a lot of begging and pleading, it ends up working: getProbe2ID (in Resourcerer) also has to be changed, but it doesn't know how to deal with gbNRef. It seems to be happy if I just give it 'gb'. ABPkgBuilder (in AnnBuilder) also needs to be changed to accept 'gbNRef'. But at the end (sending 'gbNRef to 'resourcerer2BioC', 'gb' to getProbe2ID and 'gbNRef' to ABPkgBuilder), there is the same error as before: [22] "/usr/local/lib/R/library/Resourcerer/temp/AgilentHsWg/data/Agile ntHsWgSYMBOL.rda" [23] "/usr/local/lib/R/library/Resourcerer/temp/AgilentHsWg/data/Agile ntHsWgUNIGENE.rda" exiting from: ABPkgBuilder(baseFile, srcUrls, baseMapType, otherSrc, pkgName, pkgPath, organism, version, makeXML, author, fromWeb) Warning: cannot open compressed file `/usr/local/lib/R/library/Resourc erer/temp/AgilentHsWg/data/AgilentHsWgLOCUSID.rda' Error in open.connection(con, "rb") : unable to open connection > traceback() 5: open.connection(con, "rb") 4: open(con, "rb") 3: load(llRda) 2: checkMapping(pkgName, tempOther, file.path(pkgPath, pkgName, "data", paste(pkgName, "LOCUSID.rda", sep = ""))) 1: resourcerer2BioC("Agilent_HumanGenome.zip", organism = "human", baseMapType = "gbNRef")) Building it with check=FALSE (along with the modifications above) seems to work fine, and I can install the package. The number of annotation has slightly increased since the last time I build it which is expected. The few mappings that I checked seem manually match. to show it working. I'd feel a bit better if the "checkMapping" function could confirm that the package was built properly. Francois
ADD REPLY

Login before adding your answer.

Traffic: 517 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6