Need better quality control for reliable installation of bioconductor packages?
2
0
Entering edit mode
Glynn, Earl ▴ 170
@glynn-earl-952
Last seen 10.2 years ago
Under Windows I just downloaded the new 2.0.1 version of R and installed all the packages: Packages | Install package(s) from CRAN ... I select all the packages for installation and that worked fine. This took about an hour. So with the new version of R, I thought I'd re-install all the Bioconductor packages. Like several times in the past, when I try to install all the Bioconductor packages I encounter problems: Packages | Install package(s) from Bioconductor ... I select all the packages for installation. The download take a long time but then the installation of packages fails. I wish the name of the temporary download directory was displayed earlier to make it easier to find. But I found the Bioconductor ZIP files and then tried Packages | Install Package(s) from local zip files ... Why is the installation of so many Bioconductor packages failing? For example: > install.packages(choose.files('',filters=Filters[c('zip','All'),]), .libPaths()[1], CRAN = NULL) Error in file(file, "r") : unable to open connection In addition: Warning messages: 1: error 1 in extracting from zip file 2: cannot open file `twilight/DESCRIPTION' > install.packages(choose.files('',filters=Filters[c('zip','All'),]), .libPaths()[1], CRAN = NULL) Error in file(file, "r") : unable to open connection In addition: Warning messages: 1: error 1 in extracting from zip file 2: cannot open file `iSPlot/DESCRIPTION' > install.packages(choose.files('',filters=Filters[c('zip','All'),]), .libPaths()[1], CRAN = NULL) Error in file(file, "r") : unable to open connection In addition: Warning messages: 1: error 1 in extracting from zip file 2: cannot open file `affycomp/DESCRIPTION' > install.packages(choose.files('',filters=Filters[c('zip','All'),]), .libPaths()[1], CRAN = NULL) Error in file(file, "r") : unable to open connection In addition: Warning messages: 1: error 1 in extracting from zip file 2: cannot open file `RdbiPgSQL/DESCRIPTION' This installation process is quite frustrating since one selects all the packages to install and then it fails. This must be repeated every time for a failure. So far I've seen at least 9 failures. What wrong with these DESCRIPTION directories? Why isn't the installation of bioconductor packages more reliable? (I've seen this same problem several times before). What quality control measures can be put in place to catch these problems and fix them automatically? efg -- Earl F. Glynn Scientific Programmer Bioinformatics Department Stowers Institute for Medical Research [[alternative HTML version deleted]]
PROcess PROcess • 1.4k views
ADD COMMENT
0
Entering edit mode
rgentleman ★ 5.5k
@rgentleman-7725
Last seen 9.6 years ago
United States
On Dec 10, 2004, at 10:21 AM, Glynn, Earl wrote: > Under Windows I just downloaded the new 2.0.1 version of R and > installed > all the packages: > > Packages | Install package(s) from CRAN ... > > I select all the packages for installation and that worked fine. This > took about an hour. > > So with the new version of R, I thought I'd re-install all the > Bioconductor packages. Like several times in the past, when I try to > install all the Bioconductor packages I encounter problems: > > Packages | Install package(s) from Bioconductor ... > > I select all the packages for installation. The download take a long > time but then the installation of packages fails. I wish the name of > the temporary download directory was displayed earlier to make it > easier > to find. But I found the Bioconductor ZIP files and then tried > Packages > | Install Package(s) from local zip files ... > > Why is the installation of so many Bioconductor packages failing? > > For example: > But you have not shown us what commands you are using? You can (and possibly should) use the reposTools library and the tools that we have built for dealing with the package dependencies that exist in and between many Bioconductor packages. > > > install.packages(choose.files('',filters=Filters[c('zip','All'),]), > .libPaths()[1], CRAN = NULL) > Error in file(file, "r") : unable to open connection > In addition: Warning messages: > 1: error 1 in extracting from zip file > 2: cannot open file `twilight/DESCRIPTION' > > > install.packages(choose.files('',filters=Filters[c('zip','All'),]), > .libPaths()[1], CRAN = NULL) > Error in file(file, "r") : unable to open connection > In addition: Warning messages: > 1: error 1 in extracting from zip file > 2: cannot open file `iSPlot/DESCRIPTION' > > Did you look to see what the problem is? Try just installing a single package, giving the correct path. These are just zip files open them and see if they are complete (it isn't very hard). Do you have Gtk properly installed? If not why would you want a package that relies on it? > install.packages(choose.files('',filters=Filters[c('zip','All'),]), > .libPaths()[1], CRAN = NULL) > Error in file(file, "r") : unable to open connection > In addition: Warning messages: > 1: error 1 in extracting from zip file > 2: cannot open file `affycomp/DESCRIPTION' > > > install.packages(choose.files('',filters=Filters[c('zip','All'),]), > .libPaths()[1], CRAN = NULL) > Error in file(file, "r") : unable to open connection > In addition: Warning messages: > 1: error 1 in extracting from zip file > 2: cannot open file `RdbiPgSQL/DESCRIPTION' > Do you have Postgres properly installed on your computer? If not, why would you expect this package to install or work? I don't think that blindly trying to install every package, regardless of whether you want or need to use it makes very much sense. As for why these particular ones have failed, one needs to look and see if they downloaded (maybe there was a time out), what the name of the downloaded file is, does it match the name in the command you issued, and then you can look at the zip file and see if it is ok. I suspect that there is a failure somewhere in one of those three places (but I could be wrong). > This installation process is quite frustrating since one selects all > the > packages to install and then it fails. This must be repeated every > time > for a failure. So far I've seen at least 9 failures. What wrong with > these DESCRIPTION directories? > You have chosen to install the development version of these packages, that means that you are working with unstable software. If you want stable software you can use the release version, or you can pay someone (like Insightful for that). But as I noted above, I suspect that the problem lies with the downloaded files, and that is the first place to start looking. > Why isn't the installation of bioconductor packages more reliable? > (I've > seen this same problem several times before). What quality control > measures can be put in place to catch these problems and fix them > automatically? Well, we would like to make it more stable, and accurate and informative user reports help us to do that. You could help, by trying to find out just what the problem is, and no we can't tell what is happening on your computer, so you have to be slightly more descriptive. As for what could make things easier, well, contributing money is one if you send us a lot we can fix it really fast. Otherwise, you can supply informative bug reports that can be used to figure out what has gone wrong. Robert > > efg > -- > Earl F. Glynn > Scientific Programmer > Bioinformatics Department > Stowers Institute for Medical Research > > > [[alternative HTML version deleted]] > > _______________________________________________ > Bioconductor mailing list > Bioconductor@stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor > > +--------------------------------------------------------------------- -- ----+ | Robert Gentleman phone: (206) 667-7700 | | Head, Program in Computational Biology fax: | | Division of Public Health Sciences office: M2-B865 | | Fred Hutchinson Cancer Research Center email: rgentlem@fhcrc.org | +--------------------------------------------------------------------- -- ----+
ADD COMMENT
0
Entering edit mode
Glynn, Earl ▴ 170
@glynn-earl-952
Last seen 10.2 years ago
I'm using Windows 2000 on one machine and Windows XP on another machine. The XP machine is not attached to the network. I did a clean install of the new R 2.0.1 on the network machine. I downloaded the files once, burned a CD and used them to install R and Bioconductor on the XP PC not connected to the network. This went flawlessly for R, but not for Bioconductor. > But you have not shown us what commands you are using? But I believe I have. My description said I was using the Windows interface (from the Rgui.exe program from R 2.0.1) and I selected: R: Packages | Install package(s) from CRAN ... | One selects all the packages by clicking and dragging over the desired files. One then presses OK using the Rgui.exe interface. R installed everything and the complete installation worked fine as usual. My experience with Bioconductor installation is that there are usually several failures installing everything without messages that explain why. > You can (and possibly should) use the reposTools library > and the tools > that we have built for dealing with the package dependencies > that exist > in and between many Bioconductor packages. I understand about dependencies trees, but shouldn't that be a minimal consideration if I'm installing everything? From an end-user perspective, it's about the same amount of work to install everything as it is to install a few packages. Why not install all packages rather than spending multiple times installing just a few? I don't plan to re-download files, and possibly burn a CD every time I need to add R/Bioconductor to another PC. > Did you look to see what the problem is? Try just > installing a single > package, giving the correct path. These are just zip files open them > and see if they are complete (it isn't very hard). > Do you have Gtk properly installed? If not why would you want > a package > that relies on it? I was originally not trying to install just a single package. I'm was trying to install all the Bioconductor packages in one shot. It worked for R, I assumed that I could install all of the Bioconductor packages too -- perhaps a flawed assumption. > Do you have Postgres properly installed on your computer? > If not, why > would you expect this package to install or work? I do not have Postgres installed, and I understand I cannot execute the functions in that package. But I also can't see any documentation unless I install it. Why not install everything just to make documentation available? >I don't think that > blindly trying to install every package, regardless of > whether you want > or need to use it makes very much sense. I don't understand. All the R packages downloaded in ZIP format take ~169 MB. All The Bioconductor packages take ~133 MB. This is not very much disk space these days. We can download them once at our facility and then install them on multiple PCs (using the network or a CD), which is faster than redownloading them every time. > You have chosen to install the development version of > these packages, > that means that you are working with unstable software. All I did was pick the "defaults" from the Rgui.exe user interface under Windows and select all packages. I'm not sure why this would be requesting the development version, or an unstable set. You're blaming the messenger here. I'm using default settings inside the latest version of R. > Well, we would like to make it more stable, and accurate and > informative user reports help us to do that. You could help, > by trying > to find out just what the problem is OK, I did this to try to help (if someone wants the whole console file, I'd be happy to E-mail it): Six of the ZIP files are only 2 KB and were probably not correctly made or transferred: geneplotter_1.5.0.zip, GOstats_1.1.0.zip, iSPlot_1.0.3.zip, RdbPgSQL_1.0.9.zip, Rgraphviz_1.5.0.zip, twilight_1.0.1.zip. All of these failed to install and stopped the batch installation until they were avoided. When I tried another install of a single package that failed, e.g., the "twilight" package: > local({a<- CRAN.packages(CRAN=getOption("BIOC")) + install.packages(select.list(a[,1],,TRUE), .libPaths()[1], available=a, CRAN=getOption ("BIOC"), dependencies=TRUE)}) trying URL `http://www.bioconductor.org/bin/windows/contrib/2.0/PACKAGES' Content type `text/plain' length 46409 bytes opened URL downloaded 45Kb dependencies 'golubEsets''snow' are not available trying URL `http://www.bioconductor.org/bin/windows/contrib/2.0/twilight_1.0.1.zi p' Content type `text/html' length 1119 bytes opened URL downloaded 1119 bytes Error in file(file, "r") : unable to open connection In addition: Warning messages: 1: error 1 in extracting from zip file 2: cannot open file `twilight/DESCRIPTION' So there does seem to be a dependency problem. Windows XP says Affycomp_1.4.3.zip is 137 KB and is "invalid or corrupted." When I tried to download this one again, the length is being reported as 72,015,716, so perhaps there was some sort of transfer error on this one. But the second transfer failed too: trying URL `http://www.bioconductor.org/bin/windows/contrib/2.0/affycomp_1.4.3.zi p' Content type `application/zip' length 72015716 bytes opened URL downloaded 137Kb Error in file(file, "r") : unable to open connection In addition: Warning messages: 1: downloaded length 140288 != reported length 72015716 2: error 1 in extracting from zip file 3: cannot open file `affycomp/DESCRIPTION' But were these the same messages I saw when I tried to install everything? To find out, I did yet another R 2.0.1 install (using the local ZIPs for R 2.0.1) but installed all the Bioconductor libraries using the Rgui.EXE interface: Packages | Install package(s) from Bioconductor ... |
ADD COMMENT

Login before adding your answer.

Traffic: 594 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6