Dear Bioinformaticists,
I'm new to Bioconductor and having issue dowloading some raw data by getGEOSuppFiles(). I stuck exactly in the step of reading in celfiles with read.celfiles().
** This is my code below :
getGEOSuppFiles( "GSE68849")
untar("GSE68849/GSE68849_RAW.tar", exdir = "GSE68849/CEL") list.files("GSE68849/CEL")
celf <- list.files("GSE68849/CEL", full = T)
read.celfiles(celf)
** It outputs the following error :
Error in read.celfile.header(x) : Is GSE68849/CEL/GPL10558_HumanHT-12_V4_0_R1_15002873_B.txt.gz really a CEL file? tried reading as text, gzipped text, binary, gzipped binary, command console and gzipped command console formats
Any help would be much appreciated !
sessionInfo( )
R version 4.1.3 (2022-03-10) Platform: x86_64-w64-mingw32/x64 (64-bit) Running under: Windows 10 x64 (build 19041)
Matrix products: default
locale:
[1] LC_COLLATE=French_France.1252 LC_CTYPE=French_France.1252
[3] LC_MONETARY=French_France.1252 LC_NUMERIC=C
[5] LC_TIME=French_France.1252
attached base packages:
[1] stats graphics grDevices utils datasets methods
[7] base
other attached packages: [1] GEOquery_2.62.2 Biobase_2.54.0 BiocGenerics_0.40.0
loaded via a namespace (and not attached):
[1] compiler_4.1.3 pillar_1.7.0 BiocManager_1.30.18
[4] prettyunits_1.1.1 remotes_2.4.2 tools_4.1.3
[7] testthat_3.1.4 pkgbuild_1.3.1 pkgload_1.2.4
[10] memoise_2.0.1 lifecycle_1.0.1 tibble_3.1.7
[13] pkgconfig_2.0.3 rlang_1.0.2 DBI_1.1.2
[16] cli_3.3.0 curl_4.3.2 fastmap_1.1.0
[19] xml2_1.3.3 withr_2.5.0 dplyr_1.0.9
[22] hms_1.1.1 desc_1.4.1 generics_0.1.2
[25] fs_1.5.2 vctrs_0.4.1 devtools_2.4.3
[28] tidyselect_1.1.2 rprojroot_2.0.3 glue_1.6.2
[31] data.table_1.14.2 R6_2.5.1 processx_3.5.3
[34] fansi_1.0.3 sessioninfo_1.2.2 limma_3.50.3
[37] tidyr_1.2.0 tzdb_0.3.0 readr_2.1.2
[40] callr_3.7.0 purrr_0.3.4 magrittr_2.0.3
[43] ps_1.7.0 ellipsis_0.3.2 usethis_2.1.6
[46] assertthat_0.2.1 utf8_1.2.2 cachem_1.0.6
[49] crayon_1.5.1 brio_1.1.3
If you are new to Bioconductor, you should probably just use
getGEO
to read in the data, and then use thelimma
package to process and compare the samples. See the limma User's Guide for more information.Thank you for replying Sir !
I've already used getGEO and done comparing the samples by limma package.
Now if possible I want to download the raw data by getsuppfiles () in order to processing it and comparing the samples again so that finally I can compare the results of the data downloaded by getGeo() and the one downloaded by getsuppfiles ().
This is my idea. In fact I'm now wondering if there is any way to get the raw data from an Illumina array study ?
It works wonders. Thank you again sir !