I've just started with the oposSOM package to identify key genes in an array study I'm analyzing for a collaborator, and I have two questions about the package. Below is my R code and the error message, along with everything oposSOM reported during its workflow. "Eset" is an expression set generated using the oligo package.
> eset.matrix <- as.matrix(eset)
> env <- opossom.new(list(dataset.name="All",
+ error.model="all.samples",
+ dim.1stLvlSom=20,
+ dim.2ndLvlSom=20,
+ geneset.analysis=TRUE,
+ spot.coresize.groupmap=5,
+ spot.threshold.groupmap=0.75,
+ pairwise.comparison.list=list(
+ list(c("1.CEL", "2.CEL", "3.CEL", "4.CEL", "5.CEL", "6.CEL", "7.CEL", "8.CEL", "9.CEL", "11.CEL", "12.CEL", "13.CEL", "14.CEL", "16.CEL", "17.CEL", "18.CEL", "19.CEL", "21.CEL", "22.CEL", "23.CEL", "24.CEL", "25.CEL", "26.CEL", "28.CEL", "30.CEL", "31.CEL", "32.CEL", "33.CEL", "34.CEL", "35.CEL", "36.CEL", "37.CEL", "38.CEL", "39.CEL", "40.CEL", "41.CEL", "42.CEL", "43.CEL", "44.CEL", "45.CEL"), c("9.CEL", "19.CEL", "33.CEL", "34.CEL", "35.CEL", "45.CEL")),
+ list(c("1.CEL", "2.CEL", "3.CEL", "4.CEL", "5.CEL", "6.CEL", "7.CEL", "8.CEL", "9.CEL", "11.CEL", "12.CEL", "13.CEL", "14.CEL", "16.CEL", "17.CEL", "18.CEL", "19.CEL", "21.CEL", "22.CEL", "23.CEL", "24.CEL", "25.CEL", "26.CEL", "28.CEL", "30.CEL", "31.CEL", "32.CEL", "33.CEL", "34.CEL", "35.CEL", "36.CEL", "37.CEL", "38.CEL", "39.CEL", "40.CEL", "41.CEL", "42.CEL", "43.CEL", "44.CEL", "45.CEL"), c("31.CEL", "32.CEL", "33.CEL", "34.CEL", "37.CEL")))))
>
> env$indata <- eset.matrix
> env$group.labels <- c(rep("A", 6), rep("B", 40), rep("C", 5))
> env$group.colors <- c(rep("black", 6), rep("red", 40), rep("green", 5))
> opossom.run(env)
[2015-06-25 17:25:16][INFO] Started: Thu 25 Jun 2015 17:25:16
[2015-06-25 17:25:16][INFO] Setting: All
[2015-06-25 17:25:16][INFO] 1SOM Dim: 20
[2015-06-25 17:25:16][INFO] 2SOM Dim: 20
[2015-06-25 17:25:30][INFO] Load Annotation Data
[2015-06-25 17:25:32][INFO] Autodetecting annotation parameters
[2015-06-25 17:25:32][WARN] Could not find valid annotation parameters.
[2015-06-25 17:25:32][WARN] Disabling geneset analysis.
[2015-06-25 17:25:32][INFO] Processing SOM
[2015-06-25 17:28:48][INFO] Remaining ~ 17 min ~ 0.3 h
[2015-06-25 17:41:03][INFO] Saving environment image: All pre.RData
[2015-06-25 17:41:13][INFO] Processing Differential Expression
[2015-06-25 17:41:13][INFO] Processing Single Genes
[2015-06-25 17:42:12][INFO] [################################################]
[2015-06-25 17:42:12][INFO] Writing: All - Results/LPE/all_samples.bmp
[2015-06-25 17:42:23][INFO] Processing Metagenes
[2015-06-25 17:42:25][INFO] [################################################]
[2015-06-25 17:42:25][INFO] Writing: All - Results/LPE/Sigma_LPE.pdf
[2015-06-25 17:42:26][INFO] Detecting Spots
[2015-06-25 17:42:28][INFO] Plotting Sample Portraits
[2015-06-25 17:42:28][INFO] Writing: All - Results/Expression Portraits.pdf
[2015-06-25 17:42:29][INFO] Writing: All - Results/Expression Portraits alternative.pdf
[2015-06-25 17:42:32][INFO] Writing: All - Results/Rank Maps.pdf
[2015-06-25 17:42:47][INFO] Processing Supporting Information
[2015-06-25 17:42:47][INFO] Writing: All - Results/Supporting Maps&Profiles/Supporting Maps.pdf
[2015-06-25 17:49:41][INFO] Writing: All - Results/Supporting Maps&Profiles/Entropy Profiles.pdf
[2015-06-25 17:49:42][INFO] Writing: All - Results/Supporting Maps&Profiles/Topology Profiles.pdf
[2015-06-25 17:49:43][INFO] Processing 2nd level Metagene Analysis
[2015-06-25 17:49:43][INFO] Writing: All - Results/2nd lvl Metagene Analysis/Similarity Analysis.pdf
[2015-06-25 17:49:47][INFO] Writing: All - Results/2nd lvl Metagene Analysis/Correlation Analysis.pdf
[2015-06-25 17:49:52][INFO] Writing: All - Results/2nd lvl Metagene Analysis/Component Analysis.pdf
[2015-06-25 17:49:59][INFO] Writing: All - Results/2nd lvl Metagene Analysis/2nd lvl SOM.pdf
[2015-06-25 17:50:00][INFO] Processing Gene Lists
[2015-06-25 17:50:00][INFO] Writing: All - Results/CSV Sheets/Gene Lists - Global/*.csv
[2015-06-25 17:52:27][INFO] Writing: All - Results/CSV Sheets/Gene Lists - Local/*.csv
[2015-06-25 17:52:56][INFO] Processing Summary Sheets (Samples)
[2015-06-25 17:52:58][INFO] Writing: All - Results/Summary Sheets - Samples/*.pdf
[2015-06-25 17:53:42][INFO] Processing Summary Sheets (Spots)
[2015-06-25 17:53:42][INFO] Writing: All - Results/Summary Sheets - Integral/*.pdf
[2015-06-25 17:53:42][INFO] Writing: All - Results/CSV Sheets/Spot Lists/*.csv
[2015-06-25 18:05:46][INFO] Processing 3rd level Spot Analysis
[2015-06-25 18:05:46][INFO] Writing: All - Results/3rd lvl Spot Analysis/Overexpression Chromosomal Eichment.pdf
[2015-06-25 18:05:46][INFO] Writing: All - Results/3rd lvl Spot Analysis/Underexpression Chromosomal Eichment.pdf
[2015-06-25 18:05:46][INFO] Writing: All - Results/3rd lvl Spot Analysis/K-Means Cluster Chromosomal Eichment.pdf
[2015-06-25 18:05:46][INFO] Writing: All - Results/3rd lvl Spot Analysis/Group Overexpression Chromosomal Eichment.pdf
[2015-06-25 18:05:46][INFO] Writing: All - Results/3rd lvl Spot Analysis/Overexpression Spot Report.pdf
[2015-06-25 18:05:47][INFO] Writing: All - Results/3rd lvl Spot Analysis/Underexpression Spot Report.pdf
[2015-06-25 18:05:47][INFO] Writing: All - Results/3rd lvl Spot Analysis/K-Means Cluster Report.pdf
[2015-06-25 18:05:47][INFO] Writing: All - Results/3rd lvl Spot Analysis/Group Overexpression Report.pdf
[2015-06-25 18:05:47][INFO] Writing: All - Results/3rd lvl Spot Analysis/Overexpression Networks.pdf
[2015-06-25 18:05:48][INFO] Writing: All - Results/3rd lvl Spot Analysis/Underexpression Networks.pdf
[2015-06-25 18:05:49][INFO] Writing: All - Results/3rd lvl Spot Analysis/K-Means Cluster Networks.pdf
[2015-06-25 18:05:50][INFO] Writing: All - Results/3rd lvl Spot Analysis/Group Overexpression Networks.pdf
[2015-06-25 18:05:50][INFO] Generating HTML Report
[2015-06-25 18:05:50][INFO] Writing: All - Results/Summary.html
[2015-06-25 18:05:50][INFO] Writing: All - Results/Summary Sheets - Samples/0verview.html
[2015-06-25 18:05:50][INFO] Writing: All - Results/Summary Sheets - Integral/0verview.html
[2015-06-25 18:05:50][INFO] Saving environment image: All.RData
[2015-06-25 18:06:12][INFO] Processing Group-centered Analyses
[2015-06-25 18:06:12][INFO] Writing: All - Results/Summary Sheets - Groups/Expression Portraits Groups.pdf
[2015-06-25 18:06:13][INFO] Writing: All - Results/Summary Sheets - Groups/Group Assignment.pdf
Error in group.correlations[, names(S)] : subscript out of bounds
> traceback()
9: apply(group.correlations[, names(S)], 2, which.max)
8: plot.xy(xy.coords(x, y), type = type, ...)
7: points.default(b, rep(-0.2, ncol(indata)), pch = 15, cex = 1,
col = groupwise.group.colors[apply(group.correlations[, names(S)],
2, which.max)])
6: points(b, rep(-0.2, ncol(indata)), pch = 15, cex = 1, col = groupwise.group.colors[apply(group.correlations[,
names(S)], 2, which.max)])
5: fn()
4: util.call(pipeline.summarySheetsGroups, environment())
3: fn()
2: util.call(pipeline.groupAnalysis, env)
1: opossom.run(env)
First question, what caused this error? Reading the reference guide and Googling didn't turn anything up.
Second question, for the files that were correctly produced prior to the error, there is no gene ID/gene symbol annotated, despite there being columns for each of these in the output files. I assume this is connected to the "[WARN] Could not find valid annotation parameters." note, but as you'll see in my sessionInfo, I do have the annotation packages loaded (Affy Human 2.0 ST annotations). Is there an additional annotation package required by this package?
I've exceeded the character limit, I'll post sessionInfo in a followup. Sorry about that!
Thanks in advance for any insight you might have on either of these questions, it's greatly appreciated.
Alison Ziesel
And here's that SessionInfo.
> sessionInfo()
R version 3.1.3 (2015-03-09)
Platform: x86_64-apple-darwin13.4.0 (64-bit)
Running under: OS X 10.9.5 (Mavericks)
locale:
[1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
attached base packages:
[1] stats4 parallel stats graphics grDevices utils datasets methods base
other attached packages:
[1] oposSOM_1.1.1 BiocInstaller_1.16.2 GOstats_2.32.0
[4] graph_1.44.1 Category_2.32.0 GO.db_3.0.0
[7] Matrix_1.1-5 hugene20sttranscriptcluster.db_8.2.0 org.Hs.eg.db_3.0.0
[10] annotate_1.44.0 XML_3.98-1.1 AnnotationDbi_1.28.2
[13] GenomeInfoDb_1.2.5 limma_3.22.7 pd.hugene.2.0.st_3.10.0
[16] RSQLite_0.11.4 DBI_0.3.1 oligo_1.30.0
[19] Biostrings_2.34.1 XVector_0.6.0 IRanges_2.0.1
[22] S4Vectors_0.4.0 Biobase_2.26.0 oligoClasses_1.28.0
[25] BiocGenerics_0.12.1
loaded via a namespace (and not attached):
[1] affxparser_1.38.0 affyio_1.34.0 AnnotationForge_1.8.2 ape_3.2 biomaRt_2.22.0
[6] bit_1.1-12 codetools_0.2-10 fastICA_1.2-0 fdrtool_1.2.14 ff_2.2-13
[11] foreach_1.4.2 genefilter_1.48.1 GenomicRanges_1.18.4 grid_3.1.3 GSEABase_1.28.0
[16] igraph_0.7.1 iterators_1.0.7 KernSmooth_2.23-14 lattice_0.20-30 nlme_3.1-120
[21] pixmap_0.4-11 preprocessCore_1.28.0 RBGL_1.42.0 RCurl_1.95-4.3 scatterplot3d_0.3-35
[26] som_0.3-5 splines_3.1.3 survival_2.38-1 tools_3.1.3 xtable_1.7-4
[31] zlibbioc_1.12.0