crlmm with Illumina arrays

0

Entering edit mode

Lavinia Gordon ▴ 480

@lavinia-gordon-2959

Last seen 10.6 years ago

To authors/users of crlmm (and probably VanillaICE) I am using crlmm with Illumina arrays (human610quadv1b) I have run into a few problems and would appreciate any tips on how to troubleshoot. The issues may be related to the quality of the arrays, however they have all run through the standard Illumina software without problems. I am following a (slightly) modified version of the script illumina_copynumber.Rnw, finishing with: crlmmWrapper(sampleSheet=samplesheet, arrayNames=arrayNames, arrayInfoColNames=list(barcode="SentrixBarcode_A", position="SentrixPosition_A"), saveDate=TRUE, cdfName=cdfName, load.it=FALSE, save.it=FALSE, intensityFile=file.path(outdir, "normalizedIntensities.rda"), crlmmFile=file.path(outdir, "snpsetObject.rda"), rgFile=file.path(outdir, "rgFile.rda")) If I run: update(filename, adj.bias=TRUE) I get this error: Processing OUTDIR/crlmmSetList_1.rda ... ---------------------------------------------------------------------- ------ - Estimating copy number for chromosome 1 ---------------------------------------------------------------------- ------ Error in computeCopynumber(object, CHR = CHR, ...) : formal argument "CHR" matched by multiple actual arguments (not just for chromosome 1) So instead I have been using 'computeCopyNumber': crlmmSetList <- computeCopynumber(crlmmSetList, CHR, bias.adj=TRUE, SNRmin=5, cdfName, batch=scanbatch) save(crlmmSetList, file=file.path(outdir, paste("crlmmSetList_", CHR, ".rda", sep=""))) This runs for all bar chromosome 23 and 24 (which is probably due to the fact that these .rda files are tiny, ~1.9kb) [1] "OUTDIR/crlmmSetList_23.rda" Fitting model for copy number estimation... Using 50 df for inverse chi squares. Estimating gender Error in kmeans(XMedian, c(min(XMedian[SNR > SNRmin]), max(XMedian[SNR > : initial centers are not distinct > traceback() 5: stop("initial centers are not distinct") 4: kmeans(XMedian, c(min(XMedian[SNR > SNRmin]), max(XMedian[SNR > SNRmin]))) 3: instantiateObjects(calls = calls, conf = conf, NP = NP, plate = plate, envir = envir, chrom = chrom, A = A, B = B, gender = gender, SNR = SNR, SNRmin = SNRmin, pkgname = cdfName) 2: .computeCopynumber(chrom = CHR, A = A(ABset), B = B(ABset), calls = calls(snpset), conf = confs(snpset), NP = A(NPset), plate = batch, envir = envir, SNR = ABset$SNR, bias.adj = FALSE, SNRmin = SNRmin, cdfName = cdfName, ...) 1: computeCopynumber(crlmmSetList, CHR, bias.adj = TRUE, SNRmin = 5, cdfName, batch = scanbatch) Then following a slightly modified version of the script copynumber.Rnw, beginning from the section: CHR <- 2 if(!exists("crlmmSetList")) load(file.path(outdir, paste("crlmmSetList_", CHR, ".rda", sep=""))) show(crlmmSetList) .... to if(!exists("hmmPredictions")){ + hmmPredictions <- viterbi(emission=emission.cn, + initialStateProbs=log(initialP), + tau=tau[, "transitionPr"], + arm=tau[, "arm"], + normalIndex=3, + normal2altered=0.1, + altered2normal=1, + altered2altered=0.01) + } I get: Error: NA/NaN/Inf in foreign function call (arg 1) > summaryis.naemission.cn)) Mode FALSE TRUE NA's logical 8806640 3473576 0 > head(tau) chromosome position arm transitionPr [1,] 2 8856 0 0.9998882 [2,] 2 14445 0 0.9998708 [3,] 2 20906 0 0.9999908 [4,] 2 21366 0 0.9999915 [5,] 2 21791 0 0.9999756 [6,] 2 23012 0 0.9999245 > traceback() 2: .C("viterbi", tmp[[1]], tmp[[2]], tmp[[3]], tmp[[4]], tmp[[5]], tmp[[6]], tmp[[7]], tmp[[8]], tmp[[9]], tmp[[10]], tmp[[11]], tmp[[12]], tmp[[13]]) 1: viterbi(emission = emission.cn, initialStateProbs = log(initialP), tau = tau[, "transitionPr"], arm = tau[, "arm"], normalIndex = 3, normal2altered = 0.1, altered2normal = 1, altered2altered = 0.01) > sessionInfo() R version 2.10.1 (2009-12-14) x86_64-unknown-linux-gnu locale: [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C [3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8 [5] LC_MONETARY=C LC_MESSAGES=en_US.UTF-8 [7] LC_PAPER=en_US.UTF-8 LC_NAME=C [9] LC_ADDRESS=C LC_TELEPHONE=C [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C attached base packages: [1] stats graphics grDevices utils datasets methods base other attached packages: [1] human610quadv1bCrlmm_1.0.1 RColorBrewer_1.0-2 [3] SNPchip_1.10.0 oligoClasses_1.8.0 [5] VanillaICE_1.8.0 crlmm_1.4.1 [7] Biobase_2.6.1 loaded via a namespace (and not attached): [1] affyio_1.14.0 annotate_1.24.1 AnnotationDbi_1.8.1 [4] Biostrings_2.14.10 DBI_0.2-5 ellipse_0.3-5 [7] genefilter_1.28.2 IRanges_1.4.9 mvtnorm_0.9-8 [10] preprocessCore_1.8.0 RSQLite_0.8-1 splines_2.10.1 [13] survival_2.35-7 tools_2.10.1 xtable_1.5-6 > Any advice greatly appreciated, with regards Lavinia Gordon Bioinformatics officer Murdoch Childrens Research Institute The Royal Children s Hospital, Flemington Road, Parkville, Victoria 3052, Australia

crlmm crlmm • 1.3k views

ADD COMMENT • link updated 15.2 years ago by Benilton Carvalho ★ 4.3k • written 15.2 years ago by Lavinia Gordon ▴ 480

0

Entering edit mode

Benilton Carvalho ★ 4.3k

@benilton-carvalho-1375

Last seen 5.1 years ago

Brazil/Campinas/UNICAMP

Hi Lavinia, how many samples are you using in this particular example? thanks, benilton On Wed, Feb 17, 2010 at 11:22 PM, Lavinia Gordon <lavinia.gordon at="" mcri.edu.au=""> wrote: > To authors/users of crlmm (and probably VanillaICE) > > I am using crlmm with Illumina arrays (human610quadv1b) > I have run into a few problems and would appreciate any tips on how to > troubleshoot. > The issues may be related to the quality of the arrays, however they have > all run through the standard Illumina software without problems. > > I am following a (slightly) modified version of the script > illumina_copynumber.Rnw, > finishing with: > crlmmWrapper(sampleSheet=samplesheet, > ? ? ? ? arrayNames=arrayNames, > ? ? ? ? arrayInfoColNames=list(barcode="SentrixBarcode_A", > position="SentrixPosition_A"), > ? ? ? ? saveDate=TRUE, > ? ? ? ? cdfName=cdfName, > ? ? ? ? load.it=FALSE, > ? ? ? ? save.it=FALSE, > ? ? ? ? intensityFile=file.path(outdir, "normalizedIntensities.rda"), > ? ? ? ? crlmmFile=file.path(outdir, "snpsetObject.rda"), > ? ? ? ? rgFile=file.path(outdir, "rgFile.rda")) > > If I run: > update(filename, adj.bias=TRUE) > I get this error: > Processing ?OUTDIR/crlmmSetList_1.rda ... > -------------------------------------------------------------------- -------- > - ? ? ? ?Estimating copy number for chromosome 1 > -------------------------------------------------------------------- -------- > Error in computeCopynumber(object, CHR = CHR, ...) : > ?formal argument "CHR" matched by multiple actual arguments > (not just for chromosome 1) > > So instead I have been using 'computeCopyNumber': > crlmmSetList <- computeCopynumber(crlmmSetList, CHR, bias.adj=TRUE, > SNRmin=5, cdfName, batch=scanbatch) > save(crlmmSetList, file=file.path(outdir, paste("crlmmSetList_", CHR, > ".rda", sep=""))) > This runs for all bar chromosome 23 and 24 (which is probably due to the > fact that these .rda files are tiny, ~1.9kb) > [1] "OUTDIR/crlmmSetList_23.rda" > Fitting model for copy number estimation... > Using 50 df for inverse chi squares. > Estimating gender > Error in kmeans(XMedian, c(min(XMedian[SNR > SNRmin]), max(XMedian[SNR > ?: > ?initial centers are not distinct >> traceback() > 5: stop("initial centers are not distinct") > 4: kmeans(XMedian, c(min(XMedian[SNR > SNRmin]), max(XMedian[SNR > > ? ? ? SNRmin]))) > 3: instantiateObjects(calls = calls, conf = conf, NP = NP, plate = plate, > ? ? ? envir = envir, chrom = chrom, A = A, B = B, gender = gender, > ? ? ? SNR = SNR, SNRmin = SNRmin, pkgname = cdfName) > 2: .computeCopynumber(chrom = CHR, A = A(ABset), B = B(ABset), calls = > calls(snpset), > ? ? ? conf = confs(snpset), NP = A(NPset), plate = batch, envir = envir, > ? ? ? SNR = ABset$SNR, bias.adj = FALSE, SNRmin = SNRmin, cdfName = cdfName, > ? ? ? ...) > 1: computeCopynumber(crlmmSetList, CHR, bias.adj = TRUE, SNRmin = 5, > ? ? ? cdfName, batch = scanbatch) > > Then following a slightly modified version of the script copynumber.Rnw, > beginning from the section: > CHR <- 2 > if(!exists("crlmmSetList")) load(file.path(outdir, paste("crlmmSetList_", > CHR, ".rda", sep=""))) > show(crlmmSetList) > .... > to > ?if(!exists("hmmPredictions")){ > + hmmPredictions <- viterbi(emission=emission.cn, > + ? initialStateProbs=log(initialP), > + ? tau=tau[, "transitionPr"], > + ? arm=tau[, "arm"], > + ? normalIndex=3, > + ? normal2altered=0.1, > + ? altered2normal=1, > + ? altered2altered=0.01) > + } > > I get: > > Error: NA/NaN/Inf in foreign function call (arg 1) >> summaryis.naemission.cn)) > ? Mode ? FALSE ? ?TRUE ? ?NA's > logical 8806640 3473576 ? ? ? 0 >> head(tau) > ? ? chromosome position arm transitionPr > [1,] ? ? ? ? ?2 ? ? 8856 ? 0 ? ?0.9998882 > [2,] ? ? ? ? ?2 ? ?14445 ? 0 ? ?0.9998708 > [3,] ? ? ? ? ?2 ? ?20906 ? 0 ? ?0.9999908 > [4,] ? ? ? ? ?2 ? ?21366 ? 0 ? ?0.9999915 > [5,] ? ? ? ? ?2 ? ?21791 ? 0 ? ?0.9999756 > [6,] ? ? ? ? ?2 ? ?23012 ? 0 ? ?0.9999245 >> traceback() > 2: .C("viterbi", tmp[[1]], tmp[[2]], tmp[[3]], tmp[[4]], tmp[[5]], > ? ? ? tmp[[6]], tmp[[7]], tmp[[8]], tmp[[9]], tmp[[10]], tmp[[11]], > ? ? ? tmp[[12]], tmp[[13]]) > 1: viterbi(emission = emission.cn, initialStateProbs = log(initialP), > ? ? ? tau = tau[, "transitionPr"], arm = tau[, "arm"], normalIndex = 3, > ? ? ? normal2altered = 0.1, altered2normal = 1, altered2altered = 0.01) >> sessionInfo() > R version 2.10.1 (2009-12-14) > x86_64-unknown-linux-gnu > > locale: > ?[1] LC_CTYPE=en_US.UTF-8 ? ? ? LC_NUMERIC=C > ?[3] LC_TIME=en_US.UTF-8 ? ? ? ?LC_COLLATE=en_US.UTF-8 > ?[5] LC_MONETARY=C ? ? ? ? ? ? ?LC_MESSAGES=en_US.UTF-8 > ?[7] LC_PAPER=en_US.UTF-8 ? ? ? LC_NAME=C > ?[9] LC_ADDRESS=C ? ? ? ? ? ? ? LC_TELEPHONE=C > [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C > > attached base packages: > [1] stats ? ? graphics ?grDevices utils ? ? datasets ?methods ? base > > other attached packages: > [1] human610quadv1bCrlmm_1.0.1 RColorBrewer_1.0-2 > [3] SNPchip_1.10.0 ? ? ? ? ? ? oligoClasses_1.8.0 > [5] VanillaICE_1.8.0 ? ? ? ? ? crlmm_1.4.1 > [7] Biobase_2.6.1 > > loaded via a namespace (and not attached): > ?[1] affyio_1.14.0 ? ? ? ?annotate_1.24.1 ? ? ?AnnotationDbi_1.8.1 > ?[4] Biostrings_2.14.10 ? DBI_0.2-5 ? ? ? ? ? ?ellipse_0.3-5 > ?[7] genefilter_1.28.2 ? ?IRanges_1.4.9 ? ? ? ?mvtnorm_0.9-8 > [10] preprocessCore_1.8.0 RSQLite_0.8-1 ? ? ? ?splines_2.10.1 > [13] survival_2.35-7 ? ? ?tools_2.10.1 ? ? ? ? xtable_1.5-6 >> > > Any advice greatly appreciated, > with regards > > Lavinia Gordon > Bioinformatics officer > Murdoch Childrens Research Institute > The Royal Children s Hospital, Flemington Road, Parkville, Victoria 3052, > Australia > > _______________________________________________ > Bioconductor mailing list > Bioconductor at stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: > http://news.gmane.org/gmane.science.biology.informatics.conductor >

ADD COMMENT • link 15.2 years ago Benilton Carvalho ★ 4.3k

Login before adding your answer.