SCAN-normalized values on log2 scale?
1
0
Entering edit mode
Jenny Drnevich ★ 2.0k
@jenny-drnevich-2812
Last seen 5 months ago
United States

Hello,

I just recently used the SCAN.UPC package because I need to normalize some data from Affymetrix's Equine Gene 1.0 ST Arrays using custom CDF from Michigan's BrainArray, and this seemed the only package that I could use. Very nice package, pretty good documentation and easy to use! However, I couldn't find it explicitly stated anywhere in the vignette, the original paper, the SCAN help page nor searching on the support site what scale the resulting normalized values are on. From their distribution, I'm assuming it's a log scale, but since the range of values for my samples is only from -1.8 to 5.2, I assume it's not the standard log2? The equations in the paper use log() in several places, so it's on the natural log scale? Either way, please add the information to the documentation. 

Thanks,

Jenny

scan.upc • 1.9k views
ADD COMMENT
0
Entering edit mode

A slightly orthogonal point; Benilton added functionality to the pdInfoBuilder package intended primarily to accommodate the MBNI re-mapped probeset definitions. There isn't yet a vignette for this, but I will probably add one in devel soon. In the meantime, a short version goes like this.

Download the 'Zip of CDF, Seq, Map, Desc' file from MBNI (The last column here, if you are using e.g., Entrez Gene mappings). I will illustrate with Equus, since that is what you are using. Unzip the file somewhere, then start R in that directory.

## setup
library(pdInfoBuilder)
library(foreach)
library(doMC) ## if you have multiple cores
registerDoMC(4) ## or more, depending on how cool you are

## make package
unzip("equgene10st_EQca_ENTREZG_19.0.0.zip")
## this step uses multiple cores if you have them
tab <- cdf2table("equgene10st_EQca_ENSG.cdf")
## probably you want to use reasonable email and author names, unlike me...
pd <- new("GenericPDInfoPkgSeed", table = tab, pkgName = "pd.equgeneMBNI.1.0.st", version = "0.0.1", email = "me@mine.com", author = "me")
makePdInfoPackage(pd)

## install and use
install.packages("pd.equgeneMBNI.1.0.st/", repos = NULL)
library(oligo)
## you have to use the pkgname argument here!
dat <- read.celfiles(list.celfiles(), pkgname = "pd.equgeneMBNI.1.0.st")
eset <- rma(dat)

 

ADD REPLY
0
Entering edit mode
Thanks Jim! I'll run it tomorrow when I get back to work... Jenny
ADD REPLY
0
Entering edit mode
@stephen-piccolo-6761
Last seen 4.2 years ago
United States

Jenny,

Thanks for the helpful feedback. SCAN values are on a log2 scale and are centered at zero (more here: https://support.bioconductor.org/p/48544/). I just updated the documentation in the actual package to mention this.

Regards,

-Steve

ADD COMMENT
0
Entering edit mode

Thank you! BTW - your link above accidentally has the ending ) in the link, so if you click on it you get "404 Not Found" The link for others in the future is Expression Values for SCAN.UPC? And I'm getting a bit embarrassed as this is the 3rd time in a row I've failed to find a relevant post on the support site! This time it was because of the difference between the spellings of normalization and normalisation!! I should seek out training in google-fu... 

ADD REPLY

Login before adding your answer.

Traffic: 608 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6