Entering edit mode
pbachali
▴
50
@pbachali-9651
Last seen 4.8 years ago
Hi, I have a dataset from mogene20st platform. I am able to use the oligo package and read the celfiles and get the annotations. I am unable to use the brainarray CDF to read the celfiles. I realized that it's not added in pdInfobuilder. I referred to other posts where the cdf has been Pdinfobuilder. https://support.bioconductor.org/p/72378/
I followed the same steps and created z and seed. I am not able to install it. This is the error that R is throwing me.
> install.packages("pd.mogene21stmmense/", type="source",repos = NULL)
* installing *source* package ‘pd.mogene21stmmense’ ...
** R
** byte-compile and prepare package for lazy loading
Warning: package ‘Biostrings’ was built under R version 3.5.2
Error in eval(exprs[i], envir) : Unable to locate DB file
Error : unable to load R code in package ‘pd.mogene21stmmense’
ERROR: lazy loading failed for package ‘pd.mogene21stmmense’
* removing ‘/Library/Frameworks/R.framework/Versions/3.5/Resources/library/pd.mogene21stmmense’
Warning in install.packages :
installation of package ‘pd.mogene21stmmense/’ had non-zero exit status
Here is my code.
library(pdInfoBuilder)
library(ff)
library(doMC)
registerDoMC(10)
download.file("http://mbni.org/customcdf/19.0.0/ense.download/mogene21st_Mm_ENSE_19.0.0.zip","tmp.zip")
unzip("tmp.zip")
dir()
z <- cdf2table("mogene21st_Mm_ENSE.cdf")
seed <- new("GenericPDInfoPkgSeed", table=z, author = "me", email = "prathyusha.bachali@ampelbiosolutions.com", species = "Mus musculus", pkgName = "pd.mogene21stmmense")
makePdInfoPackage(seed)
install.packages("pd.mogene21stmmense/", type="source",repos = NULL)
Thanks in advance
Thanks a lot. I really appreciate it. I am trying to get the gene symbols, entrezgene for the features from the pd.mogene21st.mm.ense. I am trying to use biomrt but not able to get any results. library(annotate) ID <- featureNames(esetba) Symbol <- getSYMBOL(ID,"mogene21stmmentrezg.db") fData(esetba) <- data.frame(Symbol=Symbol)
But this approach gave me 0 gene symbols.
Your first post no longer makes sense to me. Do you want the mapping to be based on Ensembl (hence ense in the name) or NCBI (hence entrezg in the name)? If you want NCBI IDs, then you need to install from 'http://mbni.org/customcdf/23.0.0/entrezg.download/pd.mogene21st.mm.entrezg_23.0.0.tar.gz'.
I also pointed you to the wrong thing (probably). The mapping I used was 'ense' which is for Ensembl exon mappings, whereas you probably want the gene level mappings.
AND, the
featureNames
for any of the MBNI data are simply the underlying ID with an '_at' pasted on the end. So for example, the IDs for the ense package I installed yesterday areWhere the manfsetid column has the probeset (or
featureName
) identifiers. There are annotation packages for the NCBI remapped arrays (e.g., http://mbni.org/customcdf/23.0.0/entrezg.download/mogene21stmmentrezg.db23.0.0.tar.gz), but those are sort of superfluous, as you can just as easily use the org.Hs.eg.db package after stripping off the '_at' from the probeset IDs.Also
getSymbol
is old tech. You should be using eitherselect
ormapIds
. And if you do use the annotation packages from MBNI, you can also useannotateEset
from my affycoretools package to do the annotation.