Hello,
I am currently following this protocol:
https://www.bioconductor.org/packages//2.7/bioc/vignettes/oligo/inst/doc/V5ExonGene.pdf
to download Microarray data from GEO.
Firstly, when I try the command for rma analysis, I see this error.
> genePS <- rma(affyGeneFS, target="probeset")
Error in .local(object, ...) : unused argument (target = "probeset")
I then specified no target and it seemed to work.
> genePS <- rma(affyGeneFS)
Background correcting
Normalizing
Calculating Expression
However, in the next step, to obtain featureData, I see yet another error.
> featureData(genePS) = getNetAffx(genePS)
Error in getNetAffx(genePS) :
NetAffx Annotation not available in 'pd.mouse430.2'. Consider using 'biomaRt'.
I have 2 questions.
1) Why did specifying no target work?
2) How can I substitute the getNetAffx command with BiomaRt in order to obtain the featureData?
Thank you!
This was extremely informative and makes my understanding a lot clearer. I used the code you provided and it worked perfectly. Thanks so much, appreciate it James!
Hi James,
If I could just extend this question- I have another dataset that contains microarray data and the GEO platform mentioned is [MoGene-1_0-st] Affymetrix Mouse Gene 1.0 ST Array [transcript (gene) version]. I am not sure if that was useful information but I used the before mentioned protocol: https://www.bioconductor.org/packages//2.7/bioc/vignettes/oligo/inst/doc/V5ExonGene.pdf to download the microarray data and pdata(featureData(geneCore)) does not seem to have any gene names. In the end, I get gene IDs such as 10338001, 10338006 etc as my Limma DEG analysis output. Is there a way for me to get gene names/symbols? Do I have to use BioMart for this?
Thanks in advance.
The same recommendation applies, only now the annotation package is the
mogene10sttranscriptcluster.db
package. Alternatively, since this is a Gene ST arrray, you could use the in-built Affy dataOR
And do note that there are help pages for functions. ?annotateEset should be instructive, and if not, please tell me why so I can improve it.
Hi James,
Thank you for your reply. I went through ?annotateEset and have a few questions that I hope are not too basic.
1) Firstly, what would be the difference between mogene10strranscriptcluster.db and pd.mogebe.1.0.st.v1? And how would I know to look for these packages within the library "affycoretools"? Would this be based on the package written in the GEO link from where I get the dataset?
2) Before running your code, pData(featureData(geneCore) returned several columns such as probe_set_id, gene_assignment, pathways etc. After running the code above through, pData(featureData(geneCore) only shows 4 columns = probe_id, entrez_id, symbol and gene_name. I was wondering if it is possible to retain the original information provided in the featureData of geneCore as well?
3) After running the above code, some of the gene probe IDs seemed to be mapped to gene_names which display "NA". Is this to be expected?
4) Also, can BioMart be used to map the gene probe IDs to gene names? I know BioMart is related to Ensembl so I was wondering if Ensemble contained Affy information.
I apologize for the number of questions asked but I am new to this and would very much appreciate your advice!
Thank you.
rma
. It also happens to contain some annotation data, whichannotateEset
will extract and process. A further alternative is to note that GEO will automatically populate your featureData slot, and you could use those data as well. But do note that what GEO puts in there is pretty messy, and you have to parse out the useful bits yourself. A workflow showing all the ins and outs of this might be useful.That makes it very clear, thank you! appreciate it.