Using GEOquery, I'm not getting any features and a lot of the
methods don't work. What am I doing wrong?
> gse <- getGEO("GSE48206")
ftp://ftp.ncbi.nlm.nih.gov/geo/series/GSE48nnn/GSE48206/matrix/
Found 1 file(s)
GSE48206_series_matrix.txt.gz
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 2325 100 2325 0 0 2660 0 --:--:-- --:--:-- --:--:-- 2663
Using locally cached version of GPL16417 found here:
/tmp/Rtmp49zUTd/GPL16417.soft
> show(gse)
$GSE48206_series_matrix.txt.gz
ExpressionSet (storageMode: lockedEnvironment)
assayData: 0 features, 6 samples
element names: exprs
protocolData: none
phenoData
sampleNames: GSM1172734 GSM1172735 ... GSM1172739 (6 total)
varLabels: title geo_accession ... supplementary_file_2 (45 total)
varMetadata: labelDescription
featureData: none
experimentData: use 'experimentData(object)'
Annotation: GPL16417
> head(Meta(gse))
Error in (function (classes, fdef, mtable) :
unable to find an inherited method for function ‘Meta’ for signature ‘"list"’
> names(GSMList(gse))
Error in (function (classes, fdef, mtable) :
unable to find an inherited method for function ‘GSMList’ for
signature ‘"list"’
How do I tell from the GEO website whether count data is directly available through GEOquery?
Ah, good question. It does actually look like there is a series matrix file available. But if you click on the link, you can see on the FTP site that it is only 3 kb. But in general, if there is a link to the SRA, then you know it's RNA-Seq data, and submitters can either upload a series matrix of counts/TPM/RPKM/whatever, or they can upload individual files. At that point you have to look closer to see what they uploaded, and in what format.
Thank you for the thought. Would you please explain me more how to do it in Rsudio? Specifically for my data (GSE81540).
Appreciate your help.